Statlog (German Credit Data)

About

This dataset classifies people described by a set of attributes as good or bad credit risks. Comes in two formats (one all numeric). Also comes with a cost matrix Two datasets are provided. the original dataset, in the form provided by Prof. Hofmann, contains categorical/symbolic attributes and is in the file "german.data". For algorithms that need numerical attributes, Strathclyde University produced the file "german.data-numeric". This file has been edited and several indicator variables added to make it suitable for algorithms which cannot cope with categorical variables. Several attributes that are ordered categorical (such as attribute 17) have been coded as integer. This was the form used by StatLog. This dataset requires use of a cost matrix (see below) ..... 1 2 ---------------------------- 1 0 1 ----------------------- 2 5 0 (1 = Good, 2 = Bad) The rows represent the actual classification and the columns the predicted classification. It is worse to class a customer as good when they are bad (5), than it is to class a customer as bad when they are good (1).

Subject Area

Social Science

Instances

1,000

Features

Data Types

Multivariate

Tasks

Classification

Feature Types

Categorical, Integer

Features

Name	Role	Type	Units	Missing Values
Attribute1	Feature	Categorical	-	No
└ Status of existing checking account
Attribute2	Feature	Integer	months	No
└ Duration
Attribute3	Feature	Categorical	-	No
└ Credit history
Attribute4	Feature	Categorical	-	No
└ Purpose
Attribute5	Feature	Integer	-	No
└ Credit amount
Attribute6	Feature	Categorical	-	No
└ Savings account/bonds
Attribute7	Feature	Categorical	-	No
└ Present employment since
Attribute8	Feature	Integer	-	No
└ Installment rate in percentage of disposable income
Attribute9	Feature	Categorical	-	No
└ Personal status and sex
Attribute10	Feature	Categorical	-	No
└ Other debtors / guarantors
Attribute11	Feature	Integer	-	No
└ Present residence since
Attribute12	Feature	Categorical	-	No
└ Property
Attribute13	Feature	Integer	years	No
└ Age
Attribute14	Feature	Categorical	-	No
└ Other installment plans
Attribute15	Feature	Categorical	-	No
└ Housing
Attribute16	Feature	Integer	-	No
└ Number of existing credits at this bank
Attribute17	Feature	Categorical	-	No
└ Job
Attribute18	Feature	Integer	-	No
└ Number of people being liable to provide maintenance for
Attribute19	Feature	Binary	-	No
└ Telephone
Attribute20	Feature	Binary	-	No
└ foreign worker
class	Target	Binary	-	No
└ 1 = Good, 2 = Bad

Introductory Paper

–

Additional Metadata

Keywords

finance

Authors

Hans Hofmann

Year Created

1994

DOI

10.24432/C5NC77

License

CC BY 4.0

Statlog (German Credit Data)

About

Features

Introductory Paper

Additional Metadata

Authors

Donation Information