Statlog (German Credit Data)
About
This dataset classifies people described by a set of attributes as good or bad credit risks. Comes in two formats (one all numeric). Also comes with a cost matrix
Two datasets are provided. the original dataset, in the form provided by Prof. Hofmann, contains categorical/symbolic attributes and is in the file "german.data".
For algorithms that need numerical attributes, Strathclyde University produced the file "german.data-numeric". This file has been edited and several indicator variables added to make it suitable for algorithms which cannot cope with categorical variables. Several attributes that are ordered categorical (such as attribute 17) have been coded as integer. This was the form used by StatLog.
This dataset requires use of a cost matrix (see below)
..... 1 2
----------------------------
1 0 1
-----------------------
2 5 0
(1 = Good, 2 = Bad)
The rows represent the actual classification and the columns the predicted classification.
It is worse to class a customer as good when they are bad (5), than it is to class a customer as bad when they are good (1).
Subject Area
Social Science
Instances
1,000
Features
20
Data Types
Multivariate
Tasks
Classification
Feature Types
Categorical, Integer
Features
| Name | Role | Type | Units | Missing Values | Description |
|---|---|---|---|---|---|
| Attribute1 | Feature | Categorical | - | No | |
| Attribute2 | Feature | Integer | months | No | |
| Attribute3 | Feature | Categorical | - | No | |
| Attribute4 | Feature | Categorical | - | No | |
| Attribute5 | Feature | Integer | - | No | |
| Attribute6 | Feature | Categorical | - | No | |
| Attribute7 | Feature | Categorical | - | No | |
| Attribute8 | Feature | Integer | - | No | |
| Attribute9 | Feature | Categorical | - | No | |
| Attribute10 | Feature | Categorical | - | No | |
| Attribute11 | Feature | Integer | - | No | |
| Attribute12 | Feature | Categorical | - | No | |
| Attribute13 | Feature | Integer | years | No | |
| Attribute14 | Feature | Categorical | - | No | |
| Attribute15 | Feature | Categorical | - | No | |
| Attribute16 | Feature | Integer | - | No | |
| Attribute17 | Feature | Categorical | - | No | |
| Attribute18 | Feature | Integer | - | No | |
| Attribute19 | Feature | Binary | - | No | |
| Attribute20 | Feature | Binary | - | No | |
| class | Target | Binary | - | No |
Introductory Paper
–