
Adult
Donated on 5/1/1996
Predict whether income exceeds $50K/yr based on census data. Also known as the "Census Income" dataset.
Dataset Characteristics
Multivariate
Subject Area
Social
Associated Tasks
Classification
Attribute Type
Categorical, Integer
# Instances
48842
# Attributes
15
Information
Additional Information
Extraction was done by Barry Becker from the 1994 Census database. A set of reasonably clean records was extracted using the following conditions: ((AAGE>16) && (AGI>100) && (AFNLWGT>1) && (HRSWK>0))

Prediction task is to determine whether a person makes over 50K a year.

Listing of attributes: >50K, <=50K.

age: continuous.
workclass: Private, Self-emp-not-inc, Self-emp-inc, Federal-gov, Local-gov, State-gov, Without-pay, Never-worked.
fnlwgt: continuous.
education: Bachelors, Some-college, 11th, HS-grad, Prof-school, Assoc-acdm, Assoc-voc, 9th, 7th-8th, 12th, Masters, 1st-4th, 10th, Doctorate, 5th-6th, Preschool.
education-num: continuous.
marital-status: Married-civ-spouse, Divorced, Never-married, Separated, Widowed, Married-spouse-absent, Married-AF-spouse.
occupation: Tech-support, Craft-repair, Other-service, Sales, Exec-managerial, Prof-specialty, Handlers-cleaners, Machine-op-inspct, Adm-clerical, Farming-fishing, Transport-moving, Priv-house-serv, Protective-serv, Armed-Forces.
relationship: Wife, Own-child, Husband, Not-in-family, Other-relative, Unmarried.
race: White, Asian-Pac-Islander, Amer-Indian-Eskimo, Other, Black.
sex: Female, Male.
capital-gain: continuous.
capital-loss: continuous.
hours-per-week: continuous.
native-country: United-States, Cambodia, England, Puerto-Rico, Canada, Germany, Outlying-US(Guam-USVI-etc), India, Japan, Greece, South, China, Cuba, Iran, Honduras, Philippines, Italy, Poland, Jamaica, Vietnam, Mexico, Portugal, Ireland, France, Dominican-Republic, Laos, Ecuador, Taiwan, Haiti, Columbia, Hungary, Guatemala, Nicaragua, Scotland, Thailand, Yugoslavia, El-Salvador, Trinadad&Tobago, Peru, Hong, Holand-Netherlands.
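The extraction condition above can be read as a simple record filter. The following is a minimal sketch, not the original extraction code: the `CensusRecord` class and the `is_clean` function are hypothetical names, the field meanings in the comments (age, adjusted gross income, final weight, hours per week) are assumed from the variable names, and the sample records are made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class CensusRecord:
    """Hypothetical container for the four fields used by the filter."""
    aage: int       # assumed: age in years
    agi: float      # assumed: adjusted gross income
    afnlwgt: float  # assumed: final sampling weight
    hrswk: int      # assumed: hours worked per week


def is_clean(record: CensusRecord) -> bool:
    """Apply ((AAGE>16) && (AGI>100) && (AFNLWGT>1) && (HRSWK>0))."""
    return (
        record.aage > 16
        and record.agi > 100
        and record.afnlwgt > 1
        and record.hrswk > 0
    )


# Made-up records, for illustration only.
records = [
    CensusRecord(aage=39, agi=25000.0, afnlwgt=77516.0, hrswk=40),  # passes
    CensusRecord(aage=16, agi=25000.0, afnlwgt=77516.0, hrswk=40),  # age not > 16
    CensusRecord(aage=52, agi=25000.0, afnlwgt=77516.0, hrswk=0),   # no hours worked
]
kept = [r for r in records if is_clean(r)]
```

All four comparisons are strict, so a record with an age of exactly 16 or zero hours per week is dropped.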
Has Missing Values
Symbol: 1
Features
Attribute Name | Role | Type | Description | Units | Missing Values |
---|---|---|---|---|---|
age | Feature | Continuous | N/A | | false |
workclass | Feature | Categorical | | | false |
fnlwgt | Feature | Continuous | | | false |
education | Feature | Categorical | | | false |
education-num | Feature | Continuous | | | false |
marital-status | Feature | Categorical | | | false |
occupation | Feature | Categorical | | | false |
relationship | Feature | Categorical | | | false |
race | Feature | Categorical | | | false |
sex | Feature | Categorical | | | false |
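The continuous/categorical split in the feature table suggests how a raw record might be parsed. The sketch below rests on several assumptions: that each record is one comma-separated line, that columns follow the order of the attribute listing with the class label last, and that the label column is called `income` (a hypothetical name; the listing gives only its values). The helper `parse_record` and the sample line are likewise made up for illustration.

```python
# Column order assumed to follow the attribute listing, class label last.
COLUMNS = [
    "age", "workclass", "fnlwgt", "education", "education-num",
    "marital-status", "occupation", "relationship", "race", "sex",
    "capital-gain", "capital-loss", "hours-per-week", "native-country",
    "income",  # hypothetical name for the >50K / <=50K label
]

# Attributes described as "continuous" in the listing.
CONTINUOUS = [
    "age", "fnlwgt", "education-num",
    "capital-gain", "capital-loss", "hours-per-week",
]


def parse_record(line: str) -> dict:
    """Split one comma-separated line into a dict keyed by column name."""
    values = [v.strip() for v in line.split(",")]
    if len(values) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} fields, got {len(values)}")
    record = dict(zip(COLUMNS, values))
    # Cast the continuous attributes to integers; the rest stay as strings.
    for name in CONTINUOUS:
        record[name] = int(record[name])
    # Binary target: 1 if income exceeds 50K, else 0.
    record["label"] = 1 if record["income"] == ">50K" else 0
    return record


# Made-up line for illustration; category values are taken from the listing.
sample = ("45, Private, 120000, HS-grad, 9, Divorced, Sales, Unmarried, "
          "Black, Female, 0, 0, 38, Canada, >50K")
row = parse_record(sample)
```

Categorical values are kept as plain strings here; any encoding step (one-hot, ordinal) is left to the downstream model.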
Adult. (1996). UCI Machine Learning Repository.
@misc{misc_adult_2, title = {{Adult}}, year = {1996}, howpublished = {UCI Machine Learning Repository} }
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.