UC Irvine
ML Repository

National Health and Nutrition Health Survey 2013-2014 (NHANES) Age Prediction Subset

About

The National Health and Nutrition Examination Survey (NHANES), administered by the Centers for Disease Control and Prevention (CDC), collects extensive health and nutritional information from a diverse U.S. population. Though expansive, the dataset is often too broad for specific analytical purposes. In this sub-dataset, we narrow our focus to predicting respondents' age by extracting a subset of features from the larger NHANES dataset. These selected features include physiological measurements, lifestyle choices, and biochemical markers, which were hypothesized to have strong correlations with age.

The original full dataset can be found at: https://wwwn.cdc.gov/nchs/nhanes/search/DataPage.aspx?Component=Questionnaire&CycleBeginYear=2013

Preprocessing description: For this subset respondents 65 years old and older were labeled as “senior” and all individuals under 65 years old as “non-senior.”
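
The labeling rule in the preprocessing description can be sketched in a few lines of Python. This is only an illustration of the stated threshold, not the code used to build the subset: the function name `label_age_group`, the constant `SENIOR_AGE_THRESHOLD`, and the example ages are hypothetical and do not come from the dataset.

```python
# Threshold taken from the preprocessing description: 65 and older is "senior".
SENIOR_AGE_THRESHOLD = 65


def label_age_group(age_years: float) -> str:
    """Map an age in years to the binary label described for this subset.

    Hypothetical helper for illustration; the original preprocessing code
    is not published on this page.
    """
    if age_years < 0:
        raise ValueError("age must be non-negative")
    return "senior" if age_years >= SENIOR_AGE_THRESHOLD else "non-senior"


# Made-up ages chosen to straddle the threshold (not rows from the dataset).
example_ages = [24, 64, 65, 80]
labels = [label_age_group(age) for age in example_ages]

for age, label in zip(example_ages, labels):
    print(f"{age}: {label}")
```

Because the cut-off is inclusive, a respondent who is exactly 65 falls in the "senior" class, while anyone younger is "non-senior".
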
Subject Area: Health and Medicine
Instances: 6,287
Features: 7
Data Types: Tabular
Tasks: Classification
Feature Types: Continuous, Categorical, Integer

Features

Name | Role | Type | Units | Missing Values | Description

Introductory Paper

A data-driven approach to predicting diabetes and cardiovascular disease with machine learning
An Dinh, Stacey Miertschin, Amber Young, S. Mohanty. 2019.
BMC Medical Informatics and Decision Making

Additional Metadata

Keywords:
Authors: NA NA
Year Created: 2019
License: CC BY 4.0