UC Irvine
ML Repository
Theme

Heart Disease

Download(125.9 KB)

About

4 databases: Cleveland, Hungary, Switzerland, and the VA Long Beach This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In particular, the Cleveland database is the only one that has been used by ML researchers to date. The "goal" field refers to the presence of heart disease in the patient. It is integer valued from 0 (no presence) to 4. Experiments with the Cleveland database have concentrated on simply attempting to distinguish presence (values 1,2,3,4) from absence (value 0). The names and social security numbers of the patients were recently removed from the database, replaced with dummy values. One file has been "processed", that one containing the Cleveland database. All four unprocessed files also exist in this directory. To see Test Costs (donated by Peter Turney), please see the folder "Costs"
Subject Area
Health and Medicine
Instances
303
Features
13
Data Types
Multivariate
Tasks
Classification
Feature Types
Categorical, Integer, Continuous

Features

NameRoleTypeUnitsMissing ValuesDescription

Introductory Paper

International application of a new probability algorithm for the diagnosis of coronary artery disease.
R. Detrano, A. Jánosi, W. Steinbrunn, M. Pfisterer, J. Schmid, S. Sandhu, K. Guppy, S. Lee, V. Froelicher. 1989.
American Journal of Cardiology

Additional Metadata

Keywords
Authors
Andras Janosi
William Steinbrunn
Matthias Pfisterer
Robert Detrano
Year Created
1989
License
CC BY 4.0