UC Irvine
ML Repository

Thyroid Disease

Download (610.3 KB)

About

10 separate databases from Garavan Institute

# Documentation: as given by Ross Quinlan
# 6 databases from the Garavan Institute in Sydney, Australia
# Approximately the following for each database:
   ** 2800 training (data) instances and 972 test instances
   ** Plenty of missing data
   ** 29 or so attributes, either Boolean or continuously-valued
# 2 additional databases, also from Ross Quinlan, are also here
   ** Hypothyroid.data and sick-euthyroid.data
   ** Quinlan believes that these databases have been corrupted
   ** Their format is highly similar to the other databases
# 1 more database of 9172 instances that cover 20 classes, and a related domain theory
# Another thyroid database from Stefan Aeberhard
   ** 3 classes, 215 instances, 5 attributes
   ** No missing values
# A Thyroid database suited for training ANNs
   ** 3 classes
   ** 3772 training instances, 3428 testing instances
   ** Includes cost data (donated by Peter Turney)
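
Since the description mentions plenty of missing data and a mix of Boolean and continuously-valued attributes, a loader has to handle both. The following is a minimal sketch only, not the repository's documented format: it assumes comma-separated rows, `?` as the missing-value marker, and `t`/`f` for Booleans. The sample rows and all function names are hypothetical and invented for illustration.

```python
import csv
import io

MISSING = "?"  # assumed missing-value marker; not stated in the description above


def parse_value(raw):
    """Convert one raw field to None, bool, float, or leave it as a string."""
    raw = raw.strip()
    if raw == MISSING:
        return None
    if raw in ("t", "f"):  # assumed Boolean encoding
        return raw == "t"
    try:
        return float(raw)  # continuously-valued attribute
    except ValueError:
        return raw  # anything else, e.g. a class label


def load_rows(text):
    """Parse comma-separated text into a list of typed rows, skipping blank lines."""
    reader = csv.reader(io.StringIO(text))
    return [[parse_value(field) for field in row] for row in reader if row]


def missing_fraction(rows):
    """Fraction of fields that are missing across all rows."""
    total = sum(len(row) for row in rows)
    missing = sum(value is None for row in rows for value in row)
    return missing / total if total else 0.0


# Hypothetical sample rows, invented for illustration only (not real records).
SAMPLE = "41,f,t,1.3,?,negative\n23,f,f,?,102,negative\n"
rows = load_rows(SAMPLE)
```

Checking `missing_fraction(rows)` before modelling gives a quick sense of how much imputation or row filtering a given database may need. Confirm the actual delimiter and encodings against the downloaded files before reusing this.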
Subject Area: Health and Medicine
Instances: 7,200
Features: 5
Data Types: Multivariate
Tasks: Classification
Feature Types: Categorical, Continuous

Features

Name | Role | Type | Units | Missing Values

Introductory Paper

Additional Metadata

Keywords
Authors: Ross Quinlan
Year Created: 1986
License: CC BY 4.0