Thyroid Disease
About
10 separate databases from Garavan Institute
# From Garavan Institute
# Documentation: as given by Ross Quinlan
# 6 databases from the Garavan Institute in Sydney, Australia
# Approximately the following for each database:
** 2800 training (data) instances and 972 test instances
** Plenty of missing data
** 29 or so attributes, either Boolean or continuously-valued
# 2 additional databases, also from Ross Quinlan, are also here
** Hypothyroid.data and sick-euthyroid.data
** Quinlan believes that these databases have been corrupted
** Their format is highly similar to the other databases
# 1 more database of 9172 instances that cover 20 classes, and a related domain theory
# Another thyroid database from Stefan Aeberhard
** 3 classes, 215 instances, 5 attributes
** No missing values
# A Thyroid database suited for training ANNs
** 3 classes
** 3772 training instances, 3428 testing instances
** Includes cost data (donated by Peter Turney)
Subject Area
Health and Medicine
Instances
7,200
Features
5
Data Types
Multivariate
Tasks
Classification
Feature Types
Categorical, Continuous
Features
Name | Role | Type | Units | Missing Values |
---|---|---|---|---|
Class | Target | Categorical | - | No |
Attribute1 | Feature | Integer | - | No |
Attribute2 | Feature | Continuous | - | No |
Attribute3 | Feature | Continuous | - | No |
Attribute4 | Feature | Continuous | - | No |
Attribute5 | Feature | Continuous | - | No |
Introductory Paper
–