UC Irvine
ML Repository

Hayes-Roth

Download (3.4 KB)

About

Topic: human subjects study.

This database contains 5 numeric-valued attributes. Only a subset of 3 (the latter 3) is used during testing. Furthermore, only 2 of the 3 concepts are "used" during testing (i.e., those with the prototypes 000 and 111). I've mapped all values to their zero-indexed equivalents. Some instances could be placed in either category 0 or 1; following the authors' suggestion, I've placed them in each category with equal probability.

I've replaced the actual values of the attributes (e.g., hobby has the values chess, sports, and stamps) with numeric values. I think this is how the authors did it when testing the categorization models described in the paper. I find this unfair: while the subjects were able to bring background knowledge to bear on the attribute values and their relationships, the algorithms were provided with no such knowledge.

I'm uncertain whether the 2 distractor attributes (name and hobby) are presented to the authors' algorithms during testing. However, it is clear that only the age, educational status, and marital status attributes are given during the human subjects' transfer tests.
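To make the description above concrete, here is a minimal Python sketch of selecting the latter 3 attributes and comparing them with the two prototypes, 000 and 111. The column order, the function names, and the "fewest mismatches" rule are assumptions made for illustration; they are not taken from the data file or from the authors' categorization models. Only the random tie-break mirrors the equal-probability placement mentioned above.

```python
import random

# Column order assumed from the description: two distractors first, then the
# three attributes used in testing. These names are illustrative only.
COLUMNS = ("name", "hobby", "age", "education", "marital_status")

# Prototypes of the two concepts used during testing (000 and 111).
PROTOTYPES = {0: (0, 0, 0), 1: (1, 1, 1)}


def select_test_attributes(row):
    """Return the latter 3 attribute values of a 5-value row."""
    return tuple(row[2:5])


def mismatches(values, prototype):
    """Count the positions where values differ from the prototype."""
    return sum(v != p for v, p in zip(values, prototype))


def assign_category(row, rng=random):
    """Pick category 0 or 1 by fewest mismatches (a hypothetical rule).

    Ties are broken with equal probability, echoing how ambiguous
    instances are said to be placed in the description.
    """
    values = select_test_attributes(row)
    d0 = mismatches(values, PROTOTYPES[0])
    d1 = mismatches(values, PROTOTYPES[1])
    if d0 == d1:
        return rng.choice([0, 1])
    return 0 if d0 < d1 else 1
```

For example, `assign_category((7, 2, 0, 0, 1))` ignores the first two values, compares `(0, 0, 1)` with both prototypes, and returns category 0, because the row differs from 000 in one position and from 111 in two.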
Subject Area: Social Science
Instances: 160
Features: 5
Data Types: Multivariate
Tasks: Classification
Feature Types: Categorical

Features

Name | Role | Type | Units | Missing Values

Introductory Paper

Additional Metadata

Keywords
Authors: Barbara Hayes-Roth, Frederick Hayes-Roth
Year Created: 1977
License: CC BY 4.0