UC Irvine
ML Repository
Theme

Breast Cancer Wisconsin (Diagnostic)

Download(50.1 KB)

About

Diagnostic Wisconsin Breast Cancer Database. Features are computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image. A few of the images can be found at http://www.cs.wisc.edu/~street/images/ Separating plane described above was obtained using Multisurface Method-Tree (MSM-T) [K. P. Bennett, "Decision Tree Construction Via Linear Programming." Proceedings of the 4th Midwest Artificial Intelligence and Cognitive Science Society, pp. 97-101, 1992], a classification method which uses linear programming to construct a decision tree. Relevant features were selected using an exhaustive search in the space of 1-4 features and 1-3 separating planes. The actual linear program used to obtain the separating plane in the 3-dimensional space is that described in: [K. P. Bennett and O. L. Mangasarian: "Robust Linear Programming Discrimination of Two Linearly Inseparable Sets", Optimization Methods and Software 1, 1992, 23-34]. This database is also available through the UW CS ftp server: ftp ftp.cs.wisc.edu cd math-prog/cpo-dataset/machine-learn/WDBC/
Subject Area
Health and Medicine
Instances
569
Features
30
Data Types
Multivariate
Tasks
Classification
Feature Types
Continuous

Features

NameRoleTypeUnitsMissing Values

Introductory Paper

Nuclear feature extraction for breast tumor diagnosis
W. Street, W. Wolberg, O. Mangasarian. 1993.
Electronic imaging

Additional Metadata

Keywords
Authors
William Wolberg
Olvi Mangasarian
Nick Street
W. Street
Year Created
1993
License
CC BY 4.0