Letter Recognition

About

Database of character image features; try to identify the letter.

The objective is to identify each of a large number of black-and-white rectangular pixel displays as one of the 26 capital letters in the English alphabet. The character images were based on 20 different fonts, and each letter within these 20 fonts was randomly distorted to produce a file of 20,000 unique stimuli. Each stimulus was converted into 16 primitive numerical attributes (statistical moments and edge counts), which were then scaled to fit into a range of integer values from 0 through 15. We typically train on the first 16,000 items and then use the resulting model to predict the letter category for the remaining 4,000. See the article cited above for more details.
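The train/test convention described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' tooling: the helper names `parse_record` and `split_train_test` are hypothetical, and the record layout (one comma-separated line per stimulus, letter first, then the 16 integer attributes) is an assumption about the data file.

```python
from typing import List, Sequence, Tuple

# A record is (letter, [16 integer attributes]).
Record = Tuple[str, List[int]]


def parse_record(line: str) -> Record:
    """Parse one line, assuming the layout: letter, then 16 comma-separated integers."""
    fields = line.strip().split(",")
    if len(fields) != 17:
        raise ValueError(f"expected 17 fields, got {len(fields)}")
    label = fields[0]
    features = [int(value) for value in fields[1:]]
    return label, features


def split_train_test(
    records: Sequence[Record], n_train: int = 16000
) -> Tuple[List[Record], List[Record]]:
    """Use the first n_train records for training and the rest for testing."""
    return list(records[:n_train]), list(records[n_train:])
```

With all 20,000 records loaded in file order, `split_train_test(records)` would give 16,000 training records and 4,000 test records. The split is positional rather than random, which matches the convention stated in the description.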

Subject Area

Computer Science

Instances

20,000

Features

16

Data Types

Multivariate

Tasks

Classification

Feature Types

Integer

Features

Name	Role	Type	Description	Units	Missing Values
lettr	Target	Categorical	capital letter	-	No
x-box	Feature	Integer	horizontal position of box	-	No
y-box	Feature	Integer	vertical position of box	-	No
width	Feature	Integer	width of box	-	No
high	Feature	Integer	height of box	-	No
onpix	Feature	Integer	total # on pixels	-	No
x-bar	Feature	Integer	mean x of on pixels in box	-	No
y-bar	Feature	Integer	mean y of on pixels in box	-	No
x2bar	Feature	Integer	mean x variance	-	No
y2bar	Feature	Integer	mean y variance	-	No
xybar	Feature	Integer	mean x y correlation	-	No
x2ybr	Feature	Integer	mean of x * x * y	-	No
xy2br	Feature	Integer	mean of x * y * y	-	No
x-ege	Feature	Integer	mean edge count left to right	-	No
xegvy	Feature	Integer	correlation of x-ege with y	-	No
y-ege	Feature	Integer	mean edge count bottom to top	-	No
yegvx	Feature	Integer	correlation of y-ege with x	-	No
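The schema in the table (one capital-letter target, 16 integer features scaled to 0 through 15, no missing values) lends itself to a simple sanity check. The sketch below is illustrative only: `FEATURE_NAMES` copies the names from the table, while `validate_record` is a hypothetical helper, not part of any published loader.

```python
import string
from typing import List, Sequence

# Feature names in the order listed in the table above.
FEATURE_NAMES = [
    "x-box", "y-box", "width", "high", "onpix", "x-bar", "y-bar", "x2bar",
    "y2bar", "xybar", "x2ybr", "xy2br", "x-ege", "xegvy", "y-ege", "yegvx",
]


def validate_record(label: str, features: Sequence[int]) -> List[str]:
    """Return a list of schema problems for one record (empty if it looks valid)."""
    problems = []
    # The target must be a single capital letter A-Z.
    if len(label) != 1 or label not in string.ascii_uppercase:
        problems.append(f"label {label!r} is not a single capital letter")
    # There must be exactly one value per named feature.
    if len(features) != len(FEATURE_NAMES):
        problems.append(
            f"expected {len(FEATURE_NAMES)} features, got {len(features)}"
        )
    # Every feature value must be an integer in the scaled range 0..15.
    for name, value in zip(FEATURE_NAMES, features):
        if not isinstance(value, int) or not 0 <= value <= 15:
            problems.append(f"{name}={value!r} is outside 0..15")
    return problems
```

Returning a list of messages instead of raising on the first problem makes it easy to report every issue in a record at once; callers that prefer strict behavior can raise when the list is non-empty.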

Introductory Paper

–

Additional Metadata

Keywords

object recognition

Authors

David Slate

Year Created

1991

DOI

10.24432/C5ZP40

License

CC BY 4.0
