Artificial Characters
About
Dataset artificially generated by using first order theory which describes structure of ten capital letters of English alphabet
This database has been artificially generated by using a first order theory which describes the structure of ten capital letters of the English alphabet and a random choice theorem prover which accounts for etherogeneity in the instances. The capital letters represented are the following: A, C, D, E, F, G, H, L, P, R. Each instance is structured and is described by a set of segments (lines) which resemble the way an automatic program would segment an image. Each instance is stored in a separate file whose format is the following:
CLASS OBJNUM TYPE XX1 YY1 XX2 YY2 SIZE DIAG
where CLASS is an integer number indicating the class as described below, OBJNUM is an integer identifier of a segment (starting from 0) in the instance and the remaining columns represent attribute values. For further details, contact the author.
Subject Area
Computer Science
Instances
6,000
Features
7
Data Types
Multivariate
Tasks
Classification
Feature Types
Categorical, Integer, Continuous
Features
–
Introductory Paper
–