Keyword: Data Sets : Search

Advanced Search

Results: 1 - 1of1

Follow results:

per page:

Context for search term 1Search term 1*

All Dates

LastSelect static range

Custom Range

Select starting monthSelect starting year

Select ending monthSelect ending year

chapterNo Access
DATA SETS FOR OCR AND DOCUMENT IMAGE UNDERSTANDING RESEARCH
Handbook of Character Recognition and Document Image Analysis01 May 1997
Preview Abstract
Several significant sets of labeled samples of image data are surveyed that can be used in the development of algorithms for offline and online handwriting recognition as well as for machine printed text recognition. The method used to gather each data set, the numbers of samples they contain, and the associated truth data are discussed. In the domain of offline handwriting, the CEDAR, NIST, and CENPARMI data sets are presented. These contain primarily isolated digits and alphabetic characters. The UNIPEN data set of online handwriting was collected from a number of independent sources and it contains individual characters as well as handwritten phrases. The University of Washington document image databases are also discussed. They contain a large number of English and Japanese document images that were selected from a range of publications.