Training on severely degraded text-line images

Prateek Sarkar, Henry S. Baird, Xiaohu Zhang

Abstract

We show that document image decoding (DID) supervised training algorithms, as a result of recent refinements, achieve high accuracy with low manual effort even under conditions of severe image degradation in both training and test data. We describe improvements in DID training of character template, set-width, and channel (noise) models. Large-scale experimental trials, using synthetically degraded images of text, have established two new and practically important advantages of DID algorithms: 1. high accuracy ( > 99% chraracters correct) in decoding using models trained on even severely degraded images from the same distribution; and 2. greatly improved accuracy ( < 1/10 the error rate) across a wide range of image degradations compared to untrained (idealized) models. This ability to train reliably on low-quality images that suffer from massive fragmentation and merging of characters, without the need for manual segmentation and labeling of character images, significantly reduces the manual effort of DID training.

Download paper

PDF

Bibtex entry

@inproceedings{sarkar:icdar2003
,author = "Prateek Sarkar and Henry S. Baird and Xiaohu Zhang"
,title = "Training on severely degraded text-line images"
,booktitle = "Proceedings of the Seventh ICDAR"
,address = "Edinburgh, Scotland"
,month = "August"
,year = "2003"
,pages = "38-43"
,http = {http://www.parc.xerox.com/istl/members/psarkar/PUBLICATIONS/ICDAR2003/download.html}
}
Prateek Sarkar
Last modified: Wed Jan 28 14:48:39 PST 2004