Adaptive Algorithms for Automated Processing of Document Images

dc.contributor.advisor: Davis, Larry
dc.contributor.advisor: Doermann, David
dc.contributor.author: Agrawal, Mudit
dc.contributor.department: Computer Science
dc.contributor.publisher: Digital Repository at the University of Maryland
dc.contributor.publisher: University of Maryland (College Park, Md.)
dc.date.accessioned: 2011-10-08T05:34:18Z
dc.date.available: 2011-10-08T05:34:18Z
dc.date.issued: 2011
dc.description.abstract: Large-scale document digitization projects continue to motivate document understanding technologies such as script and language identification, page classification, segmentation, and enhancement. Typically, however, solutions are still limited to narrow domains or regular formats such as books, forms, articles, or letters, and operate best on clean documents scanned in a controlled environment. More general collections of heterogeneous documents challenge the basic assumptions of state-of-the-art technology regarding quality, script, content, and layout. Our work explores the use of adaptive algorithms for the automated analysis of noisy and complex document collections.

We first propose, implement, and evaluate an adaptive clutter detection and removal technique for complex binary documents. Our distance-transform-based technique aims to remove irregular and independent unwanted foreground content while leaving text content untouched. The novelty of this approach lies in how it determines the best approximation to the clutter-content boundary where clutter meets text-like structures.

Second, we describe a page segmentation technique called Voronoi++ for complex layouts, which builds upon the state-of-the-art method proposed by Kise [Kise1999]. Our approach does not assume structured text zones and is designed to handle multilingual text in both handwritten and printed form. Voronoi++ is a dynamically adaptive and contextually aware approach that combines components' separation features with Docstrum-based [O'Gorman1993] angular and neighborhood features to form provisional zone hypotheses. These provisional zones are then verified against the context built from local separation and high-level content features.

Finally, our research proposes a generic model that uses font models to segment and recognize characters of any complex syllabic or non-syllabic script. This concept rests on the fact that font files contain all the information necessary to render text, and therefore also a model for how to decompose it. Instead of script-specific routines, this work is a step towards a generic character segmentation and recognition scheme for both Latin and non-Latin scripts.
dc.identifier.uri: http://hdl.handle.net/1903/11875
dc.subject.pqcontrolled: Computer science
dc.subject.pquncontrolled: document image processing
dc.subject.pquncontrolled: machine learning
dc.subject.pquncontrolled: noise removal
dc.subject.pquncontrolled: optical character recognition
dc.subject.pquncontrolled: page segmentation
dc.subject.pquncontrolled: pattern recognition
dc.title: Adaptive Algorithms for Automated Processing of Document Images
dc.type: Dissertation
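
The distance-transform-based clutter removal described in the abstract can be previewed with a minimal sketch. The thresholds size_thresh and dist_thresh below are hypothetical fixed values chosen for illustration; the thesis adapts the clutter/content decision to each document rather than hard-coding it. The idea is that thin text strokes keep the interior distance transform small, so a component that is both large and thick is more likely clutter than text.

```python
from scipy import ndimage

def remove_clutter(binary_img, size_thresh=500, dist_thresh=6.0):
    """Drop connected components that look like clutter rather than text.

    binary_img: 2-D boolean array, True = foreground (ink).
    size_thresh, dist_thresh: hypothetical fixed thresholds; the thesis
    adapts this decision per document instead of fixing it.
    """
    # Distance from each foreground pixel to the nearest background pixel:
    # thin text strokes yield small values, blob-like clutter large ones.
    dist = ndimage.distance_transform_edt(binary_img)

    labels, n = ndimage.label(binary_img)
    cleaned = binary_img.copy()
    for comp in range(1, n + 1):
        mask = labels == comp
        # Treat a component that is both large and thick as clutter.
        if mask.sum() > size_thresh and dist[mask].max() > dist_thresh:
            cleaned[mask] = False
    return cleaned
```

Note that this sketch only deletes whole components; the abstract's emphasis on approximating the clutter-content boundary concerns the harder case where clutter touches text, which is not reproduced here.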
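Voronoi++ builds on Kise's area-Voronoi segmentation. The sketch below shows only the underlying point-Voronoi idea on component centroids, with a single hypothetical gap_thresh standing in for the adaptive, context-aware ridge deletion (separation plus Docstrum-style angular and neighborhood features) that the thesis actually performs.

```python
import numpy as np
from scipy import ndimage
from scipy.spatial import Voronoi

def candidate_zone_boundaries(binary_img, gap_thresh=40.0):
    """Point-Voronoi approximation of Voronoi-style page segmentation.

    Voronoi ridges between components whose centroids are farther apart
    than gap_thresh are kept as candidate zone boundaries; closer
    components are implicitly merged into the same zone. gap_thresh is
    a hypothetical fixed value -- Voronoi++ instead derives the deletion
    criterion from local separation and content features.
    """
    labels, n = ndimage.label(binary_img)
    # Centroid (row, col) of each connected component.
    pts = np.array(ndimage.center_of_mass(binary_img, labels, range(1, n + 1)))
    vor = Voronoi(pts)
    boundaries = []
    for (p, q), ridge in zip(vor.ridge_points, vor.ridge_vertices):
        if -1 in ridge:
            continue  # ignore ridges that extend to infinity
        if np.linalg.norm(pts[p] - pts[q]) > gap_thresh:
            boundaries.append(vor.vertices[ridge])
    return boundaries
```

In the full method, the surviving boundaries would only form provisional zone hypotheses, to be verified against context before being accepted.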
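The font-model idea, that a font file already encodes how each character is drawn and hence how rendered text can be decomposed, can be illustrated with fontTools. This sketch merely extracts per-character outline commands as shape templates; how such templates drive the thesis's actual segmentation-recognition model is not reproduced here, and the example font name in the usage note is illustrative.

```python
from fontTools.ttLib import TTFont
from fontTools.pens.recordingPen import RecordingPen

def glyph_models(font_path):
    """Map each Unicode character in a font to its outline commands.

    The recorded moveTo/lineTo/curveTo sequences describe exactly how
    the font renders each character, and could serve as shape templates
    for a font-model-driven segmentation and recognition scheme.
    """
    font = TTFont(font_path)
    glyph_set = font.getGlyphSet()
    models = {}
    for codepoint, glyph_name in font.getBestCmap().items():
        pen = RecordingPen()
        glyph_set[glyph_name].draw(pen)  # record the outline operations
        models[chr(codepoint)] = pen.value
    return models
```

For example, glyph_models("NotoSansDevanagari-Regular.ttf") would yield outline templates for a syllabic script as readily as for Latin, which is what makes the approach script-generic.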

Files

Original bundle

Name: Agrawal_umd_0117E_12419.pdf
Size: 10.82 MB
Format: Adobe Portable Document Format