Computing Reviews, the leading online review service for computing literature.


Text and non-text separation in offline document images: a survey
Bhowmik S., Sarkar R., Nasipuri M., Doermann D. International Journal on Document Analysis and Recognition 21 (1-2): 1-20, 2018. Type: Article

Date Reviewed: Sep 7 2018

This survey on text and non-text separation in document images offers a fairly complete review of the document image analysis literature, with an extensive list of references covering both printed and handwritten text. The authors present tables comparing the performance of the methods found in the literature; however, the results are usually not directly comparable because the methods were evaluated on different datasets. Although the paper is interesting and provides a very good review of many different methods, it has some drawbacks. First, the authors classify document images into four classes, but mix printed and handwritten text in classes 2, 3, and 4. Handwritten text segmentation should be considered separately, since it is very different from printed text analysis. There are probably different and better categorizations of handwritten documents than the one presented here. Next, although the paper focuses on documents, one very important text and non-text separation problem is reading real-world text with cameras, where text detection and segmentation can be more complex than for the well-aligned, well-posed text found in documents. What about capturing text from a distorted view (camera not aligned with the text; text detection and segmentation from real-world camera images)? Handling such cases can be very helpful for people who rely on automated reading devices. Finally, while the included tables present references and performance over different datasets, a table summarizing the different features and approaches found in the literature would be more useful and interesting.

Reviewer: Fernando Osorio	Review #: CR146236 (1902-0060)

General (I.7.0)

Segmentation (I.4.6)

Reproduction in whole or in part without permission is prohibited. Copyright 1999-2024 ThinkLoud®