Text-Based Image Segmentation Methodology

被引:7
作者
Mehul, Gupta [1 ]
Ankita, Patel [1 ]
Namrata, Dave [2 ]
Rahul, Goradia [3 ]
Sheth, Saurin [4 ]
机构
[1] GH Patel Coll Engn & Technol, Vallabh Vidyanagar 388120, Gujarat, India
[2] GCET Coll, Dept Comp Engn, Vallabh Vidyanagar 388120, Gujarat, India
[3] GCET Coll, Elect & Commun Engn Dept, Vallabh Vidyanagar 388120, Gujarat, India
[4] GCET Coll, Mechatron Engn Dept, Vallabh Vidyanagar 388120, Gujarat, India
来源
2ND INTERNATIONAL CONFERENCE ON INNOVATIONS IN AUTOMATION AND MECHATRONICS ENGINEERING, ICIAME 2014 | 2014年 / 14卷
关键词
Handwritten text and printed text comparison; levels of segmentation; segmentation methodologies; text document image analysis; DOCUMENTS; SYSTEM;
D O I
10.1016/j.protcy.2014.08.059
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In computer vision, segmentation is the process of partitioning a digital image into multiple segments (sets of pixels). Image segmentation is thus inevitable. Segmentation used for text-based images aim in retrieval of specific information from the entire image. This information can be a line or a word or even a character. This paper proposes various methodologies to segment a text based image at various levels of segmentation. This material serves as a guide and update for readers working on the text based segmentation area of Computer Vision. First, the need for segmentation is justified in the context of text based information retrieval. Then, the various factors affecting the segmentation process are discussed. Followed by the levels of text segmentation are explored. Finally, the available techniques with their superiorities and weaknesses are reviewed, along with directions for quick referral are suggested. Special attention is given to the handwriting recognition since this area requires more advanced techniques for efficient information extraction and to reach the ultimate goal of machine simulation of human reading. (C) 2014 The Authors. Published by Elsevier Ltd.
引用
收藏
页码:465 / 472
页数:8
相关论文
共 33 条
[1]  
[Anonymous], 2001, Computer Vision
[2]  
Arica Nafiz, 2001, IEEE T SYSTEMS MAN C
[3]   A font and size-independent OCR system for printed Kannada documents using support vector machines [J].
Ashwin, TV ;
Sastry, PS .
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2002, 27 (1) :35-58
[4]  
Brodic D., 2011, J UNIVERSAL COMPUTER, V17
[5]  
Das S., 2010, INT J ENG SCI TECHNO, V2, P6606
[6]  
dos Santos RP, 2009, 2009 10 INT C DOC AN
[7]   A METHOD OF DETECTING THE ORIENTATION OF ALIGNED COMPONENTS [J].
HASHIZUME, A ;
YEH, PS ;
ROSENFELD, A .
PATTERN RECOGNITION LETTERS, 1986, 4 (02) :125-132
[8]   Text information extraction in images and video: a survey [J].
Jung, K ;
Kim, KI ;
Jain, AK .
PATTERN RECOGNITION, 2004, 37 (05) :977-997
[9]  
Kanai J., 1998, International Journal on Document Analysis and Recognition, V1, P43
[10]  
Kluever K. A., 2008, STUDY REPORT CHARACT, P1