Illumination Invariant Text Recognition System Based On Contrast Limit Adaptive Histogram Equalization in Videos/Images

被引:2
作者
Smitha, M. L. [1 ]
Shekar, B. H. [2 ]
机构
[1] KVG Coll Engn, Dept Master Comp Applicat, Sullia, Karnataka, India
[2] Mangalore Univ, Dept Comp Sci, Mangalore, Karanataka, India
来源
PROCEEDING OF THE THIRD INTERNATIONAL SYMPOSIUM ON WOMEN IN COMPUTING AND INFORMATICS (WCI-2015) | 2015年
关键词
Text Detection; Text Localization; Segmentation; REcognition; Speech synthesis;
D O I
10.1145/2791405.2791408
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The textual information present in the image/video plays a major role in understanding scene/video. Tn this paper, we propose a system for detecting the text in videos possessing varying illumination based on Contrast Limit Adaptive Histogram Equalization(CLAFIE). Certain heuristic rules are applied to the preprocessed video frame to detect the text clusters. Further, the newly designed geometrical rules and morphological operations are employed OH the obtained text detection results for text localization. Once the text gets localized, the text lines are segmented followed by character segmentation. The segmented characters are then recognized using OCR and then synthesized as voice output message which is audible so that one can hear and understand the text content present in videos. The experimental results obtained on publicly available standard datasets, TRECVID video dataset and our own video dataset illustrate that the proposed method can detect, localize and recognize the texts of various sizes, fonts and colors.
引用
收藏
页码:174 / 179
页数:6
相关论文
共 23 条
[1]  
[Anonymous], 2011, ICCV
[2]  
[Anonymous], ICCV
[3]  
Cai M., 2002, P IEEE ICIP
[4]  
Chaddha N., 1994, C SIGN SYST COMP
[5]  
Chen Datong, 2004, PATTERN RECOGNITION
[6]  
Chen X., 2004, COMPUTER VISION PATT
[7]  
Coates A, 2011, ICDAR
[8]  
Dekel S., 2008, PREPRINT
[9]  
Epshtein B, 2010, PROC CVPR IEEE, P2963, DOI 10.1109/CVPR.2010.5540041
[10]   Automatic text detection and tracking in digital video [J].
Li, HP ;
Doermann, D ;
Kia, O .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2000, 9 (01) :147-156