Uyghur Language Text Detection in Images

被引:0
作者
Liu, Shun [1 ,2 ]
Xie, Hongtao [2 ]
Yin, Jian [1 ]
Chen, Yajun [3 ]
机构
[1] Shandong Univ, Dept Comp, Weihai, Peoples R China
[2] Chinese Acad Sci, Inst Informat Engn, Natl Engn Lab Informat Secur Technol, Beijing 100093, Peoples R China
[3] 51 Stadium Rd Liangxiang North,Unit 91917, Beijing, Peoples R China
来源
EIGHTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2016) | 2016年 / 10033卷
关键词
Uyghur language; text detection; the channel-enhanced MSERs algorithm;
D O I
10.1117/12.2244133
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
Text detection in images is an important prerequisite for many image content analysis tasks. Actually, nearly all the widely-used methods focus on English and Chinese text detection while some minority language, such as Uyghur language, text detection is paid less attention by researchers. In this paper, we propose a system which detects Uyghur language text in images. First, component candidates are detected by channel-enhanced Maximally Stable Extremal Regions (MSERs) algorithm. Then, most non-text regions are removed by a two-layer filtering mechanism. Next, the rest component regions are connected into short chains, and the short chains are connected into complete chains. Finally, the non-text chains are pruned by a chain elimination filter. To evaluate our algorithm, we generate a new dataset by various Uyghur texts. As a result, experimental comparisons on the proposed dataset prove that our algorithm is effective for detecting Uyghur Language text in complex background images. The F-measure is 83.5%, much better than the state-of-the-art performance of 75.5%.
引用
收藏
页数:5
相关论文
共 5 条
[1]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[2]  
Leo B, 2001, MACH LEARN, V45, P422
[3]   Robust wide-baseline stereo from maximally stable extremal regions [J].
Matas, J ;
Chum, O ;
Urban, M ;
Pajdla, T .
IMAGE AND VISION COMPUTING, 2004, 22 (10) :761-767
[4]  
Neumann L, 2012, PROC CVPR IEEE, P3538, DOI 10.1109/CVPR.2012.6248097
[5]  
Yao C, 2012, PROC CVPR IEEE, P1083, DOI 10.1109/CVPR.2012.6247787