Korean-English bilingual videotext recognition for news headline generation based on a split-merge strategy

被引：0

作者：

Jung, Cheolkon ^{[1
]}

Jiao, Licheng ^{[1
]}

机构：

[1] Xidian Univ, Key Lab Intelligent Percept & Image Understanding, Minist Educ, Xian 710071, Peoples R China

来源：

JOURNAL OF REAL-TIME IMAGE PROCESSING | 2016年 / 11卷 / 01期

基金：

新加坡国家研究基金会; 中国国家自然科学基金;

关键词：

Korean-English videotext recognition; Content-based video retrieval; News headline generation; Split-merge strategy; Video OCR; TEXT;

D O I：

10.1007/s11554-012-0298-x

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper deals with Korean-English bilingual videotext recognition for news headline generation. Because videotext contains semantic content information, it can be effectively used for understanding videos. Despite its usefulness, it is a challengeable task to apply text recognition technologies to practical video applications because of the computational complexity and recognition accuracy. In this paper, we propose a novel Korean-English bilingual videotext recognition method to overcome the computational complexity as well as achieve comparable recognition accuracy. To recognize both Korean and English characters effectively, the proposed method employs an elaborate split-merge strategy in which the split segments are merged into characters using the recognition scores. Moreover, it avoids unnecessary computation using geometric features such as squareness and internal gap, and thus its computational overhead is remarkably reduced. Therefore, the proposed method is successfully employed in generating news headlines. The effectiveness and efficiency of the proposed method are verified by extensive experiments on a challenging database containing 51,290 text images (176,884 characters).

引用

页码：167 / 177

页数：11

共 40 条

[1] Bagdanov A, 1997, PROC INT CONF DOC, P401, DOI 10.1109/ICDAR.1997.619878
[2] Caption analysis and recognition for building video indexing systems
Chang, F
Chen, GC
Lin, CC
Lin, WH
[J]. MULTIMEDIA SYSTEMS, 2005, 10 (04) : 344 - 355
[3] An image-based automatic Arabic translation system
Chang, Yi
Chen, Datong
Zhang, Ying
Yang, Jie
[J]. PATTERN RECOGNITION, 2009, 42 (09) : 2127 - 2134
[4] Text detection and recognition in images and video frames
Chen, DT
Odobez, JM
Bourlard, H
[J]. PATTERN RECOGNITION, 2004, 37 (03) : 595 - 608
[5] Chin S, 2006, LECT NOTES ARTIF INT, V4114, P476
[6] Applications of video-content analysis and retrieval
Dimitrova, N
Zhang, HJ
Shahraray, B
Sezan, I
Huang, T
Zakhor, A
[J]. IEEE MULTIMEDIA, 2002, 9 (03) : 42 - 55
[7] Dimitrova N., 1997, Proceedings of the Sixth International Conference on Information and Knowledge Management. CIKM'97, P113, DOI 10.1145/266714.266876
[8] The statistical utilization of multiple measurements
Fisher, RA
[J]. ANNALS OF EUGENICS, 1938, 8 : 376 - 386
[9] Gao XB, 2003, ICCIMA 2003: FIFTH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, PROCEEDINGS, P108
[10] Haojin Yang, 2011, Proceedings of the 2011 IEEE International Symposium on Multimedia (ISM 2011), P111, DOI 10.1109/ISM.2011.26

← 1 2 3 4 →