A Caption Text Detection Method from Images/Videos for Efficient Indexing and Retrieval of Multimedia Data

被引:4
作者
Tehsin, Samabia [1 ]
Masood, Asif [1 ]
Kausar, Sumaira [2 ]
Javed, Yunous [2 ]
机构
[1] NUST, MCS, Islamabad, Pakistan
[2] NUST, Coll E&ME, Islamabad, Pakistan
关键词
Text extraction; image retrieval; caption text; document analysis; ICDAR; 2013; VIDEO; IMAGES; EXTRACTION; SEGMENTATION; LOCALIZATION; RECOGNITION; CHARACTERS; ALGORITHM;
D O I
10.1142/S0218001415550034
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Textual information embedded in multimedia can provide a vital tool for indexing and retrieval. Text extraction process has many inherent problems due to the variation in font sizes, color, backgrounds and resolution. Text detection and localization are the most challenging phases of text extraction process whereas text extraction results are highly dependent upon these phases. This paper focuses on the text localization because of its very fundamental importance. Two effective feature vectors are introduced for the classification of the text and nontext objects. First feature vector is represented by the Radon transform of text candidate objects. Second feature vector is derived from the detailed geometrical analysis of text contents. Union of two feature vectors is used for the classification of text and nontext objects using support vector machine (SVM). Text detection and localization results are evaluated on two publicly available datasets namely ICDAR 2013 and IPC-Artificial text. Moreover, results are compared with state-of-the-art techniques and the Comparison demonstrates the superiority of the presented research.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] [Anonymous], 2013, P 12 INT C DOC AN RE
  • [2] [Anonymous], IEEE REG 10 C TENCON
  • [3] Anthimopoulos Marios, 2010, Proceedings of the 2010 20th International Conference on Pattern Recognition (ICPR 2010), P3264, DOI 10.1109/ICPR.2010.798
  • [4] A two-stage scheme for text detection in video images
    Anthimopoulos, Marios
    Gatos, Basilis
    Pratikakis, Ioannis
    [J]. IMAGE AND VISION COMPUTING, 2010, 28 (09) : 1413 - 1426
  • [5] Arthur D., posium on Discrete algorithms, P1027
  • [6] DISCRETE RADON-TRANSFORM
    BEYLKIN, G
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1987, 35 (02): : 162 - 172
  • [7] CAI M, 2002, IEEE INT C IMAGE PRO, V1, P1
  • [8] Text detection and recognition in images and video frames
    Chen, DT
    Odobez, JM
    Bourlard, H
    [J]. PATTERN RECOGNITION, 2004, 37 (03) : 595 - 608
  • [9] Chen H., 2011, 2011 18th IEEE International Conference on Image Processing (ICIP 2011), P2609, DOI 10.1109/ICIP.2011.6116200
  • [10] Dudoit S, 2002, GENOME BIOL, V3