Automated Latin Text Detection in Document Images and Natural Scene Images based on Connected Component Analysis

被引:0
|
作者
Khan, Muhammad Jaleed [1 ]
Said, Naina [2 ]
Khan, Aqsa [3 ]
Rehman, Naila [3 ]
Khurshid, Khurram [1 ]
机构
[1] Inst Space Technol, Dept Elect Engn, iVis Lab, Islamabad, Pakistan
[2] Univ Engn & Technol, Dept Comp Syst Engn, Peshawar, Pakistan
[3] Ghulam Ishaq Khan Inst Engn Sci & Technol, Fac Comp Sci & Engn, Topi, Pakistan
来源
2019 2ND INTERNATIONAL CONFERENCE ON COMPUTING, MATHEMATICS AND ENGINEERING TECHNOLOGIES (ICOMET) | 2019年
关键词
text detection; connected componenet; maximally stable extremal region; geometric checks; canny edge detector;
D O I
10.1109/icomet.2019.8673477
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Robust and accurate detection of text in natural scene images and document images is a very challenging and common research problem. Over the past few decades, a variety of algorithms for text detection in images have been developed but there is still need for more robust and accurate text detection methods. In this work, we have proposed an accurate and robust text detection framework in which canny edge detection, maximally stable extremal regions and geometric filtering are employed in combination to efficiently collect and filter letter candidates in an image. Subsequently, individual letter patches are grouped to detect text sequences, which are then fragmented into isolated word patches. Finally, optical character recognition is employed to digitize the word patches. The proposed algorithm is tested on images representing different scenarios ranging from documents to natural scenes. Promising results have been reported which prove the accuracy and robustness of the proposed framework and encourage its practical implementation in real world applications.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Text Detection in Natural Scene Images Using Morphological Component Analysis and Laplacian Dictionary
    Shuping Liu
    Yantuan Xian
    Huafeng Li
    Zhengtao Yu
    IEEE/CAAJournalofAutomaticaSinica, 2020, 7 (01) : 214 - 222
  • [2] Text detection in natural scene images using morphological component analysis and Laplacian dictionary
    Liu, Shuping
    Xian, Yantuan
    Li, Huafeng
    Yu, Zhengtao
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, 7 (01) : 214 - 222
  • [3] Text detection and restoration in natural scene images
    Ye, Qixiang
    Hao, Jianbin
    Huang, Jun
    Yu, Hua
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2007, 18 (06) : 504 - 513
  • [4] Forged text detection in video, scene, and document images
    Nandanwar, Lokesh
    Shivakumara, Palaiahnakote
    Mondal, Prabir
    Raghunandan, Karpuravalli Srinivas
    Pal, Umapada
    Lu, Tong
    Lopresti, Daniel
    IET IMAGE PROCESSING, 2020, 14 (17) : 4744 - 4755
  • [5] Text Detection and Recognition in Natural Scene Images
    Pise, Amruta
    Ruikar, S. D.
    2014 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2014,
  • [6] Text detection in natural scene images based on color prior guided MSER
    Zhang, Xiangnan
    Gao, Xinbo
    Tian, Chunna
    NEUROCOMPUTING, 2018, 307 : 61 - 71
  • [7] Research on the Text Detection and Recognition in Natural Scene Images
    Wei Zi-han
    Du Xiao-ping
    Cao Lei
    ELEVENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2019), 2020, 11373
  • [8] Fast and Accurate Text Detection in Natural Scene Images
    Xiao, Chengqiu
    Ji, Lixin
    Gao, Chao
    Li, Shaomei
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: IMAGE AND VIDEO DATA ENGINEERING, ISCIDE 2015, PT I, 2015, 9242 : 1 - 10
  • [9] Text detection in natural scene images with feature combination
    Ye, Qixiang
    Jiao, Jianbin
    Huang, Jun
    Yu, Hua
    PROCEEDINGS OF THE NINTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, 2007, : 397 - 402
  • [10] Comparative Study of Text Detection in Natural Scene Images
    Saini, Shareen
    Marawaha, Chetan
    2016 IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2016, : 1981 - 1985