A Robust Algorithm for Text Extraction from Images

被引:0
|
作者
Chidiac, Najwa-Maria [1 ]
Damien, Pascal [1 ]
Yaacoub, Charles [1 ]
机构
[1] Holy Spirit Univ Kaslik USEK, Fac Engn, POB 446, Jounieh, Lebanon
来源
2016 39TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP) | 2016年
关键词
MSER; OCR; Segmentation; SWD; Text Extraction; SEGMENTATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A robust algorithm that detects text from natural scene images and extracts them regardless of the orientation is proposed. All existing methods are designed to operate under a certain constraint, like detecting text only in one direction. Maximally Stable Extremal Regions (MSER) detector is chosen to extract binary regions since it has proven to be robust to lighting conditions. An enhancement technique for MSER images is designed to obtain clear letter boundaries. Images are then fed into a Stroke Width Detector and several heuristics are applied to remove non-text pixels. Afterwards, detected text regions are fed into an Optical Character Recognition module and then filtered according to their confidence measure. The recognition of characters is not part of the algorithm and the results are only about the detection of text. Our algorithm proved to be effective on blurred images and noisy images as well, based on both subjective and objective evaluations.
引用
收藏
页码:493 / 497
页数:5
相关论文
共 50 条
  • [31] Text line extraction from handwritten document pages using spiral run length smearing algorithm
    Malakar, Samir
    Halder, Sougata
    Sarkar, Ram
    Das, Nibaran
    Basu, Subhadip
    Nasipuri, Mita
    PROCEEDINGS OF THE 2012 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, DEVICES AND INTELLIGENT SYSTEMS (CODLS), 2012, : 616 - 619
  • [32] A robust audio fingerprint extraction algorithm
    Lebosse, Jerome
    Brun, Luc
    Pailles, Jean Claude
    PROCEEDINGS OF THE FOURTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PATTERN RECOGNITION, AND APPLICATIONS, 2007, : 269 - +
  • [33] Multilingual Artificial Text Detection and Extraction from Still Images
    Raza, Ahsen
    Abidi, Ali
    Siddiqi, Imran
    DOCUMENT RECOGNITION AND RETRIEVAL XX, 2013, 8658
  • [34] Text region extraction from quality degraded document images
    Abirami, S.
    Manjula, D.
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2007, 4815 : 519 - 527
  • [35] Image Enhancer-Text Extraction from Still Images
    Ladha, Uma
    Alshi, Ankita
    Shah, Dhara
    Sawant, Rupali
    INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND ENGINEERING (ACSE 2014), 2014, : 216 - 220
  • [36] A Survey on Text Information Extraction from Born-Digital and Scene Text Images
    S. P. Faustina Joan
    S. Valli
    Proceedings of the National Academy of Sciences, India Section A: Physical Sciences, 2019, 89 : 77 - 101
  • [37] A robust system for text extraction in video
    Zhou, Jingchao
    Xu, Lei
    Xiao, Baihua
    Dai, Ruwei
    Si, Si
    INTERNATIONAL CONFERENCE ON MACHINE VISION 2007, PROCEEDINGS, 2007, : 119 - +
  • [38] Text Extraction from Images using Gamma Correction Method and different Text Extraction Methods - A Comparative Analysis
    Devi, G. Gayathri
    Sumathi, C. P.
    2014 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2014,
  • [39] Text Line Extraction in Document Images
    Wang, Liuan
    Fan, Wei
    Sun, Jun
    Naoi, Satshi
    Tanaka, Hiroshi
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 191 - 195
  • [40] Unsupervised Text Extraction from G-Maps
    Adak, Chandranath
    2013 INTERNATIONAL CONFERENCE ON HUMAN COMPUTER INTERACTIONS (ICHCI), 2013,