Improving OCR for an Under-Resourced Script Using Unsupervised Word-Spotting

被引:0
作者
Silberpfennig, Adi [1 ]
Wolf, Lior [1 ]
Dershowitz, Nachum [1 ]
Bhagesh, Seraogi [2 ]
Chaudhuri, Bidyut B. [2 ]
机构
[1] Tel Aviv Univ, Blavatnik Sch Comp Sci, Tel Aviv, Israel
[2] Indian Stat Inst, Kolkata, India
来源
2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR) | 2015年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Optical character recognition (OCR) quality, especially for under-resourced scripts like Bangia, as well as for documents printed in old typefaces, is a major concern. An efficient and effective pipeline for OCR betterment is proposed here. The method is unsupervised. It employs a baseline OCR engine as a black box plus a dataset of unlabeled document images. That engine is applied to the images, followed by a visual encoding designed to support efficient word spotting. Given a new document to be analyzed, the black-box recognition engine is first applied. Then, for each result, word spotting is carried out within the dataset. The unreliable OCR outputs of the retrieved word spotting results are then considered. The word that is the centroid of the set of OCR words, measured by edit distance, is deemed a candidate reading.
引用
收藏
页码:706 / 710
页数:5
相关论文
共 50 条
  • [21] A Novel Word-Spotting Method for Handwritten Documents Using an Optimization-Based Classifier
    Tavoli, Reza
    Keyvanpour, Mohammadreza
    APPLIED ARTIFICIAL INTELLIGENCE, 2017, 31 (04) : 346 - 375
  • [22] Speech recognition of under-resourced languages using mismatched transcriptions
    Do, Van Hai
    Chen, Nancy F.
    Lim, Boon Pang
    Hasegawa-Johnson, Mark
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 112 - 115
  • [23] Unsupervised word spotting using a graph representation based on invariants
    Bui, Quang Anh
    Visani, Muriel
    Mullot, Remy
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 616 - 620
  • [24] Improving Under-Resourced Language ASR Through Latent Subword Unit Space Discovery
    Razavi, Marzieh
    Magimai-Doss, Mathew
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3873 - 3877
  • [25] Enhancing ASR Systems for Under-Resourced Languages through a Novel Unsupervised Acoustic Model Training Technique
    Cucu, Horia
    Buzo, Andi
    Besacier, Laurent
    Burileanu, Corneliu
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2015, 15 (01) : 63 - 68
  • [26] Sub-word Based End-to-End Speech Recognition for an Under-Resourced Language: Amharic
    Gebreegziabher, Nirayo Hailu
    Nuernberger, Andreas
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 3466 - 3470
  • [27] Improving healthy connections in under-resourced youth: A YMCA San Diego mental health initiative
    Chavez, Noe Ruben
    Halmai-Gillan, Kristina T. K.
    Esquivel, Krysta
    McCarthy, Megan
    DeVico, Nicholas
    Lee, Sophia
    Ferrer, Mildred
    Ramos, Amy L.
    CHILDREN AND YOUTH SERVICES REVIEW, 2023, 150
  • [28] Using out-of-language data to improve an under-resourced speech recognizer
    Imseng, David
    Motlicek, Petr
    Bourlard, Herve
    Garner, Philip N.
    SPEECH COMMUNICATION, 2014, 56 : 142 - 151
  • [29] Building synthetic voices for under-resourced languages: the feasibility of using audiobook data
    de Wet, Febe
    Van der Walt, Willem
    Dlamini, Nkosikhona
    Govender, Avashna
    2017 PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA AND ROBOTICS AND MECHATRONICS (PRASA-ROBMECH), 2017, : 225 - 229
  • [30] USING KL-DIVERGENCE AND MULTILINGUAL INFORMATION TO IMPROVE ASR FOR UNDER-RESOURCED LANGUAGES
    Imseng, David
    Bourlard, Herve
    Garner, Philip N.
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4869 - 4872