Identifying script on word-level with informational confidence

被引:34
|
作者
Jaeger, S [1 ]
Ma, HF [1 ]
Doermann, D [1 ]
机构
[1] Univ Maryland, Inst Adv Comp Studies, College Pk, MD 20742 USA
来源
EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS | 2005年
关键词
D O I
10.1109/ICDAR.2005.134
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a multiple classifier system for script identification. Applying a Gabor filter analysis of textures on word-level, our system identifies Latin and non-Latin words in bilingual printed documents. The classfier system comprises four different architectures based on nearest neighbors, weighted Euclidean distances, Gaussian mixture models, and support vector machines. We report results for Arabic, Chinese, Hindi, and Korean script. Moreover we show that combining informational confidence values using sum-rule can consistently outperform the best single recognition rate.
引用
收藏
页码:416 / 420
页数:5
相关论文
共 50 条
  • [1] WORD-LEVEL RECOGNITION OF CURSIVE SCRIPT
    FARAG, RFH
    IEEE TRANSACTIONS ON COMPUTERS, 1979, 28 (02) : 172 - 175
  • [2] Word-Level Script Identification from Scene Images
    Fasil, O. K.
    Manjunath, S.
    Aradhya, V. N. Manjunath
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON FRONTIERS IN INTELLIGENT COMPUTING: THEORY AND APPLICATIONS, (FICTA 2016), VOL 2, 2017, 516 : 417 - 426
  • [3] Word-level Script Identification for Handwritten Indic scripts
    Singh, Pawan Kumar
    Sarkar, Ram
    Nasipuri, Mita
    Doermann, David
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 1106 - 1110
  • [4] Word-level Confidence Estimation for CTC Models
    Naowarat, Burin
    Kongthaworn, Thananchai
    Chuangsuwanich, Ekapol
    INTERSPEECH 2023, 2023, : 3297 - 3301
  • [5] WORD-LEVEL CONFIDENCE ESTIMATION FOR RNN TRANSDUCERS
    Wang, Mingqiu
    Soltau, Hagen
    El Shafey, Laurent
    Shafran, Izhak
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 1170 - 1177
  • [6] Word-level confidence estimation for machine translation
    Ueffing, Nicola
    Ney, Hermann
    COMPUTATIONAL LINGUISTICS, 2007, 33 (01) : 9 - 40
  • [7] Word-Level Script Identification from Handwritten Multi-script Documents
    Singh, Pawan Kumar
    Mondal, Arafat
    Bhowmik, Showmik
    Sarkar, Ram
    Nasipuri, Mita
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2014, VOL 1, 2015, 327 : 551 - 558
  • [8] Word-Level Script Identification Using Texture Based Features
    Singh, Pawan Kumar
    Sarkar, Ram
    Nasipuri, Mita
    INTERNATIONAL JOURNAL OF SYSTEM DYNAMICS APPLICATIONS, 2015, 4 (02) : 74 - 94
  • [9] Word-Level Thirteen Official Indic Languages Database for Script Identification in Multi-script Documents
    Obaidullah, Sk Md
    Santosh, K. C.
    Halder, Chayan
    Das, Nibaran
    Roy, Kaushik
    RECENT TRENDS IN IMAGE PROCESSING AND PATTERN RECOGNITION (RTIP2R 2016), 2017, 709 : 16 - 27
  • [10] A Texture based approach to Word-level Script Identification from Multi-script Handwritten Documents
    Singh, Pawan Kumar
    Khan, Aparajita
    Sarkar, Ram
    Nasipuri, Mita
    2014 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS, 2014, : 228 - 232