INTEGRATION OF n-GRAM LANGUAGE MODELS IN MULTIPLE CLASSIFIER SYSTEMS FOR OFFLINE HANDWRITTEN TEXT LINE RECOGNITION

被引:3
|
作者
Bertolami, Roman [1 ]
Bunke, Horst [1 ]
机构
[1] Univ Bern, Inst Comp Sci & Appl Math, CH-3012 Bern, Switzerland
基金
瑞士国家科学基金会;
关键词
Offline handwritten text line recognition; multiple classifier systems; language modeling;
D O I
10.1142/S0218001408006855
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Current multiple classifier systems for unconstrained handwritten text recognition do not provide a straightforward way to utilize language model information. In this paper, we describe a generic method to integrate a statistical n-gram language model into the combination of multiple offline handwritten text line recognizers. The proposed method first builds a word transition network and then rescores this network with an n-gram language model. Experimental evaluation conducted on a large dataset of offline handwritten text lines shows that the proposed approach improves the recognition accuracy over a reference system as well as over the original combination method that does not include a language model.
引用
收藏
页码:1301 / 1321
页数:21
相关论文
共 50 条
  • [1] N-gram language models for offline handwritten text recognition
    Zimmermann, M
    Bunke, H
    NINTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION, PROCEEDINGS, 2004, : 203 - 208
  • [2] Multiple classifier methods for offline handwritten text line recognition
    Bertolami, Roman
    Bunke, Horst
    MULTIPLE CLASSIFIER SYSTEMS, PROCEEDINGS, 2007, 4472 : 72 - +
  • [3] Revisiting N-Gram Models: Their Impact in Modern Neural Networks for Handwritten Text Recognition
    Tarride, Solene
    Kermorvant, Christopher
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT VI, 2024, 14809 : 167 - 182
  • [4] Multiple handwritten text line recognition systems derived from specific integration of a language model
    Bertolami, R
    Bunke, H
    EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 521 - 525
  • [5] Improved N-gram Phonotactic Models For Language Recognition
    BenZeghiba, Mohamed Faouzi
    Gauvain, Jean-Luc
    Lamel, Lori
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2718 - 2721
  • [6] Enhancing Handwritten Text Recognition with N-gram sequence decomposition and Multitask Learning
    Tassopoulou, Vasiliki
    Retsinas, George
    Maragos, Petros
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 10555 - 10560
  • [7] Language Identification of Short Text Segments with N-gram Models
    Vatanen, Tommi
    Vayrynen, Jaakko J.
    Virpioja, Sami
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 3423 - 3430
  • [8] N-gram and N-class models for on line handwriting recognition
    Perraud, F
    Viard-Gaudin, C
    Morin, E
    Lallican, PM
    SEVENTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2003, : 1053 - 1057
  • [9] Combining naive Bayes and n-gram language models for text classification
    Peng, FC
    Schuurmans, D
    ADVANCES IN INFORMATION RETRIEVAL, 2003, 2633 : 335 - 350
  • [10] Rejection strategies for offline handwritten text line recognition
    Bertolami, Roman
    Zimmermann, Matthias
    Bunke, Horst
    PATTERN RECOGNITION LETTERS, 2006, 27 (16) : 2005 - 2012