Context-Aware Confidence Estimation for Rejection in Handwritten Chinese Text Recognition

Cited by: 0
Authors
Liu, Yangyang [1, 2]
Chen, Yi [1, 2]
Yin, Fei [1, 2]
Liu, Cheng-Lin [1, 2]
Affiliations
[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
Source
DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT I | 2024 / Vol. 14804
Funding
National Natural Science Foundation of China
Keywords
Handwritten Chinese Text Recognition; Confidence Estimation; Geometric Context; Bayesian probability formula; DISCRIMINATIVE UTTERANCE VERIFICATION; TRANSFORMATION; ONLINE;
DOI
10.1007/978-3-031-70533-5_9
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Handwritten Chinese Text Recognition (HCTR) has advanced considerably in recent years thanks to deep learning. However, the remaining recognition errors still hinder reliability-critical applications where zero error is desired. Rejecting low-confidence patterns can reduce the error rate, but a higher rejection rate is itself costly. In this paper, we propose a character confidence estimation method that incorporates context for character rejection in HCTR. Given a text line recognizer that outputs character segmentation and classification results, the confidence of each segmented character is estimated by combining the scores of a re-trained character classifier with linguistic and geometric context scores. We introduce a probabilistic formula for estimating the confidence from the classifier and contextual scores, and an improved approach for scoring the geometric context using unary and binary geometric features. Experimental evaluations on the CASIA-HWDB and ICDAR2013 datasets demonstrate that our method significantly improves rejection performance, achieving a low error rate at a moderate rejection rate. The re-trained classifier, the linguistic context, and the geometric context are each shown to be effective in improving the confidence estimates.
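The trade-off between error rate and rejection rate described in the abstract can be illustrated with a small sketch. This is not the paper's formula, which the abstract does not give. The function names, the log-linear fusion (a weighted geometric mean), the weights, and the definition of error rate as accepted errors over all characters are assumptions made only for illustration.

```python
import math


def combine_confidence(p_classifier, p_linguistic, p_geometric,
                       weights=(1.0, 1.0, 1.0)):
    """Hypothetical fusion of three scores in (0, 1].

    Assumption: a weighted geometric mean (log-linear combination).
    The paper's probabilistic formula may differ.
    """
    scores = (p_classifier, p_linguistic, p_geometric)
    # Clamp scores away from zero so the logarithm is always defined.
    log_sum = sum(w * math.log(max(s, 1e-12)) for w, s in zip(weights, scores))
    return math.exp(log_sum / sum(weights))


def reject_and_evaluate(confidences, is_correct, threshold):
    """Reject characters whose confidence is below the threshold.

    Returns (rejection_rate, error_rate). Assumption: error_rate counts
    accepted-but-wrong characters divided by the total number of characters.
    """
    n = len(confidences)
    accepted = [ok for c, ok in zip(confidences, is_correct) if c >= threshold]
    rejection_rate = (n - len(accepted)) / n
    error_rate = sum(1 for ok in accepted if not ok) / n
    return rejection_rate, error_rate


if __name__ == "__main__":
    # Toy data: four characters with made-up confidences and correctness flags.
    confs = [0.9, 0.2, 0.8, 0.4]
    correct = [True, False, False, True]
    for t in (0.0, 0.5, 0.85):
        print(t, reject_and_evaluate(confs, correct, t))
```

Sweeping the threshold over a validation set traces an error-versus-rejection curve. A better confidence estimate moves that curve toward lower error at the same rejection rate.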
Pages: 134-151
Page count: 18
Related papers
32 in total
  • [21] A comprehensive study of hybrid neural network hidden Markov model for offline handwritten Chinese text recognition
    Wang, Zi-Rui
    Du, Jun
    Wang, Wen-Chao
    Zhai, Jian-Fang
    Hu, Jin-Shui
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2018, 21 (04) : 241 - 251
  • [22] PageNet: Towards End-to-End Weakly Supervised Page-Level Handwritten Chinese Text Recognition
    Peng, Dezhi
    Jin, Lianwen
    Liu, Yuliang
    Luo, Canjie
    Lai, Songxuan
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (11) : 2623 - 2645
  • [23] Weakly Supervised Learning for Over-Segmentation Based Handwritten Chinese Text Recognition
    Wang, Zhen-Xing
    Wang, Qiu-Feng
    Yin, Fei
    Liu, Cheng-Lin
    2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020), 2020, : 157 - 162
  • [24] Improving handwritten Chinese text recognition using neural network language models and convolutional neural network shape models
    Wu, Yi-Chao
    Yin, Fei
    Liu, Cheng-Lin
    PATTERN RECOGNITION, 2017, 65 : 251 - 264
  • [25] Writer Code Based Adaptation of Deep Neural Network for Offline Handwritten Chinese Text Recognition
    Wang, Zi-Rui
    Du, Jun
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 548 - 553
  • [26] High Performance Offline Handwritten Chinese Text Recognition with a New Data Preprocessing and Augmentation Pipeline
    Xie, Canyu
    Lai, Songxuan
    Liao, Qianying
    Jin, Lianwen
    DOCUMENT ANALYSIS SYSTEMS, 2020, 12116 : 45 - 59
  • [27] Deep Neural Network based Hidden Markov Model for Offline Handwritten Chinese Text Recognition
    Du, Jun
    Wang, Zi-Rui
    Zhai, Jian-Fang
    Hu, Jin-Shui
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3428 - 3433
  • [28] Handwritten Chinese/Japanese Text Recognition Using Semi-Markov Conditional Random Fields
    Zhou, Xiang-Dong
    Wang, Da-Han
    Tian, Feng
    Liu, Cheng-Lin
    Nakagawa, Masaki
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (10) : 2413 - 2426