A Tree-Structure Analysis Network on Handwritten Chinese Character Error Correction

被引:9
|
作者
Li, Yunqing [1 ]
Du, Jun [1 ]
Zhang, Jianshu [2 ]
Wu, Changjie [1 ]
机构
[1] Univ Sci & Technol China, Natl Engn Res Ctr Speech & Language Informat Proc, Hefei 230026, Peoples R China
[2] AI Res iFLYTEK, Hefei 230088, Peoples R China
关键词
Measurement; Handwriting recognition; Analytical models; Error analysis; Statistical analysis; Layout; Error correction; Handwritten Chinese character error correction; CNN; tree-structure analysis network; triplet loss; quantitative analysis; RECOGNITION;
D O I
10.1109/TMM.2022.3163517
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Existing researches on handwritten Chinese characters are mainly based on recognition network designed to solve the complex structure and numerous amount characteristics of Chinese characters. In this paper, we investigate Chinese characters from the perspective of error correction, which is to diagnose a handwritten character to be right or wrong and provide a feedback on error analysis. For this handwritten Chinese character error correction task, we define a benchmark by unifying both the evaluation metrics and data splits for the first time. Then we design a diagnosis system that includes decomposition, judgement and correction stages. Specifically, a novel tree-structure analysis network (TAN) is proposed to model a Chinese character as a tree layout, which mainly consists of a CNN-based encoder and a tree-structure based decoder. Using the predicted tree layout for judgement, correction operation is performed for the wrongly written characters to do error analysis. The correction stage is composed of three steps: fetch the ideal character, correct the errors and locate the errors. Additionally, we propose a novel bucketing mining strategy to apply triplet loss at radical level to alleviate feature dispersion. Experiments on handwritten character dataset demonstrate that our proposed TAN shows great superiority on all three metrics comparing with other state-of-the-art recognition models. Through quantitative analysis, TAN is proved to capture more accurate spatial position information than regular encoder-decoder models, showing better generalization ability.
引用
收藏
页码:3615 / 3627
页数:13
相关论文
共 14 条
  • [1] Count, decompose and correct: A new approach to handwritten Chinese character error correction
    Hu, Pengfei
    Ma, Jiefeng
    Zhang, Zhenrong
    Du, Jun
    Zhang, Jianshu
    PATTERN RECOGNITION, 2025, 160
  • [2] Deep Matching Network for Handwritten Chinese Character Recognition
    Li, Zhiyuan
    Wu, Qi
    Xiao, Yi
    Jin, Min
    Lu, Huaxiang
    PATTERN RECOGNITION, 2020, 107
  • [3] A sophisticated offline network developed for recognizing handwritten Chinese character efficiently
    Xu, Xingmiao
    Yang, Cihui
    Wang, Lei
    Zhong, Jingting
    Bao, Wuzhida
    Guo, Jianyong
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 100
  • [4] Error handling approach using characterization and correction steps for handwritten document analysis
    Quiniou, Solen
    Cheriet, Mohamed
    Anquetil, Eric
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2012, 15 (02) : 125 - 141
  • [5] Interpretation of Ambiguous Zone in Handwritten Chinese Character Images Using Bayesian Network
    Cao, Zhongsheng
    Su, Zhewen
    Wang, Yuanzhen
    ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 3, PROCEEDINGS, 2009, 5553 : 380 - 389
  • [6] Offline Handwritten Chinese Character Using Convolutional Neural Network: State-of-the-Art Methods
    Zhong, Yingna
    Daud, Kauthar Mohd
    Nor, Ain Najiha Binti Mohamad
    Ikuesan, Richard Adeyemi
    Moorthy, Kohbalan
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2023, 27 (04) : 567 - 575
  • [7] Error handling approach using characterization and correction steps for handwritten document analysis
    Solen Quiniou
    Mohamed Cheriet
    Eric Anquetil
    International Journal on Document Analysis and Recognition (IJDAR), 2012, 15 : 125 - 141
  • [8] Stroke Sequence-Dependent Deep Convolutional Neural Network for Online Handwritten Chinese Character Recognition
    Liu, Xin
    Hu, Baotian
    Chen, Qingcai
    Wu, Xiangping
    You, Jinghan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (11) : 4637 - 4648
  • [9] Local Discriminant Training and Global Optimization for Convolutional Neural Network based Handwritten Chinese Character Recognition
    Zeng, Xiangsheng
    Xiang, Donglai
    Peng, Liangrui
    Liu, Changsong
    Ding, Xiaoqing
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 382 - 387
  • [10] Statistical Analysis on Large Scale Chinese Short Message Corpus and Automatic Short Message Error Correction
    Hu, Rile
    Tang, Yuezhong
    Li, Chen
    Wang, Xia
    PACLIC 22: PROCEEDINGS OF THE 22ND PACIFIC ASIA CONFERENCE ON LANGUAGE, INFORMATION AND COMPUTATION, 2008, : 397 - 403