Chinese Character Recognition with Augmented Character Profile Matching

被引:13
作者
Zu, Xinyan [1 ]
Yu, Haiyang [1 ]
Li, Bin [1 ]
Xue, Xiangyang [1 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai Key Lab IIP, Shanghai, Peoples R China
来源
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022 | 2022年
基金
中国国家自然科学基金;
关键词
Chinese character recognition; OCR; character profile matching; Chinese character knowledge;
D O I
10.1145/3503161.3547827
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Chinese character recognition (CCR) has drawn continuous research interest due to its wide applications. After decades of study, there still exist several challenges, e.g., different characters with similar appearance and the one-to-many problem. There is no unified solution to the above challenges as previous methods tend to address these problems separately. In this paper, we propose a Chinese character recognition method named Augmented Character Profile Matching (ACPM), which utilizes a collection of character knowledge from three decomposition levels to recognize Chinese characters. Specifically, the feature maps of each character image are utilized as the character-level knowledge. In addition, we introduce a radical-stroke counting module (RSC) to help produce augmented character profiles, including the number of radicals, the number of strokes, and the total length of strokes, which characterize the character more comprehensively. The feature maps of the character image and the outputs of the RSC module are collected to constitute a character profile for selecting the closest candidate character through joint matching. The experimental results show that the proposed method outperforms the state-of-the-art methods on both the ICDAR 2013 and CTW datasets by 0.35% and 2.23%, respectively. Moreover, it also clearly outperforms the compared methods in the zero-shot settings. Code is available at https://github.com/FudanVI/FudanOCR/character-profilematching.
引用
收藏
页码:6094 / 6102
页数:9
相关论文
共 28 条
  • [1] [Anonymous], 2019, ICDARW, DOI DOI 10.1109/ICDARW.2019.40092
  • [2] [Anonymous], 2015, IJCNN
  • [3] Zero-shot Handwritten Chinese Character Recognition with hierarchical decomposition embedding
    Cao, Zhong
    Lu, Jiang
    Cui, Sen
    Zhang, Changshui
    [J]. PATTERN RECOGNITION, 2020, 107
  • [4] Chang Fu, 2006, SACHR, P161
  • [5] Chen JY, 2021, PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, P615
  • [6] Chen JY, 2022, AAAI CONF ARTIF INTE, P285
  • [7] Text Recognition in the Wild: A Survey
    Chen, Xiaoxue
    Jin, Lianwen
    Zhu, Yuanzhi
    Luo, Canjie
    Wang, Tianwei
    [J]. ACM COMPUTING SURVEYS, 2021, 54 (02)
  • [8] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [9] Densely Connected Convolutional Networks
    Huang, Gao
    Liu, Zhuang
    van der Maaten, Laurens
    Weinberger, Kilian Q.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269
  • [10] Jin LW, 2001, COMPUTER SCIENCE AND TECHNOLOGY IN NEW CENTURY, P232