Characters as graphs: Interpretable handwritten Chinese character recognition via Pyramid Graph Transformer

Cited by: 9
Authors
Gan, Ji [1, 2]
Chen, Yuyan [1]
Hu, Bo [1, 2]
Leng, Jiaxu [1, 2]
Wang, Weiqiang [3]
Gao, Xinbo [1, 2]
Affiliations
[1] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Chongqing, Peoples R China
[2] Chongqing Inst Brain & Intelligence, Guangyang Bay Lab, Chongqing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Handwritten Chinese character recognition; Transformer; Graph convolutional network; Pyramid graph; ONLINE; REPRESENTATION; EXTRACTION
DOI
10.1016/j.patcog.2023.109317
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
It is meaningful but challenging to teach machines to recognize handwritten Chinese characters. However, conventional approaches typically view handwritten Chinese characters as either static images or temporal trajectories, which may ignore the inherent geometric semantics of characters. Instead, here we first propose to represent handwritten characters as skeleton graphs, explicitly considering the natural characteristics of characters (i.e., characters as graphs). Furthermore, we propose a novel Pyramid Graph Transformer (PyGT) to specifically process the graph-structured characters, which fully integrates the advantages of Transformers and graph convolutional networks. Specifically, our PyGT can learn better graph features through (i) capturing the global information from all nodes with graph attention mechanism and (ii) modelling the explicit local adjacency structures of nodes with graph convolutions. Furthermore, the PyGT learns the multi-resolution features by constructing a progressive shrinking pyramid. Compared with existing approaches, it is more interpretable to recognize characters as geometric graphs. Moreover, the proposed method is generic for both online and offline handwritten Chinese character recognition (HCCR), and it also can be feasibly extended to handwritten text recognition. Extensive experiments empirically demonstrate the superiority of PyGT over the prevalent approaches including 2D-CNN, RNN/1D-CNN, and Vision Transformer (ViT) for HCCR. The code is available at https://github.com/ganji15/PyGT-HCCR. © 2023 Elsevier Ltd. All rights reserved.
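The abstract names three ingredients: global attention over all nodes, graph convolution over explicit adjacency, and a progressively shrinking pyramid. The following is only a rough, weight-free sketch of how those pieces could fit together on a toy skeleton graph; it is not the authors' implementation (see the repository linked in the abstract for that). All function names, the additive fusion of the two branches, and the pair-averaging pooling rule are assumptions made purely for illustration.

```python
import math

def matmul(a, b):
    """Plain matrix product for lists of lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def normalize_adjacency(edges, n):
    """Adjacency with self-loops, row-normalized so each row sums to 1."""
    adj = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for i, j in edges:
        adj[i][j] = adj[j][i] = 1.0
    return [[v / sum(row) for v in row] for row in adj]

def graph_conv(edges, feats):
    """Local branch: average each node's features with its neighbours'."""
    return matmul(normalize_adjacency(edges, len(feats)), feats)

def softmax(row):
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def global_attention(feats):
    """Global branch: scaled dot-product attention across all nodes.
    Learned query/key/value projections are omitted (assumption)."""
    d = len(feats[0])
    scores = [[sum(q * k for q, k in zip(fi, fj)) / math.sqrt(d) for fj in feats]
              for fi in feats]
    return matmul([softmax(r) for r in scores], feats)

def hybrid_layer(edges, feats):
    """Hypothetical fusion: sum the local and global branches."""
    local, glob = graph_conv(edges, feats), global_attention(feats)
    return [[l + g for l, g in zip(lr, gr)] for lr, gr in zip(local, glob)]

def shrink(edges, feats):
    """Hypothetical pyramid step: merge consecutive node pairs by averaging
    and remap edges onto the coarser graph, dropping self-edges."""
    groups = [feats[i:i + 2] for i in range(0, len(feats), 2)]
    pooled = [[sum(col) / len(g) for col in zip(*g)] for g in groups]
    coarse = sorted({(min(i // 2, j // 2), max(i // 2, j // 2))
                     for i, j in edges if i // 2 != j // 2})
    return coarse, pooled

# Toy skeleton: four points along a stroke, features are (x, y) coordinates.
edges = [(0, 1), (1, 2), (2, 3)]
feats = [[0.0, 0.0], [1.0, 0.0], [1.0, 1.0], [0.0, 1.0]]

hidden = hybrid_layer(edges, feats)           # stage 1: 4 nodes
coarse_edges, pooled = shrink(edges, hidden)  # pyramid step: 2 nodes
stage2 = hybrid_layer(coarse_edges, pooled)   # stage 2 on the coarser graph
```

Each stage keeps the feature width and halves the node count, which is one simple way to obtain multi-resolution graph features; a real model would add learned projections, nonlinearities, and a classification head.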
Pages: 13