Characters as graphs: Interpretable handwritten Chinese character recognition via Pyramid Graph Transformer

被引:9
|
作者
Gan, Ji [1 ,2 ]
Chen, Yuyan [1 ]
Hu, Bo [1 ,2 ]
Leng, Jiaxu [1 ,2 ]
Wang, Weiqiang [3 ]
Gao, Xinbo [1 ,2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Chongqing, Peoples R China
[2] Chongqing Inst Brain & Intelligence, Guangyang Bay Lab, Chongqing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Handwritten Chinese character Recognition; Transformer; Graph convolutional network; Pyramid graph; ONLINE; REPRESENTATION; EXTRACTION;
D O I
10.1016/j.patcog.2023.109317
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is meaningful but challenging to teach machines to recognize handwritten Chinese characters. However, conventional approaches typically view handwritten Chinese characters as either static images or tempo-ral trajectories, which may ignore the inherent geometric semantics of characters. Instead, here we first propose to represent handwritten characters as skeleton graphs, explicitly considering the natural charac-teristics of characters (i.e., characters as graphs). Furthermore, we propose a novel Pyramid Graph Trans-former (PyGT) to specifically process the graph-structured characters, which fully integrates the advan-tages of Transformers and graph convolutional networks. Specifically, our PyGT can learn better graph fea-tures through (i) capturing the global information from all nodes with graph attention mechanism and (ii) modelling the explicit local adjacency structures of nodes with graph convolutions. Furthermore, the PyGT learns the multi-resolution features by constructing a progressive shrinking pyramid. Compared with ex-isting approaches, it is more interpretable to recognize characters as geometric graphs. Moreover, the pro-posed method is generic for both online and offline handwritten Chinese character recognition (HCCR), and it also can be feasibly extended to handwritten text recognition. Extensive experiments empirically demonstrate the superiority of PyGT over the prevalent approaches including 2D-CNN, RNN/1D-CNN, and Vision Transformer (ViT) for HCCR. The code is available at https://github.com/ganji15/PyGT-HCCR .& COPY; 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] LW-ViT: The Lightweight Vision Transformer Model Applied in Offline Handwritten Chinese Character Recognition
    Geng, Shiyong
    Zhu, Zongnan
    Wang, Zhida
    Dan, Yongping
    Li, Hengyi
    ELECTRONICS, 2023, 12 (07)
  • [22] An integration approach to handwritten Chinese character recognition system
    郝红卫
    戴汝为
    Science in China(Series E:Technological Sciences), 1998, (01) : 101 - 105
  • [23] Handwritten Chinese characters recognition by greedy matching with geometric constraint
    Hsieh, AJ
    Fan, KC
    Fan, TI
    IMAGE AND VISION COMPUTING, 1996, 14 (02) : 91 - 104
  • [24] A preclassification method for handwritten Chinese character recognition via fuzzy rules and SEART neural net
    Lee, HM
    Lin, CC
    Chen, JM
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 1998, 12 (06) : 743 - 761
  • [25] Handwritten Chinese character recognition using fuzzy image alignment
    Fangyi Li
    Qiang Shen
    Ying Li
    Neil Mac Parthaláin
    Soft Computing, 2016, 20 : 2939 - 2949
  • [26] Improving Offline Handwritten Chinese Character Recognition by Iterative Refinement
    Yang, Xiao
    He, Dafang
    Zhou, Zihan
    Kifer, Daniel
    Giles, C. Lee
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 5 - 10
  • [27] Offline Handwritten Chinese Character Recognition Based on Improved Googlenet
    Min, Feng
    Zhu, Sicheng
    Wang, Yansong
    AIPR 2020: 2020 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION, 2020, : 42 - 46
  • [28] EMBEDDED LARGE-SCALE HANDWRITTEN CHINESE CHARACTER RECOGNITION
    Chherawala, Youssouf
    Dolfing, Hans J. G. A.
    Dixon, Ryan S.
    Bellegarda, Jerome R.
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 8169 - 8173
  • [29] Novel design of neural networks for handwritten Chinese character recognition
    Yip, DHF
    Yu, WWH
    VISION GEOMETRY VII, 1998, 3454 : 324 - 329
  • [30] Writing Style Adversarial Network for Handwritten Chinese Character Recognition
    Liu, Huan
    Lyu, Shujing
    Zhan, Hongjian
    Lu, Yue
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT IV, 2019, 1142 : 66 - 74