Characters as graphs: Interpretable handwritten Chinese character recognition via Pyramid Graph Transformer

被引:9
|
作者
Gan, Ji [1 ,2 ]
Chen, Yuyan [1 ]
Hu, Bo [1 ,2 ]
Leng, Jiaxu [1 ,2 ]
Wang, Weiqiang [3 ]
Gao, Xinbo [1 ,2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Chongqing, Peoples R China
[2] Chongqing Inst Brain & Intelligence, Guangyang Bay Lab, Chongqing, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Handwritten Chinese character Recognition; Transformer; Graph convolutional network; Pyramid graph; ONLINE; REPRESENTATION; EXTRACTION;
D O I
10.1016/j.patcog.2023.109317
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It is meaningful but challenging to teach machines to recognize handwritten Chinese characters. However, conventional approaches typically view handwritten Chinese characters as either static images or tempo-ral trajectories, which may ignore the inherent geometric semantics of characters. Instead, here we first propose to represent handwritten characters as skeleton graphs, explicitly considering the natural charac-teristics of characters (i.e., characters as graphs). Furthermore, we propose a novel Pyramid Graph Trans-former (PyGT) to specifically process the graph-structured characters, which fully integrates the advan-tages of Transformers and graph convolutional networks. Specifically, our PyGT can learn better graph fea-tures through (i) capturing the global information from all nodes with graph attention mechanism and (ii) modelling the explicit local adjacency structures of nodes with graph convolutions. Furthermore, the PyGT learns the multi-resolution features by constructing a progressive shrinking pyramid. Compared with ex-isting approaches, it is more interpretable to recognize characters as geometric graphs. Moreover, the pro-posed method is generic for both online and offline handwritten Chinese character recognition (HCCR), and it also can be feasibly extended to handwritten text recognition. Extensive experiments empirically demonstrate the superiority of PyGT over the prevalent approaches including 2D-CNN, RNN/1D-CNN, and Vision Transformer (ViT) for HCCR. The code is available at https://github.com/ganji15/PyGT-HCCR .& COPY; 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] HMMRF: A stochastic model for offline handwritten Chinese character recognition
    Wang, Q
    Zhao, RC
    Chi, ZR
    Feng, DD
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 1475 - 1478
  • [32] Offline handwritten mathematical expression recognition with graph encoder and transformer decoder
    Tang, Jia-Man
    Guo, Hong-Yu
    Wu, Jin-Wen
    Yin, Fei
    Huang, Lin-Lin
    PATTERN RECOGNITION, 2024, 148
  • [33] A Handwritten Chinese Character Recognition Method Combining Sub-Structure Recognition
    Zhu, Yuanping
    An, Xingle
    Zhang, Kuang
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 518 - 523
  • [34] Building fast and compact convolutional neural networks for offline handwritten Chinese character recognition
    Xiao, Xuefeng
    Jin, Lianwen
    Yang, Yafeng
    Yang, Weixin
    Sun, Jun
    Chang, Tianhai
    PATTERN RECOGNITION, 2017, 72 : 72 - 81
  • [35] Radical aggregation network for few-shot offline handwritten Chinese character recognition
    Wang, Tianwei
    Xie, Zecheng
    Li, Zhe
    Jin, Lianwen
    Chen, Xiangle
    PATTERN RECOGNITION LETTERS, 2019, 125 : 821 - 827
  • [36] A high-performance CNN method for offline handwritten Chinese character recognition and visualization
    Melnyk, Pavlo
    You, Zhiqiang
    Li, Keqin
    SOFT COMPUTING, 2020, 24 (11) : 7977 - 7987
  • [37] A Novel Multilevel Stacked SqueezeNet Model for Handwritten Chinese Character Recognition
    Du, Yuankun
    Liu, Fengping
    Liu, Zhilong
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2023, 20 (04) : 1771 - 1795
  • [38] Adaptively Transfer Category-Classifier for Handwritten Chinese Character Recognition
    Zhu, Yongchun
    Zhuang, Fuzhen
    Yang, Jingyuan
    Yang, Xi
    He, Qing
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT I, 2019, 11439 : 110 - 122
  • [39] A New Linguistic Decoding Method for Online Handwritten Chinese Character Recognition
    徐志明
    王晓龙
    Journal of Computer Science and Technology, 2000, (06) : 597 - 604
  • [40] Fast self-generation voting for handwritten Chinese character recognition
    Yunxue Shao
    Chunheng Wang
    Baihua Xiao
    International Journal on Document Analysis and Recognition (IJDAR), 2013, 16 : 413 - 424