Handwriting Trajectory Reconstruction Using Spatial-Temporal Encoder-Decoder Network

被引:0
作者
Wei, Feilong [1 ]
Zhu, Yuanping [1 ]
机构
[1] Tianjin Normal Univ, 393 Binshuixi Rd, Tianjin, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION, PT I | 2021年 / 13019卷
关键词
Handwriting trajectory reconstruction; Full convolutional network; Encoder-decoder network; Deep learning;
D O I
10.1007/978-3-030-88004-0_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Chinese handwriting characters have complex strokes and various writing styles, which makes it difficult to reconstruct handwriting. Aiming at this problem, we propose a handwriting reconstruction method based on a spatial-temporal encoder-decoder network with constrains. Different from other models that generate trajectory coordinates through a fully connected network, the method proposed in this paper outputs heat map sequence. The model is consists of three modules: key point detection module, spatial encoder-decoder module and reconstruction constraint module. The key point detector module and the spatial encoder part of encoder-decoder module are composed of a full convolutional network. The former generates heat maps of all key points which is a branch of the spatial encoder, and the mainly encoding the spatial information of each position on the offline image. The temporal decoder module is composed of a GRU network and an MLP network. Finally, we combine temporal information and reconstruction constraints to generate the final sequence. At each time, the features encoding by the spatial encoder module are combined with the features at the previous time that generate a corresponding heat map. The main contribution of the work of this paper is to propose a method that more suitable for handwriting reconstruction of Chinese handwritten characters. Experimental results show that the CT [6] accuracy of our method has already reached 87.6% on OLHWDB1.1 dataset.
引用
收藏
页码:342 / 354
页数:13
相关论文
共 20 条
  • [1] [Anonymous], EMNLP
  • [2] Bhunia AK, 2018, INT C PATT RECOG, P3639, DOI 10.1109/ICPR.2018.8546093
  • [3] Cao Z.W., 2009, J. Image Graph, V10, P2074
  • [4] OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields
    Cao, Zhe
    Hidalgo, Gines
    Simon, Tomas
    Wei, Shih-En
    Sheikh, Yaser
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) : 172 - 186
  • [5] Schwing AG, 2015, Arxiv, DOI [arXiv:1503.02351, DOI 10.48550/ARXIV.1503.02351]
  • [6] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
  • [7] DeeperCut: A Deeper, Stronger, and Faster Multi-person Pose Estimation Model
    Insafutdinov, Eldar
    Pishchulin, Leonid
    Andres, Bjoern
    Andriluka, Mykhaylo
    Schiele, Bernt
    [J]. COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910 : 34 - 50
  • [8] Lai S., 2021, IEEE T PATTERN ANAL, V8, P99
  • [9] CornerNet: Detecting Objects as Paired Keypoints
    Law, Hei
    Deng, Jia
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (03) : 642 - 656
  • [10] Liangcheng Li, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12370), P85, DOI 10.1007/978-3-030-58595-2_6