Handwriting Trajectory Reconstruction Using Spatial-Temporal Encoder-Decoder Network

Cited by: 0

Authors:

Wei, Feilong [1]

Zhu, Yuanping [1]

Affiliation:

[1] Tianjin Normal Univ, 393 Binshuixi Rd, Tianjin, Peoples R China

Source:

PATTERN RECOGNITION AND COMPUTER VISION, PT I | 2021 / Vol. 13019

Keywords:

Handwriting trajectory reconstruction; Full convolutional network; Encoder-decoder network; Deep learning;

DOI:

10.1007/978-3-030-88004-0_28

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Chinese handwritten characters have complex strokes and varied writing styles, which makes handwriting trajectory reconstruction difficult. To address this problem, we propose a handwriting reconstruction method based on a spatial-temporal encoder-decoder network with constraints. Unlike other models, which generate trajectory coordinates through a fully connected network, the proposed method outputs a sequence of heat maps. The model consists of three modules: a key point detection module, a spatial encoder-decoder module, and a reconstruction constraint module. The key point detection module and the spatial encoder of the encoder-decoder module are built from a fully convolutional network. The former, which is a branch of the spatial encoder, generates heat maps of all key points; the latter mainly encodes the spatial information of each position in the offline image. The temporal decoder module is composed of a GRU network and an MLP network. At each time step, the features encoded by the spatial encoder module are combined with the features from the previous time step to generate the corresponding heat map. Finally, temporal information and reconstruction constraints are combined to generate the final sequence. The main contribution of this paper is a method that is better suited to handwriting reconstruction of Chinese handwritten characters. Experimental results show that the CT [6] accuracy of our method reaches 87.6% on the OLHWDB1.1 dataset.
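
The abstract states that the model outputs a heat map sequence instead of regressing coordinates directly. As a rough illustration of how such a sequence could be turned back into a trajectory, the sketch below takes the highest-scoring pixel of each heat map as the pen position at that time step. This argmax decoding is an assumption made for illustration only; the record does not say how the authors read coordinates off their heat maps, and the names heatmap_argmax and decode_trajectory are hypothetical.

```python
from typing import List, Tuple

HeatMap = List[List[float]]  # rows of per-pixel scores, indexed [y][x]


def heatmap_argmax(heat_map: HeatMap) -> Tuple[int, int]:
    """Return the (x, y) position of the highest-scoring pixel.

    Assumption: the peak of a heat map marks the pen position.
    """
    best_x, best_y, best_score = 0, 0, float("-inf")
    for y, row in enumerate(heat_map):
        for x, score in enumerate(row):
            if score > best_score:
                best_x, best_y, best_score = x, y, score
    return best_x, best_y


def decode_trajectory(heat_maps: List[HeatMap]) -> List[Tuple[int, int]]:
    """Turn a time-ordered heat map sequence into a list of (x, y) points."""
    return [heatmap_argmax(h) for h in heat_maps]


# Toy input: three 3x3 heat maps whose peaks move along the diagonal.
toy_sequence = [
    [[0.9, 0.1, 0.0], [0.1, 0.0, 0.0], [0.0, 0.0, 0.0]],
    [[0.0, 0.1, 0.0], [0.1, 0.8, 0.1], [0.0, 0.1, 0.0]],
    [[0.0, 0.0, 0.0], [0.0, 0.0, 0.1], [0.0, 0.1, 0.7]],
]
trajectory = decode_trajectory(toy_sequence)
print(trajectory)
```

In this toy case the decoded points follow the diagonal of the grid, one point per heat map, in time order.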

Pages: 342-354

Page count: 13

20 references in total

[1] [Anonymous], EMNLP
[2] Bhunia AK, 2018, INT C PATT RECOG, P3639, DOI 10.1109/ICPR.2018.8546093
[3] Cao Z.W., 2009, J. Image Graph, V10, P2074
[4] Cao, Zhe; Hidalgo, Gines; Simon, Tomas; Wei, Shih-En; Sheikh, Yaser. OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01): 172-186
[5] Schwing AG, 2015, Arxiv, DOI [arXiv:1503.02351, DOI 10.48550/ARXIV.1503.02351]
[6] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[7] Insafutdinov, Eldar; Pishchulin, Leonid; Andres, Bjoern; Andriluka, Mykhaylo; Schiele, Bernt. DeeperCut: A Deeper, Stronger, and Faster Multi-person Pose Estimation Model [J]. COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910: 34-50
[8] Lai S., 2021, IEEE T PATTERN ANAL, V8, P99
[9] Law, Hei; Deng, Jia. CornerNet: Detecting Objects as Paired Keypoints [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (03): 642-656
[10] Liangcheng Li, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12370), P85, DOI 10.1007/978-3-030-58595-2_6