Handwriting Trajectory Recovery using End-to-End Deep Encoder-Decoder Network

被引：0

作者：

Bhunia, Ayan Kumar ^{[1
]}

Bhowmick, Abir ^{[1
]}

Bhunia, Ankan Kumar ^{[2
]}

Konwer, Aishik ^{[1
]}

Banerjee, Prithaj ^{[3
]}

Roy, Partha Pratim ^{[4
]}

Pal, Umapada ^{[5
]}

机构：

[1] Inst Engn & Management, Dept ECE, Kolkata, India

[2] Jadavpur Univ, Dept EE, Kolkata, India

[3] Inst Engn & Management, Dept CSE, Kolkata, India

[4] Indian Inst Technol Roorkee, Dept CSE, Roorkee, Uttar Pradesh, India

[5] Indian Stat Inst, CVPR Unit, Kolkata, India

来源：

2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2018年

关键词：

Handwriting Trajectory Recovery; Encoder-Decoder Network; Sequence to Sequence Model; Deep Learning; PEN TRAJECTORIES; NEURAL-NETWORK; RECOGNITION; INFORMATION; ORDER;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we introduce a novel technique to recover the pen trajectory of offline characters which is a crucial step for handwritten character recognition. Generally, online acquisition approach has more advantage than its offline counterpart as the online technique keeps track of the pen movement. Hence, pen tip trajectory retrieval from offline text can bridge the gap between online and offline methods. Our proposed framework employs sequence to sequence model which consists of an encoder-decoder LSTM module. The proposed encoder module consists of Convolutional LSTM network, which takes an offline character image as the input and encodes the feature sequence to a hidden representation. The output of the encoder is fed to a decoder LSTM and we get the successive coordinate points from every time step of the decoder LSTM. Although the sequence to sequence model is a popular paradigm in various computer vision and language translation tasks, the main contribution of our work lies in designing an end-to-end network for a decade old popular problem in document image analysis community. Tamil, Telugu and Devanagari characters of LIPI Toolkit dataset are used for our experiments. Our proposed method has achieved superior performance compared to the other conventional approaches.

引用

页码：3639 / 3644

页数：6

共 50 条

[1] End-to-End Deep Background Subtraction based on Encoder-Decoder Network
Le, Duy H.
Pham, Tuan, V
PROCEEDINGS OF 2019 6TH NATIONAL FOUNDATION FOR SCIENCE AND TECHNOLOGY DEVELOPMENT (NAFOSTED) CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS), 2019, : 381 - 386
[2] Encoder-Decoder Based Attractors for End-to-End Neural Diarization
Horiguchi, Shota
Fujita, Yusuke
Watanabe, Shinji
Xue, Yawen
Garcia, Paola
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1493 - 1507
[3] GPS Trajectory Completion Using End-to-End Bidirectional Convolutional Recurrent Encoder-Decoder Architecture with Attention Mechanism
Nawaz, Asif
Huang, Zhiqiu
Wang, Senzhang
Akbar, Azeem
AlSalman, Hussain
Gumaei, Abdu
SENSORS, 2020, 20 (18) : 1 - 16
[4] Handwriting Trajectory Reconstruction Using Spatial-Temporal Encoder-Decoder Network
Wei, Feilong
Zhu, Yuanping
PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 342 - 354
[5] End-to-End Trained CNN Encoder-Decoder Networks for Image Steganography
Rehman, Atique ur
Rahim, Rafia
Nadeem, Shahroz
ul Hussain, Sibt
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 723 - 729
[6] End-to-end trained encoder-decoder convolutional neural network for fetal electrocardiogram signal denoising
Fotiadou, Eleni
Konopczynski, Tomasz
Hesser, Juergen
Vullings, Rik
PHYSIOLOGICAL MEASUREMENT, 2020, 41 (01)
[7] ON TRAINING THE RECURRENT NEURAL NETWORK ENCODER-DECODER FOR LARGE VOCABULARY END-TO-END SPEECH RECOGNITION
Lu, Liang
Zhang, Xingxing
Renals, Steve
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5060 - 5064
[8] An end-to-end RoI-based encoder-decoder for fetal ECG recovery and QRS complex detection
Remus, Julia C.
da Silveira, Thiago L. T.
2024 IEEE INTERNATIONAL SYMPOSIUM ON MEDICAL MEASUREMENTS AND APPLICATIONS, MEMEA 2024, 2024,
[9] SEQUENCE TRAINING OF ENCODER-DECODER MODEL USING POLICY GRADIENT FOR END-TO-END SPEECH RECOGNITION
Karita, Shigeki
Ogawa, Atsunori
Delcroix, Marc
Nakatani, Tomohiro
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5839 - 5843
[10] Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer
Chen, Zhengyang
Han, Bing
Wang, Shuai
Qian, Yanmin
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1636 - 1649

← 1 2 3 4 5 →