Handwriting Trajectory Recovery using End-to-End Deep Encoder-Decoder Network

被引:0
|
作者
Bhunia, Ayan Kumar [1 ]
Bhowmick, Abir [1 ]
Bhunia, Ankan Kumar [2 ]
Konwer, Aishik [1 ]
Banerjee, Prithaj [3 ]
Roy, Partha Pratim [4 ]
Pal, Umapada [5 ]
机构
[1] Inst Engn & Management, Dept ECE, Kolkata, India
[2] Jadavpur Univ, Dept EE, Kolkata, India
[3] Inst Engn & Management, Dept CSE, Kolkata, India
[4] Indian Inst Technol Roorkee, Dept CSE, Roorkee, Uttar Pradesh, India
[5] Indian Stat Inst, CVPR Unit, Kolkata, India
关键词
Handwriting Trajectory Recovery; Encoder-Decoder Network; Sequence to Sequence Model; Deep Learning; PEN TRAJECTORIES; NEURAL-NETWORK; RECOGNITION; INFORMATION; ORDER;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we introduce a novel technique to recover the pen trajectory of offline characters which is a crucial step for handwritten character recognition. Generally, online acquisition approach has more advantage than its offline counterpart as the online technique keeps track of the pen movement. Hence, pen tip trajectory retrieval from offline text can bridge the gap between online and offline methods. Our proposed framework employs sequence to sequence model which consists of an encoder-decoder LSTM module. The proposed encoder module consists of Convolutional LSTM network, which takes an offline character image as the input and encodes the feature sequence to a hidden representation. The output of the encoder is fed to a decoder LSTM and we get the successive coordinate points from every time step of the decoder LSTM. Although the sequence to sequence model is a popular paradigm in various computer vision and language translation tasks, the main contribution of our work lies in designing an end-to-end network for a decade old popular problem in document image analysis community. Tamil, Telugu and Devanagari characters of LIPI Toolkit dataset are used for our experiments. Our proposed method has achieved superior performance compared to the other conventional approaches.
引用
收藏
页码:3639 / 3644
页数:6
相关论文
共 50 条
  • [1] End-to-End Deep Background Subtraction based on Encoder-Decoder Network
    Le, Duy H.
    Pham, Tuan, V
    PROCEEDINGS OF 2019 6TH NATIONAL FOUNDATION FOR SCIENCE AND TECHNOLOGY DEVELOPMENT (NAFOSTED) CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS), 2019, : 381 - 386
  • [2] Encoder-Decoder Based Attractors for End-to-End Neural Diarization
    Horiguchi, Shota
    Fujita, Yusuke
    Watanabe, Shinji
    Xue, Yawen
    Garcia, Paola
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1493 - 1507
  • [3] GPS Trajectory Completion Using End-to-End Bidirectional Convolutional Recurrent Encoder-Decoder Architecture with Attention Mechanism
    Nawaz, Asif
    Huang, Zhiqiu
    Wang, Senzhang
    Akbar, Azeem
    AlSalman, Hussain
    Gumaei, Abdu
    SENSORS, 2020, 20 (18) : 1 - 16
  • [4] Handwriting Trajectory Reconstruction Using Spatial-Temporal Encoder-Decoder Network
    Wei, Feilong
    Zhu, Yuanping
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, 2021, 13019 : 342 - 354
  • [5] End-to-End Trained CNN Encoder-Decoder Networks for Image Steganography
    Rehman, Atique ur
    Rahim, Rafia
    Nadeem, Shahroz
    ul Hussain, Sibt
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 723 - 729
  • [6] End-to-end trained encoder-decoder convolutional neural network for fetal electrocardiogram signal denoising
    Fotiadou, Eleni
    Konopczynski, Tomasz
    Hesser, Juergen
    Vullings, Rik
    PHYSIOLOGICAL MEASUREMENT, 2020, 41 (01)
  • [7] ON TRAINING THE RECURRENT NEURAL NETWORK ENCODER-DECODER FOR LARGE VOCABULARY END-TO-END SPEECH RECOGNITION
    Lu, Liang
    Zhang, Xingxing
    Renals, Steve
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5060 - 5064
  • [8] An end-to-end RoI-based encoder-decoder for fetal ECG recovery and QRS complex detection
    Remus, Julia C.
    da Silveira, Thiago L. T.
    2024 IEEE INTERNATIONAL SYMPOSIUM ON MEDICAL MEASUREMENTS AND APPLICATIONS, MEMEA 2024, 2024,
  • [9] SEQUENCE TRAINING OF ENCODER-DECODER MODEL USING POLICY GRADIENT FOR END-TO-END SPEECH RECOGNITION
    Karita, Shigeki
    Ogawa, Atsunori
    Delcroix, Marc
    Nakatani, Tomohiro
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5839 - 5843
  • [10] Attention-Based Encoder-Decoder End-to-End Neural Diarization With Embedding Enhancer
    Chen, Zhengyang
    Han, Bing
    Wang, Shuai
    Qian, Yanmin
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1636 - 1649