Handwriting Trajectory Recovery using End-to-End Deep Encoder-Decoder Network

被引:0
|
作者
Bhunia, Ayan Kumar [1 ]
Bhowmick, Abir [1 ]
Bhunia, Ankan Kumar [2 ]
Konwer, Aishik [1 ]
Banerjee, Prithaj [3 ]
Roy, Partha Pratim [4 ]
Pal, Umapada [5 ]
机构
[1] Inst Engn & Management, Dept ECE, Kolkata, India
[2] Jadavpur Univ, Dept EE, Kolkata, India
[3] Inst Engn & Management, Dept CSE, Kolkata, India
[4] Indian Inst Technol Roorkee, Dept CSE, Roorkee, Uttar Pradesh, India
[5] Indian Stat Inst, CVPR Unit, Kolkata, India
关键词
Handwriting Trajectory Recovery; Encoder-Decoder Network; Sequence to Sequence Model; Deep Learning; PEN TRAJECTORIES; NEURAL-NETWORK; RECOGNITION; INFORMATION; ORDER;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we introduce a novel technique to recover the pen trajectory of offline characters which is a crucial step for handwritten character recognition. Generally, online acquisition approach has more advantage than its offline counterpart as the online technique keeps track of the pen movement. Hence, pen tip trajectory retrieval from offline text can bridge the gap between online and offline methods. Our proposed framework employs sequence to sequence model which consists of an encoder-decoder LSTM module. The proposed encoder module consists of Convolutional LSTM network, which takes an offline character image as the input and encodes the feature sequence to a hidden representation. The output of the encoder is fed to a decoder LSTM and we get the successive coordinate points from every time step of the decoder LSTM. Although the sequence to sequence model is a popular paradigm in various computer vision and language translation tasks, the main contribution of our work lies in designing an end-to-end network for a decade old popular problem in document image analysis community. Tamil, Telugu and Devanagari characters of LIPI Toolkit dataset are used for our experiments. Our proposed method has achieved superior performance compared to the other conventional approaches.
引用
收藏
页码:3639 / 3644
页数:6
相关论文
共 50 条
  • [41] Deep Encoder-Decoder Neural Network Architectures for Graph Output Signals
    Rey, Samuel
    Tenorio, Victor
    Rozada, Sergio
    Martino, Luca
    Marques, Antonio G.
    CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 225 - 229
  • [42] Automated tongue segmentation using deep encoder-decoder model
    Kusakunniran, Worapan
    Borwarnginn, Punyanuch
    Imaromkul, Thanandon
    Aukkapinyo, Kittinun
    Thongkanchorn, Kittikhun
    Wattanadhirach, Disathon
    Mongkolluksamee, Sophon
    Thammasudjarit, Ratchainant
    Ritthipravat, Panrasee
    Tuakta, Pimchanok
    Benjapornlert, Paitoon
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (24) : 37661 - 37686
  • [43] EEG Channel Interpolation Using Deep Encoder-decoder Networks
    Saba-Sadiya, Sari
    Alhanai, Tuka
    Liu, Taosheng
    Ghassemi, Mohammad M.
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2432 - 2439
  • [44] An encoder-decoder deep neural network for binary segmentation of seismic facies
    Lima, Gefersom
    Zeiser, Felipe Andre
    Da Silveira, Ariane
    Rigo, Sandro
    Ramos, Gabriel de Oliveira
    COMPUTERS & GEOSCIENCES, 2024, 183
  • [45] Deep residual inception encoder-decoder network for amyloid PET harmonization
    Shah, Jay
    Gao, Fei
    Li, Baoxin
    Ghisays, Valentina
    Luo, Ji
    Chen, Yinghua
    Lee, Wendy
    Zhou, Yuxiang
    Benzinger, Tammie L. S.
    Reiman, Eric M.
    Chen, Kewei
    Su, Yi
    Wu, Teresa
    ALZHEIMERS & DEMENTIA, 2022, 18 (12) : 2448 - 2457
  • [46] Inferring contextual preferences using deep encoder-decoder learners
    Unger, Moshe
    Shapira, Bracha
    Rokach, Lior
    Livne, Amit
    NEW REVIEW OF HYPERMEDIA AND MULTIMEDIA, 2018, 24 (03) : 262 - 290
  • [47] Automated tongue segmentation using deep encoder-decoder model
    Worapan Kusakunniran
    Punyanuch Borwarnginn
    Thanandon Imaromkul
    Kittinun Aukkapinyo
    Kittikhun Thongkanchorn
    Disathon Wattanadhirach
    Sophon Mongkolluksamee
    Ratchainant Thammasudjarit
    Panrasee Ritthipravat
    Pimchanok Tuakta
    Paitoon Benjapornlert
    Multimedia Tools and Applications, 2023, 82 : 37661 - 37686
  • [48] Deep Residual Inception Encoder-Decoder Network for Medical Imaging Synthesis
    Gao, Fei
    Wu, Teresa
    Chu, Xianghua
    Yoon, Hyunsoo
    Xu, Yanzhe
    Patel, Bhavika
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (01) : 39 - 49
  • [49] Iterative Deep Convolutional Encoder-Decoder Network for Medical Image Segmentation
    Kim, Jung Uk
    Kim, Hak Gu
    Ro, Yong Man
    2017 39TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2017, : 685 - 688
  • [50] Deep capsule encoder-decoder network for surrogate modeling and uncertainty quantification
    Thakur, Akshay
    Chakraborty, Souvik
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING, 2023, 124 (12) : 2783 - 2800