Improving Recurrent Neural Networks with Predictive Propagation for Sequence Labelling

被引:1
|
作者
Tran, Son N. [1 ]
Zhang, Qing [1 ]
Nguyen, Anthony [1 ]
Vu, Xuan-Son [2 ]
Ngo, Son [3 ]
机构
[1] CSIRO, Australian E Hlth Res Ctr, Brisbane, Qld 4026, Australia
[2] Umea Univ, Dept Comp Sci, Umea, Sweden
[3] FPT Univ, Dept Comp Sci, Hanoi, Vietnam
来源
NEURAL INFORMATION PROCESSING (ICONIP 2018), PT I | 2018年 / 11301卷
关键词
Natural language processing; Recurrent neural networks; Sequence labelling;
D O I
10.1007/978-3-030-04167-0_41
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recurrent neural networks (RNNs) is a useful tool for sequence labelling tasks in natural language processing. Although in practice RNNs suffer a problem of vanishing/exploding gradient, their compactness still offers efficiency and make them less prone to overfitting. In this paper we show that by propagating the prediction of previous labels we can improve the performance of RNNs while keeping the number of parameters in RNNs unchanged and adding only one more step for inference. As a result, the models are still more compact and efficient than other models with complex memory gates. In the experiment, we evaluate the idea on optical character recognition and Chunking which achieve promising results.
引用
收藏
页码:452 / 462
页数:11
相关论文
共 50 条
  • [1] Improving Sequence Modeling Ability of Recurrent Neural Networks via Sememes
    Qin, Yujia
    Qi, Fanchao
    Ouyang, Sicong
    Liu, Zhiyuan
    Yang, Cheng
    Wang, Yasheng
    Liu, Qun
    Sun, Maosong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2364 - 2373
  • [2] Target Propagation in Recurrent Neural Networks
    Manchev, Nikolay
    Spratling, Michael
    JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [3] Improving Neural Sequence Labelling using Additional Linguistic Information
    Ahmed, Mahtab
    Samee, Muhammad Rifayat
    Mercer, Robert E.
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 650 - 657
  • [4] Sequence Metric Learning as Synchronization of Recurrent Neural Networks
    Compagnon, Paul
    Lefebvre, Gregoire
    Duffner, Stefan
    Garcia, Christophe
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [5] Partially recurrent neural networks for production of temporal sequence
    Araujo, AFR
    D'Arbo, H
    IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE, 1998, : 474 - 479
  • [6] SEQUENCE-DISCRIMINATIVE TRAINING OF RECURRENT NEURAL NETWORKS
    Voigtlaender, Paul
    Doetsch, Patrick
    Wiesler, Simon
    Schlueter, Ralf
    Ney, Hermann
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2100 - 2104
  • [7] Embedding Graph Convolutional Networks in Recurrent Neural Networks for Predictive Monitoring
    Rama-Maneiro, Efren
    Vidal, Juan C.
    Lama, Manuel
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (01) : 137 - 151
  • [8] PREDICTIVE CONTROL USING RECURRENT NEURAL NETWORKS FOR INDUSTRIAL PROCESSES
    Lu, Chi-Huang
    Tsai, Ching-Chih
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2009, 32 (02) : 277 - 283
  • [9] Deep Learning for Load Forecasting: Sequence to Sequence Recurrent Neural Networks With Attention
    Sehovac, Ljubisa
    Grolinger, Katarina
    IEEE ACCESS, 2020, 8 : 36411 - 36426
  • [10] Trajectory Prediction with Recurrent Neural Networks for Predictive Resource Allocation
    Zhang, Wenjing
    Liu, Yuan
    Liu, Tingting
    Yang, Chenyang
    PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 634 - 639