LayerGLAT: A Flexible Non-autoregressive Transformer for Single-Pass and Multi-pass Prediction

Citations: 0
Authors
Li, Shijie [1]
Unanue, Inigo Jauregi [1, 2]
Piccardi, Massimo [1]
Affiliations
[1] University of Technology Sydney, Ultimo, Australia
[2] RoZetta Technology, Sydney, NSW, Australia
Source
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT II, ECML PKDD 2024 | 2024, Vol. 14942
Keywords
Non-autoregressive Transformer; Multi-pass Prediction; Layer-wise Training
DOI
10.1007/978-3-031-70344-7_14
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Non-autoregressive transformers (NATs) have made substantial progress in recent years, improving their predictive accuracy while achieving speed-ups of an order of magnitude over their conventional autoregressive counterparts. However, the performance gap between NATs and autoregressive transformers (ATs) remains significant, which has prompted the development of "iterative" NATs that predict over multiple passes, trading off accuracy against speed. Despite the clear benefits of both fully non-autoregressive and iterative NATs, research seems to have overlooked the possibility of integrating them effectively to deliver strong single-pass and multi-pass prediction while retaining the highest possible speed-up. To bridge this gap, this paper introduces LayerGLAT, a hybrid model that combines the strengths of both fully non-autoregressive and iterative NATs and achieves competitive performance in both single-pass and iterative prediction. The key idea is a layer-wise training strategy that emulates the generating conditions of both single-pass and multi-pass generation, leading to strong performance in both cases. Experimental results on three machine translation datasets show that the proposed model outperforms leading NATs in accuracy and speed and approaches the accuracy of ATs. The code is publicly available at https://github.com/lsj72123/layer-GLAT.
Pages: 233-249
Number of pages: 17