Improving semi-autoregressive machine translation with the guidance of syntactic dependency parsing structure

被引:0
|
作者
Chen, Xinran [1 ]
Duan, Sufeng [1 ]
Liu, Gongshen [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Non-autoregressive; Machine translation; Syntactic dependency parsing;
D O I
10.1016/j.neucom.2024.128828
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The advent of non-autoregressive machine translation (NAT) accelerates the decoding superior to autoregressive machine translation (AT) significantly, while bringing about a performance decrease. Semi-autoregressive neural machine translation (SAT), as a compromise, enjoys the merits of both autoregressive and nonautoregressive decoding. However, current SAT methods face the challenges of information-limited initialization and rigorous termination. This paper develops a layer-and-length-based syntactic labeling method and introduces a syntactic dependency parsing structure-guided two-stage semi-autoregressive translation (SDPSAT) structure, which addresses the above challenges with a syntax-based initialization and termination. Additionally, we also present a Mixed Training strategy to shrink exposure bias. Experiments on seven widelyused datasets reveal that our SDPSAT surpasses traditional SAT models with reduced word repetition and achieves competitive results with the AT baseline at a 2x similar to 3x speedup.
引用
收藏
页数:10
相关论文
共 37 条
  • [21] Enriching Non-Autoregressive Transformer with Syntactic and Semantic Structures for Neural Machine Translation
    Liu, Ye
    Wan, Yao
    Zhang, Jian-Guo
    Zhao, Wenting
    Yu, Philip S.
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1235 - 1244
  • [22] Detecting Grammatical Errors in Machine Translation Output Using Dependency Parsing and Treebank Querying
    Tezcan, Arda
    Hoste, Veronique
    Macken, Lieve
    BALTIC JOURNAL OF MODERN COMPUTING, 2016, 4 (02): : 203 - 217
  • [23] Improving Non-autoregressive Neural Machine Translation with Monolingual Data
    Zhou, Jiawei
    Keung, Phillip
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 1893 - 1898
  • [24] Top-down Tree Structured Decoding with Syntactic Connections for Neural Machine Translation and Parsing
    Gu, Jetic
    Shavarani, Hassan S.
    Sarkar, Anoop
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 401 - 413
  • [25] Expected dependency pair match: predicting translation quality with expected syntactic structure
    Kahn, Jeremy G.
    Snover, Matthew
    Ostendorf, Mari
    MACHINE TRANSLATION, 2009, 23 (2-3) : 169 - 179
  • [26] Improving Non-autoregressive Machine Translation with Error Exposure and Consistency Regularization
    Chen, Xinran
    Duan, Sufeng
    Liu, Gongshen
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT III, NLPCC 2024, 2025, 15361 : 240 - 252
  • [27] Improving Neural Machine Translation by Transferring Knowledge from Syntactic Constituent Alignment Learning
    Su, Chao
    Huang, Heyan
    Shi, Shumin
    Jian, Ping
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (05)
  • [28] Japanese Argument Reordering Based on Dependency Structure for Statistical Machine Translation
    Goh, Chooi-Ling
    Watanabe, Taro
    Sumita, Eiichiro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (06): : 1668 - 1675
  • [29] Improving Semantic Parsing with Enriched Synchronous Context-Free Grammars in Statistical Machine Translation
    Li, Junhui
    Zhu, Muhua
    Lu, Wei
    Zhou, Guodong
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2016, 16 (01)
  • [30] Improving tree-based neural machine translation with dynamic lexicalized dependency encoding
    Yang, Baosong
    Wong, Derek F.
    Chao, Lidia S.
    Zhang, Min
    KNOWLEDGE-BASED SYSTEMS, 2020, 188