Uncertainty-Aware Semantic Augmentation for Neural Machine Translation

Cited by: 0
Authors
Wei, Xiangpeng [1, 2]
Yu, Heng [3]
Hu, Yue [1, 2]
Weng, Rongxiang [3]
Xing, Luxi [1, 2]
Luo, Weihua [3]
Affiliations
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
[3] Alibaba Grp, Machine Intelligence Technol Lab, Hangzhou, Peoples R China
Source
PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP) | 2020
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
As a sequence-to-sequence generation task, neural machine translation (NMT) naturally contains intrinsic uncertainty, where a single sentence in one language has multiple valid counterparts in the other. However, the dominant methods for NMT only observe one of them from the parallel corpora for the model training but have to deal with adequate variations under the same meaning at inference. This leads to a discrepancy of the data distribution between the training and the inference phases. To address this problem, we propose uncertainty-aware semantic augmentation, which explicitly captures the universal semantic information among multiple semantically-equivalent source sentences and enhances the hidden representations with this information for better translations. Extensive experiments on various translation tasks reveal that our approach significantly outperforms the strong baselines and the existing methods.
Pages: 2724 - 2735
Number of pages: 12
Related papers
50 in total
  • [21] Lexical-Constraint-Aware Neural Machine Translation via Data Augmentation
    Chen, Guanhua
    Chen, Yun
    Wang, Yong
    Li, Victor O. K.
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3587 - 3593
  • [22] Semantic-Aware Deep Neural Attention Network for Machine Translation Detection
    Shi, Yangbin
    Lu, Jun
    Gu, Shuqin
    Wang, Qiang
    Zheng, Xiaolin
    MACHINE TRANSLATION, CCMT 2021, 2021, 1464 : 63 - 76
  • [23] Uncertainty-Aware Point-Cloud Semantic Segmentation for Unstructured Roads
    Liu, Pengfei
    Yu, Guizhen
    Wang, Zhangyu
    Zhou, Bin
    Ming, Ruotong
    Jin, Chunhua
    IEEE SENSORS JOURNAL, 2023, 23 (13) : 15071 - 15080
  • [24] Uncertainty-aware consistency regularization for cross-domain semantic segmentation
    Zhou, Qianyu
    Feng, Zhengyang
    Gu, Qiqi
    Cheng, Guangliang
    Lu, Xuequan
    Shi, Jianping
    Ma, Lizhuang
    Computer Vision and Image Understanding, 2022, 221
  • [25] Uncertainty-aware Pseudo Label Refinery for Domain Adaptive Semantic Segmentation
    Wang, Yuxi
    Peng, Junran
    Zhang, Zhaoxiang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9072 - 9081
  • [26] Uncertainty-Aware Source-Free Domain Adaptive Semantic Segmentation
    Lu, Zhihe
    Li, Da
    Song, Yi-Zhe
    Xiang, Tao
    Hospedales, Timothy M. M.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4664 - 4676
  • [27] Uncertainty-aware consistency regularization for cross-domain semantic segmentation
    Zhou, Qianyu
    Feng, Zhengyang
    Gu, Qiqi
    Cheng, Guangliang
    Lu, Xuequan
    Shi, Jianping
    Ma, Lizhuang
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 221
  • [28] Uncertainty-aware estimation of population abundance using machine learning
    Bastiaan J. Boom
    Emma Beauxis-Aussalet
    Lynda Hardman
    Robert B. Fisher
    Multimedia Systems, 2016, 22 : 737 - 749
  • [29] Uncertainty-aware estimation of population abundance using machine learning
    Boom, Bastiaan J.
    Beauxis-Aussalet, Emma
    Hardman, Lynda
    Fisher, Robert B.
    MULTIMEDIA SYSTEMS, 2016, 22 (06) : 737 - 749
  • [30] Importance-Aware Data Augmentation for Document-Level Neural Machine Translation
    Wu, Minghao
    Wang, Yufei
    Foster, George
    Qiu, Lizhen
    Haffari, Gholamreza
    PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 740 - 752