Uncertainty-Aware Semantic Augmentation for Neural Machine Translation

被引:0
作者
Wei, Xiangpeng [1 ,2 ]
Yu, Heng [3 ]
Hu, Yue [1 ,2 ]
Weng, Rongxiang [3 ]
Xing, Luxi [1 ,2 ]
Luo, Weihua [3 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing, Peoples R China
[3] Alibaba Grp, Machine Intelligence Technol Lab, Hangzhou, Peoples R China
来源
PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP) | 2020年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a sequence-to-sequence generation task, neural machine translation (NMT) naturally contains intrinsic uncertainty, where a single sentence in one language has multiple valid counterparts in the other. However, the dominant methods for NMT only observe one of them from the parallel corpora for the model training but have to deal with adequate variations under the same meaning at inference. This leads to a discrepancy of the data distribution between the training and the inference phases. To address this problem, we propose uncertainty-aware semantic augmentation, which explicitly captures the universal semantic information among multiple semantically-equivalent source sentences and enhances the hidden representations with this information for better translations. Extensive experiments on various translation tasks reveal that our approach significantly outperforms the strong baselines and the existing methods.
引用
收藏
页码:2724 / 2735
页数:12
相关论文
共 50 条
[21]   NPCL: Neural Processes for Uncertainty-Aware Continual Learning [J].
Jha, Saurav ;
Gong, Dong ;
Zhao, He ;
Yao, Lina .
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[22]   An Optimized Uncertainty-Aware Training Framework for Neural Networks [J].
Tabarisaadi, Pegah ;
Khosravi, Abbas ;
Nahavandi, Saeid ;
Shafie-Khah, Miadreza ;
Catalao, Joao P. S. .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (05) :6928-6935
[23]   Lexical-Constraint-Aware Neural Machine Translation via Data Augmentation [J].
Chen, Guanhua ;
Chen, Yun ;
Wang, Yong ;
Li, Victor O. K. .
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, :3587-3593
[24]   Semantic-Aware Deep Neural Attention Network for Machine Translation Detection [J].
Shi, Yangbin ;
Lu, Jun ;
Gu, Shuqin ;
Wang, Qiang ;
Zheng, Xiaolin .
MACHINE TRANSLATION, CCMT 2021, 2021, 1464 :63-76
[25]   Uncertainty-Aware Point-Cloud Semantic Segmentation for Unstructured Roads [J].
Liu, Pengfei ;
Yu, Guizhen ;
Wang, Zhangyu ;
Zhou, Bin ;
Ming, Ruotong ;
Jin, Chunhua .
IEEE SENSORS JOURNAL, 2023, 23 (13) :15071-15080
[26]   Uncertainty-aware Pseudo Label Refinery for Domain Adaptive Semantic Segmentation [J].
Wang, Yuxi ;
Peng, Junran ;
Zhang, Zhaoxiang .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9072-9081
[27]   Uncertainty-aware consistency regularization for cross-domain semantic segmentation [J].
Zhou, Qianyu ;
Feng, Zhengyang ;
Gu, Qiqi ;
Cheng, Guangliang ;
Lu, Xuequan ;
Shi, Jianping ;
Ma, Lizhuang .
Computer Vision and Image Understanding, 2022, 221
[28]   Uncertainty-Aware Source-Free Domain Adaptive Semantic Segmentation [J].
Lu, Zhihe ;
Li, Da ;
Song, Yi-Zhe ;
Xiang, Tao ;
Hospedales, Timothy M. M. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 :4664-4676
[29]   Uncertainty-aware consistency regularization for cross-domain semantic segmentation [J].
Zhou, Qianyu ;
Feng, Zhengyang ;
Gu, Qiqi ;
Cheng, Guangliang ;
Lu, Xuequan ;
Shi, Jianping ;
Ma, Lizhuang .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 221
[30]   Modeling unknowns: A vision for uncertainty-aware machine learning in healthcare [J].
Campagner, Andrea ;
Biganzoli, Elia Mario ;
Balsano, Clara ;
Cereda, Cristina ;
Cabitza, Federico .
International Journal of Medical Informatics, 2025, 203