Document-Level Machine Translation with Large Language Models

被引:0
|
作者
Wang, Longyue [1 ]
Lyu, Chenyang [2 ]
Ji, Tianbo [3 ]
Zhang, Zhirui [1 ]
Yu, Dian [1 ]
Shi, Shuming [1 ]
Tu, Zhaopeng [1 ]
机构
[1] Tencent AI Lab, Shenzhen, Guangdong, Peoples R China
[2] MBZUAI, Abu Dhabi, U Arab Emirates
[3] Dublin City Univ, Dublin, Ireland
来源
2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023) | 2023年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large language models (LLMs) such as ChatGPT can produce coherent, cohesive, relevant, and fluent answers for various natural language processing (NLP) tasks. Taking documentlevel machine translation (MT) as a testbed, this paper provides an in-depth evaluation of LLMs' ability on discourse modeling. The study focuses on three aspects: 1) Effects of Context-Aware Prompts, where we investigate the impact of different prompts on document-level translation quality and discourse phenomena; 2) Comparison of Translation Models, where we compare the translation performance of ChatGPT with commercial MT systems and advanced document-level MT methods; 3) Analysis of Discourse Modelling Abilities, where we further probe discourse knowledge encoded in LLMs and shed light on impacts of training techniques on discourse modeling. By evaluating on a number of benchmarks, we surprisingly find that LLMs have demonstrated superior performance and show potential to become a new paradigm for document-level translation: 1) leveraging their powerful long-text modeling capabilities, GPT-3.5 and GPT-4 outperform commercial MT systems in terms of human evaluation;(1) 2) GPT-4 demonstrates a stronger ability for probing linguistic knowledge than GPT-3.5. This work highlights the challenges and opportunities of LLMs for MT, which we hope can inspire the future design and evaluation of LLMs.(2)
引用
收藏
页码:16646 / 16661
页数:16
相关论文
共 50 条
  • [1] AutoRE: Document-Level Relation Extraction with Large Language Models
    Xue, Lilong
    Zhang, Dan
    Dong, Yuxiao
    Tang, Jie
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 3: SYSTEM DEMONSTRATIONS, 2024, : 211 - 220
  • [2] Document-Level Adaptation for Neural Machine Translation
    Kothur, Sachith Sri Ram
    Knowles, Rebecca
    Koehn, Philipp
    NEURAL MACHINE TRANSLATION AND GENERATION, 2018, : 64 - 73
  • [3] Rethinking Document-level Neural Machine Translation
    Sun, Zewei
    Wang, Mingxuan
    Zhou, Hao
    Zhao, Chengqi
    Huang, Shujian
    Chen, Jiajun
    Li, Lei
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 3537 - 3548
  • [4] TANDO: A Corpus for Document-level Machine Translation
    Gete, Harritxu
    Etchegoyhen, Thierry
    Ponce, David
    Labaka, Gorka
    Aranberri, Nora
    Corral, Ander
    Saralegi, Xabier
    Santos, Igor Ellakuria
    Martin, Maite
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 3026 - 3037
  • [5] Corpora for Document-Level Neural Machine Translation
    Liu, Siyou
    Zhang, Xiaojun
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3775 - 3781
  • [6] Large language models effectively leverage document-level context for literary translation, but critical errors persist
    Karpinska, Marzena
    Iyyer, Mohit
    arXiv, 2023,
  • [7] Document-Level Machine Translation as a Re-translation Process
    Martinez Garcia, Eva
    Espana-Bonet, Cristina
    Marquez, Lluis
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2014, (53): : 103 - 110
  • [8] On Search Strategies for Document-Level Neural Machine Translation
    Herold, Christian
    Ney, Hermann
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 12827 - 12836
  • [9] Exploring Discourse Structure in Document-level Machine Translation
    Hu, Xinyu
    Wan, Xiaojun
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 13889 - 13902
  • [10] Scaling Law for Document-Level Neural Machine Translation
    Zhang, Zhuocheng
    Gu, Shuhao
    Zhang, Min
    Feng, Yang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 8290 - 8303