Language Model Pre-training Method in Machine Translation Based on Named Entity Recognition

被引:8
作者
Li, Zhen [1 ]
Qu, Dan [1 ]
Xie, Chaojie [2 ]
Zhang, Wenlin [1 ]
Li, Yanxia [3 ]
机构
[1] PLA Strateg Support Force Informat Engn Univ, Informat Syst Engn Coll, 93 Hightech Zone, Zhengzhou 450000, Peoples R China
[2] Zhengzhou Xinda Inst Adv Technol, 93 Hightech Zone, Zhengzhou 450000, Peoples R China
[3] PLA Strateg Support Force Informat Engn Univ, Foreign Languages Coll, 93 Hightech Zone, Zhengzhou 450000, Peoples R China
关键词
Unsupervised machine translation; language model; named entity recognition;
D O I
10.1142/S0218213020400217
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural Machine Translation (NMT) model has become the mainstream technology in machine translation. The supervised neural machine translation model trains with abundant of sentence-level parallel corpora. But for low-resources language or dialect with no such corpus available, it is difficult to achieve good performance. Researchers began to focus on unsupervised neural machine translation (UNMT) that monolingual corpus as training data. UNMT need to construct the language model (LM) which learns semantic information from the monolingual corpus. This paper focuses on the pre-training of LM in unsupervised machine translation and proposes a pre-training method, NER-MLM (named entity recognition masked language model). Through performing NER, the proposed method can obtain better semantic information and language model parameters with better training results. In the unsupervised machine translation task, the BLEU scores on the WMT'16 English-French, English-German, data sets are 35.30, 27.30 respectively. To the best of our knowledge, this is the highest results in the field of UNMT reported so far.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Pattern based bootstrapping method for named entity recognition
    Ekbal, Asif
    Bandyopadhyay, Sivaji
    PROCEEDINGS OF THE SIXTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION, 2007, : 349 - +
  • [22] Named Entity Recognition Based on Reinforcement Learning and Adversarial Training
    Peng, Shi
    Zhang, Yong
    Yu, Yuanfang
    Zuo, Haoyang
    Zhang, Kai
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2021, 12815 : 191 - 202
  • [23] Human-Machine Collaboration Based Named Entity Recognition
    Ren, Zhuoli
    Yu, Zhiwen
    Wang, Hui
    Wang, Liang
    Liu, Jiaqi
    COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, CHINESECSCW 2021, PT I, 2022, 1491 : 342 - 355
  • [24] Improved Named Entity Recognition using Machine Translation-based Cross-lingual Information
    Dandapat, Sandipan
    Way, Andy
    COMPUTACION Y SISTEMAS, 2016, 20 (03): : 495 - 504
  • [25] FlauBERT: Unsupervised Language Model Pre-training for French
    Le, Hang
    Vial, Loic
    Frej, Jibril
    Segonne, Vincent
    Coavoux, Maximin
    Lecouteux, Benjamin
    Allauzen, Alexandre
    Crabbe, Benoit
    Besacier, Laurent
    Schwab, Didier
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 2479 - 2490
  • [26] A Named Entity Recognition Model Based on Entity Trigger Reinforcement Learning
    Wang, Ping
    Si, Nong
    Tong, Haopeng
    2022 IEEE 2ND INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND ARTIFICIAL INTELLIGENCE (CCAI 2022), 2022, : 43 - 48
  • [27] Named Entity Recognition Model Based on Feature Fusion
    Sun, Zhen
    Li, Xinfu
    INFORMATION, 2023, 14 (02)
  • [28] Development of a Language Model for Named-Entity-Recognition in Aerospace Requirements
    Ray, Archana Tikayat
    Fischer, Olivia J. Pinon
    White, Ryan T.
    Cole, Bjorn F.
    Mavris, Dimitri N.
    JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2024, 21 (06): : 489 - 499
  • [29] Named entity recognition for Hindi language : A survey
    Sharma, Richa
    Morwal, Sudha
    Agarwal, Basant
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2019, 22 (04) : 569 - 580
  • [30] CALM: Context Augmentation with Large Language Model for Named Entity Recognition
    Luiggi, Tristan
    Herserant, Tanguy
    Trani, Thong
    Soulier, Laure
    Guigue, Vincent
    LINKING THEORY AND PRACTICE OF DIGITAL LIBRARIES, PT I, TPDL 2024, 2024, 15177 : 273 - 291