Neural Paraphrase Generation with Multi-domain Corpus

被引:0
|
作者
Qiao, Lin [1 ]
Li, Yida [1 ]
Zhong, ChenLi [1 ]
机构
[1] Peking Univ, Beijing, Peoples R China
关键词
Paraphrase generation; Adversarial training;
D O I
10.1007/978-3-030-86362-3_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatic paraphrase generation is an important task for natural language processing. However, progress in paraphrase generation has been hindered for a long time by the lack of large monolingual parallel corpora. We can alleviate the data shortage by effectively using multidomain corpus. In this paper, we propose a novel model to exploit information from other source domains (out-of-domains) which benefits our target domain (in-domain). In our method, we maintain a private encoder and a private decoder for each domain which are used to model domain-specific information. In the meantime, we introduce a shared encoder and a shared decoder shared by all domains which only contain domain-independent information. Besides, we add a domain discriminator to the shared encoder to reinforce the ability to capture common features of shared encoder by adversarial training. Experimental results show that our method not only perform well in traditional domain adaptation tasks but also improve performance in all domains together. Moreover, we show that the shared layer learned by our proposed model can be regarded as an off-the-shelf layer and can be easily adapted to new domains.
引用
收藏
页码:54 / 66
页数:13
相关论文
共 50 条
  • [1] Multi-Domain Neural Network Recommender
    Yi, Baolin
    Zhao, Shuting
    Shen, Xiaoxuan
    Zhang, Li
    2018 IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION ENGINEERING (ICECE 2018), 2018, : 41 - 45
  • [2] A Multi-domain Corpus of Swedish Word Sense Annotation
    Johansson, Richard
    Adesam, Yvonne
    Bouma, Gerlof
    Hedberg, Karin
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 3019 - 3022
  • [3] Multi-domain Neural Network Language Model
    Alumae, Tanel
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2181 - 2185
  • [4] Decomposable Neural Paraphrase Generation
    Li, Zichao
    Jiang, Xin
    Shang, Lifeng
    Liu, Qun
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3403 - 3414
  • [5] DecOp: A Multilingual and Multi-domain Corpus For Detecting Deception In Typed Text
    Capuozzo, Pasquale
    Lauriola, Ivano
    Strapparava, Carlo
    Aiolli, Fabio
    Sartori, Giuseppe
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 1423 - 1430
  • [6] Building a multi-domain comparable corpus using a learning to rank method
    Rahimi, Razieh
    Shakery, Azadeh
    Dadashkarimi, Javid
    Ariannezhad, Mozhdeh
    Dehghani, Mostafa
    Esfahani, Hossein Nasr
    NATURAL LANGUAGE ENGINEERING, 2016, 22 (04) : 627 - 653
  • [7] An Experimental Evaluation of Algorithms for Opinion Mining in Multi-domain Corpus in Albanian
    Kote, Nelda
    Biba, Marenglen
    Trandafili, Evis
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2018), 2018, 11177 : 439 - 447
  • [8] Efficient parametrization of multi-domain deep neural networks
    Rebuffi, Sylvestre-Alvise
    Bilen, Hakan
    Vedaldi, Andrea
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8119 - 8127
  • [9] Factorized Transformer for Multi-Domain Neural Machine Translation
    Deng, Yongchao
    Yu, Hongfei
    Yu, Heng
    Duan, Xiangyu
    Luo, Weihua
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 4221 - 4230
  • [10] Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey
    Saunders D.
    Journal of Artificial Intelligence Research, 2022, 75 : 351 - 424