RoRePo: Detecting the role information and relative position information for contexts in multi-turn dialogue generation

被引:1
作者
Gan, Zibang [1 ]
Zeng, Biqing [1 ]
Cheng, Lianglun [3 ]
Liu, Shuai [1 ]
Yang, Heng [2 ]
Xu, Mayi [1 ]
Ding, Meirong [1 ]
机构
[1] South China Normal Univ, Sch Software, Nanhai Software Technol Pk, Foshan, Guangdong, Peoples R China
[2] South China Normal Univ, Sch Comp, Guangzhou, Guangdong, Peoples R China
[3] Guangdong Univ Technol, Guangdong Prov Key Lab Cyber Phys Syst, Guangzhou, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Dialogue system; natural language generation; multi-turn dialogue; deep learning;
D O I
10.3233/JIFS-202641
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multi-turn dialogue generation, dialogue contexts have been shown to have an important influence on the reasoning of the next round of dialogue. A multi-turn dialogue between two people should be able to give a reasonable response according to the relevant context. However, the widely used hierarchical recurrent encoder-decoder model and the latest model that detecting the relevant contexts with self-attention are facing the same problem. Their given response doesn't match the identity of the current speaker, which we call it role ambiguity. In this paper, we propose a new model, named RoRePo, to tackle this problem by detecting the role information and relative position information. Firstly, as a part of the decoder input, we add a role embedding to identity different speakers. Secondly, we incorporate self-attention mechanism with relative position representation to dialogue context understanding. Besides, the design of our model architecture considers the influence of latent variables in generating more diverse responses. Experimental results of our evaluations on the DailyDialog and DSTC7 AVSD datasets show that our proposed model advances in multi-turn dialogue generation.
引用
收藏
页码:10003 / 10015
页数:13
相关论文
共 22 条
  • [1] Audio Visual Scene-Aware Dialog
    Alamri, Huda
    Cartillier, Vincent
    Das, Abhishek
    Wang, Jue
    Cherian, Anoop
    Essa, Irfan
    Batra, Dhruv
    Marks, Tim K.
    Hori, Chiori
    Anderson, Peter
    Lee, Stefan
    Parikh, Devi
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7550 - 7559
  • [2] [Anonymous], 2014, Advances in neural information processing systems
  • [3] [Anonymous], 2014, P INT C LEARN REPR
  • [4] [Anonymous], 2015, P INT C LEARN REPR
  • [5] [Anonymous], 2018, NAACL
  • [6] [Anonymous], 2017, 31 AAAI C ART INT
  • [7] [Anonymous], 2017, IJCNLP
  • [8] LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT
    BENGIO, Y
    SIMARD, P
    FRASCONI, P
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02): : 157 - 166
  • [9] Cho K., 2014, P C EMP METH NAT LAN, P1724, DOI DOI 10.3115/V1/D14-1179
  • [10] Devlin J, 2018, ARXIV