RoRePo: Detecting the role information and relative position information for contexts in multi-turn dialogue generation

被引：1

作者：

Gan, Zibang ^{[1
]}

Zeng, Biqing ^{[1
]}

Cheng, Lianglun ^{[3
]}

Liu, Shuai ^{[1
]}

Yang, Heng ^{[2
]}

Xu, Mayi ^{[1
]}

Ding, Meirong ^{[1
]}

机构：

[1] South China Normal Univ, Sch Software, Nanhai Software Technol Pk, Foshan, Guangdong, Peoples R China

[2] South China Normal Univ, Sch Comp, Guangzhou, Guangdong, Peoples R China

[3] Guangdong Univ Technol, Guangdong Prov Key Lab Cyber Phys Syst, Guangzhou, Guangdong, Peoples R China

来源：

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS | 2021年 / 40卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Dialogue system; natural language generation; multi-turn dialogue; deep learning;

D O I：

10.3233/JIFS-202641

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In multi-turn dialogue generation, dialogue contexts have been shown to have an important influence on the reasoning of the next round of dialogue. A multi-turn dialogue between two people should be able to give a reasonable response according to the relevant context. However, the widely used hierarchical recurrent encoder-decoder model and the latest model that detecting the relevant contexts with self-attention are facing the same problem. Their given response doesn't match the identity of the current speaker, which we call it role ambiguity. In this paper, we propose a new model, named RoRePo, to tackle this problem by detecting the role information and relative position information. Firstly, as a part of the decoder input, we add a role embedding to identity different speakers. Secondly, we incorporate self-attention mechanism with relative position representation to dialogue context understanding. Besides, the design of our model architecture considers the influence of latent variables in generating more diverse responses. Experimental results of our evaluations on the DailyDialog and DSTC7 AVSD datasets show that our proposed model advances in multi-turn dialogue generation.

引用

页码：10003 / 10015

页数：13

共 22 条

[1] Audio Visual Scene-Aware Dialog
Alamri, Huda
Cartillier, Vincent
Das, Abhishek
Wang, Jue
Cherian, Anoop
Essa, Irfan
Batra, Dhruv
Marks, Tim K.
Hori, Chiori
Anderson, Peter
Lee, Stefan
Parikh, Devi
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7550 - 7559
[2] [Anonymous], 2014, Advances in neural information processing systems
[3] [Anonymous], 2014, P INT C LEARN REPR
[4] [Anonymous], 2015, P INT C LEARN REPR
[5] [Anonymous], 2018, NAACL
[6] [Anonymous], 2017, 31 AAAI C ART INT
[7] [Anonymous], 2017, IJCNLP
[8] LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT
BENGIO, Y
SIMARD, P
FRASCONI, P
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02): : 157 - 166
[9] Cho K., 2014, P C EMP METH NAT LAN, P1724, DOI DOI 10.3115/V1/D14-1179
[10] Devlin J, 2018, ARXIV

← 1 2 3 →