RoRePo: Detecting the role information and relative position information for contexts in multi-turn dialogue generation

被引：1

作者：

Gan, Zibang ^{[1
]}

Zeng, Biqing ^{[1
]}

Cheng, Lianglun ^{[3
]}

Liu, Shuai ^{[1
]}

Yang, Heng ^{[2
]}

Xu, Mayi ^{[1
]}

Ding, Meirong ^{[1
]}

机构：

[1] South China Normal Univ, Sch Software, Nanhai Software Technol Pk, Foshan, Guangdong, Peoples R China

[2] South China Normal Univ, Sch Comp, Guangzhou, Guangdong, Peoples R China

[3] Guangdong Univ Technol, Guangdong Prov Key Lab Cyber Phys Syst, Guangzhou, Guangdong, Peoples R China

来源：

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS | 2021年 / 40卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Dialogue system; natural language generation; multi-turn dialogue; deep learning;

D O I：

10.3233/JIFS-202641

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In multi-turn dialogue generation, dialogue contexts have been shown to have an important influence on the reasoning of the next round of dialogue. A multi-turn dialogue between two people should be able to give a reasonable response according to the relevant context. However, the widely used hierarchical recurrent encoder-decoder model and the latest model that detecting the relevant contexts with self-attention are facing the same problem. Their given response doesn't match the identity of the current speaker, which we call it role ambiguity. In this paper, we propose a new model, named RoRePo, to tackle this problem by detecting the role information and relative position information. Firstly, as a part of the decoder input, we add a role embedding to identity different speakers. Secondly, we incorporate self-attention mechanism with relative position representation to dialogue context understanding. Besides, the design of our model architecture considers the influence of latent variables in generating more diverse responses. Experimental results of our evaluations on the DailyDialog and DSTC7 AVSD datasets show that our proposed model advances in multi-turn dialogue generation.

引用

页码：10003 / 10015

页数：13

共 22 条

[1] Audio Visual Scene-Aware Dialog [J].

Alamri, Huda ;

Cartillier, Vincent ;

Das, Abhishek ;

Wang, Jue ;

Cherian, Anoop ;

Essa, Irfan ;

Batra, Dhruv ;

Marks, Tim K. ;

Hori, Chiori ;

Anderson, Peter ;

Lee, Stefan ;

Parikh, Devi .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7550-7559

[2]

[Anonymous], 2014, NeurIPS6

[3]

[Anonymous], 2014, P INT C LEARN REPR

[4]

[Anonymous], 2018, NAACL

[5]

[Anonymous], 2017, 31 AAAI C ART INT

[6]

[Anonymous], 2017, IJCNLP

[7] LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].

BENGIO, Y ;

SIMARD, P ;

FRASCONI, P .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166

[8]

Cho K., 2014, P 2014 C EMP METH NA, P1724, DOI 10.3115/v1/d14-1179

[9]

Devlin J., 2018, ARXIV

[10] A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation [J].

Guan, Jian ;

Huang, Fei ;

Zhao, Zhihao ;

Zhu, Xiaoyan ;

Huang, Minlie .

TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2020, 8 :93-108

← 1 2 3 →