ChatMDG: A discourse parsing graph fusion based approach for multi-party dialogue generation

被引：4

作者：

Li, Jingyang ^{[1
]}

Song, Shengli ^{[1
]}

Li, Yixin ^{[1
]}

Zhang, Hanxiao ^{[1
]}

Hu, Guangneng ^{[1
]}

机构：

[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710126, Shaanxi, Peoples R China

来源：

INFORMATION FUSION | 2024年 / 110卷

基金：

中国国家自然科学基金;

关键词：

Multi-party dialogue; Dialogue generation; Discourse parsing; Semantic-enriched graph; Large language models; NETWORK;

D O I：

10.1016/j.inffus.2024.102469

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Comprehending multi -party dialogue generation poses a challenge due to intricate speaker interactions, where multiple participants engage in a dynamic exchange of questions and responses, assuming diverse roles such as speaker, receiver, and observer, with these roles evolving across conversational turns. Most existing research on multi -party dialogue generation only considers semantic information contained in each sentence and does not take into account the dialogue flow information implicit in multi -role interaction, leading to difficulties in accurately understanding the dialogue state in multi -party dialogue. To fill these gaps, we introduce an information fusion based approach for M ulti -party D ialogue G eneration named ChatMDG , which integrates role interaction into a semantic-enriched graph with context -based embeddings to cooperatively capture both global and local information in multi -party dialogue. Specifically, we proposes a graph-based network to represent the complex role-interaction dialogue structure for discourse parsing and then designs the dialogue flow encoding method to fuse role-interaction information with semantic states effectively. Furthermore, ChatMDG presents interaction strategies to correspondingly generate reactive and proactive utterances based on the fused embeddings, which lead to more dialogue coherence and user engagement. Experimental results show that ChatMDG significantly improves the accuracy of the multi -party response generation task, especially in complex scenarios with multiple interactions.

引用

页数：10

共 47 条

[1]

Addlesee A, 2024, 2024 ACM IEEE INT C, P123

[2]

Addlesee A, 2024, PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: SYSTEM DEMONSTRATIONS, P62

[3]

Addlesee Angus, 2024, COMPANION 2024 ACMIE, P1273

[4]

[Anonymous], 2016, P 2016 C EMP METH NA, DOI [DOI 10.18653/V1/D16-1231, 10.18653/v1/D16-1231]

[5]

Chen JA, 2021, 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), P1380

[6]

Chen Nuo, 2024, Compress to impress: Unleashing the potential of compressive memory in real-world long-term conversations

[7]

Chen Q, 2023, FINDINGS ASS COMPUTA

[8]

Chernyavskiy Alexander, 2024, P 5 WORKSH COMP APPR, P149

[9]

Chi TC, 2022, 23RD ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE, SIGDIAL 2022, P325

[10]

Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171

← 1 2 3 4 5 →