An emotion-sensitive dialogue policy for task-oriented dialogue system

被引：1

作者：

Zhu, Hui ^{[1
]}

Wang, Xv ^{[2
]}

Wang, Zhenyu ^{[2
]}

Xv, Kai ^{[2
]}

机构：

[1] Guangdong Mech & Elect Polytech, Guangzhou 510545, Peoples R China

[2] South China Univ Technol, Sch Software Engn, Guangzhou 510641, Peoples R China

来源：

SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期

关键词：

FACTORED POMDP MODEL; FRAMEWORK;

D O I：

10.1038/s41598-024-70463-x

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Reinforcement learning (RL) is an effective method in training dialogue policies to steer the conversation towards successful task completion. However, most RL-based methods only rely on semantic inputs that lack empathy as they ignore the user emotional information. Moreover, these methods suffer from delayed rewards caused by the user simulator returning valuable results only at dialogue end. Recently, some methods have been proposed to learn the reward function together with user emotions, but they omit considering user emotion in each dialogue turn. In this paper, we proposed an emotion-sensitive dialogue policy model (ESDP), it incorporates user emotions information into dialogue policy and selects the optimal action by the combination of top-k actions with the user emotions. The user emotion information in each turn is used as an immediate reward for the current dialogue state to solve sparse rewards and the dependency on termination. Extensive experiments validate that our method outperforms the baseline approaches when combined with different Q-Learning algorithms, and also surpasses other popular existing dialog policies' performance.

引用

页数：12

共 38 条

[11]

Li XJ, 2018, Arxiv, DOI arXiv:1703.01008

[12]

Li ZM, 2020, Arxiv, DOI arXiv:2004.03267

[13]

Lipton Z, 2018, AAAI CONF ARTIF INTE, P5237

[14] A survey on empathetic dialogue systems [J].

Ma, Yukun ;

Nguyen, Khanh Linh ;

Xing, Frank Z. ;

Cambria, Erik .

INFORMATION FUSION, 2020, 64 (50-70) :50-70

[15] Recent advances in deep learning based dialogue systems: a systematic survey [J].

Ni, Jinjie ;

Young, Tom ;

Pandelea, Vlad ;

Xue, Fuzhao ;

Cambria, Erik .

ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (04) :3055-3155

[16]

Papangelis A, 2019, Arxiv, DOI arXiv:1907.05507

[17]

Peng BL, 2018, Arxiv, DOI arXiv:1801.06176

[18] A novel factored POMDP model for affective dialogue management [J].

Ren, Fuji ;

Wang, Yu ;

Quan, Changqin .

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2016, 31 (01) :127-136

[19] TFSM-based dialogue management model framework for affective dialogue systems [J].

Ren, Fuji ;

Wang, Yu ;

Quan, Changqin .

IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2015, 10 (04) :404-410

[20]

Saha T., 2022, 2022 INT JOINT C NEU, P1

← 1 2 3 4 →