Reinforcement-learning based dialogue system for human-robot interactions with socially-inspired rewards

Cited by: 31

Authors:

Ferreira, Emmanuel [1]

Lefevre, Fabrice [1]

Affiliations:

[1] Univ Avignon, LIA CERI, Avignon, France

Source:

COMPUTER SPEECH AND LANGUAGE | 2015 / Vol. 34 / Issue 01

Keywords:

Human-robot interaction; POMDP-based dialogue management; Reinforcement learning; Reward shaping; FRAMEWORK

DOI:

10.1016/j.csl.2015.03.007

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

This paper investigates some conditions under which polarized user appraisals gathered throughout the course of a vocal interaction between a machine and a human can be integrated in a reinforcement learning-based dialogue manager. More specifically, we discuss how this information can be cast into socially-inspired rewards for speeding up the policy optimisation for both efficient task completion and user adaptation in an online learning setting. For this purpose, a potential-based reward shaping method is combined with a sample-efficient reinforcement learning algorithm to offer a principled framework to cope with these potentially noisy interim rewards. The proposed scheme will greatly facilitate the system's development by allowing the designer to teach the system through explicit positive/negative feedback given as hints about task progress in the early stage of training. At a later stage, the approach will be used as a way to ease the adaptation of the dialogue policy to specific user profiles. Experiments carried out using a state-of-the-art goal-oriented dialogue management framework, the Hidden Information State (HIS), support our claims in two configurations: firstly, with a user simulator in the tourist information domain (and thus simulated appraisals), and secondly, in the context of human-robot dialogue with real user trials. © 2015 Elsevier Ltd. All rights reserved.
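
The potential-based reward shaping mentioned in the abstract can be illustrated with a minimal sketch. This is the generic textbook form, in which the shaping term is F(s, s') = gamma * phi(s') - phi(s), and not the authors' implementation. The dialogue state names, the potential values (imagined here as derived from positive/negative user appraisals), the per-turn rewards and the discount factor are all hypothetical and chosen only for illustration.

```python
from typing import Callable, Hashable, Sequence

State = Hashable


def shaping_term(phi: Callable[[State], float], s: State, s_next: State,
                 gamma: float) -> float:
    """Generic potential-based shaping term F(s, s') = gamma * phi(s') - phi(s)."""
    return gamma * phi(s_next) - phi(s)


def shaped_reward(env_reward: float, phi: Callable[[State], float],
                  s: State, s_next: State, gamma: float) -> float:
    """Environment reward plus the shaping term for one transition."""
    return env_reward + shaping_term(phi, s, s_next, gamma)


def discounted_return(rewards: Sequence[float], gamma: float) -> float:
    """Sum of gamma**t * r_t over a finite episode."""
    return sum((gamma ** t) * r for t, r in enumerate(rewards))


# Hypothetical potential table: higher values stand for dialogue states
# that (assumed) user appraisals marked as closer to task completion.
POTENTIAL = {"greeting": 0.0, "slots_filled": 0.5, "confirmed": 1.0}


def phi(state: State) -> float:
    return POTENTIAL[state]


GAMMA = 0.95                                        # assumed discount factor
states = ["greeting", "slots_filled", "confirmed"]  # hypothetical episode
env_rewards = [-1.0, -1.0]                          # assumed per-turn penalty

shaped = [
    shaped_reward(r, phi, s, s_next, GAMMA)
    for r, s, s_next in zip(env_rewards, states[:-1], states[1:])
]

plain_return = discounted_return(env_rewards, GAMMA)
shaped_return = discounted_return(shaped, GAMMA)

# The shaping terms telescope, so the two returns differ only by
# gamma**T * phi(s_T) - phi(s_0), independent of the actions taken.
T = len(env_rewards)
offset = (GAMMA ** T) * phi(states[-1]) - phi(states[0])
```

Because the difference between the shaped and unshaped returns depends only on the first and last states of the episode, shaping of this form gives interim hints without changing which policy is optimal. That property is what makes it a natural fit for noisy interim feedback.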


Pages: 256-274

Page count: 19
