Reinforcement-learning based dialogue system for human-robot interactions with socially-inspired rewards

被引：31

作者：

Ferreira, Emmanuel ^{[1
]}

Lefevre, Fabrice ^{[1
]}

机构：

[1] Univ Avignon, LIA CERI, Avignon, France

来源：

COMPUTER SPEECH AND LANGUAGE | 2015年 / 34卷 / 01期

关键词：

Human-robot interaction; POMDP-based dialogue management; Reinforcement learning; Reward shaping; FRAMEWORK;

D O I：

10.1016/j.csl.2015.03.007

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper investigates some conditions under which polarized user appraisals gathered throughout the course of a vocal interaction between a machine and a human can be integrated in a reinforcement learning-based dialogue manager. More specifically, we discuss how this information can be cast into socially-inspired rewards for speeding up the policy optimisation for both efficient task completion and user adaptation in an online learning setting. For this purpose a potential-based reward shaping method is combined with a sample efficient reinforcement learning algorithm to offer a principled framework to cope with these potentially noisy interim rewards. The proposed scheme will greatly facilitate the system's development by allowing the designer to teach his system through explicit positive/negative feedbacks given as hints about task progress, in the early stage of training. At a later stage, the approach will be used as a way to ease the adaptation of the dialogue policy to specific user profiles. Experiments carried out using a state-of-the-art goal-oriented dialogue management framework, the Hidden Information State (HIS), support our claims in two configurations: firstly, with a user simulator in the tourist information domain (and thus simulated appraisals), and secondly, in the context of man-robot dialogue with real user trials. (C) 2015 Elsevier Ltd. All rights reserved.

引用

页码：256 / 274

页数：19

共 50 条

[31] A Cross-Situational Learning Based Framework for Grounding of Synonyms in Human-Robot Interactions
Roesler, Oliver
FOURTH IBERIAN ROBOTICS CONFERENCE: ADVANCES IN ROBOTICS, ROBOT 2019, VOL 2, 2020, 1093 : 225 - 236
[32] Brain-Inspired Active Learning Architecture for Procedural Knowledge Understanding Based on Human-Robot Interaction
Tielin Zhang
Yi Zeng
Ruihan Pan
Mengting Shi
Enmeng Lu
Cognitive Computation, 2021, 13 : 381 - 393
[33] Brain-Inspired Active Learning Architecture for Procedural Knowledge Understanding Based on Human-Robot Interaction
Zhang, Tielin
Zeng, Yi
Pan, Ruihan
Shi, Mengting
Lu, Enmeng
COGNITIVE COMPUTATION, 2021, 13 (02) : 381 - 393
[34] Robot Reinforcement Learning Based on Learning Classifier System
Shao, Jie
Yang, Jing-yu
ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, 2010, 93 : 200 - 207
[35] Robust Assembly Sequence Generation in a Human-Robot Collaborative Workcell by Reinforcement Learning
Antonelli, Dario
Zeng, Qingfei
Aliev, Khurshid
Liu, Xuemei
FME TRANSACTIONS, 2021, 49 (04): : 851 - 858
[36] Goal-Conditioned Reinforcement Learning within a Human-Robot Disassembly Environment
Elguea-Aguinaco, Inigo
Serrano-Munoz, Antonio
Chrysostomou, Dimitrios
Inziarte-Hidalgo, Ibai
Bogh, Simon
Arana-Arexolaleiba, Nestor
APPLIED SCIENCES-BASEL, 2022, 12 (22):
[37] Intrinsically motivated reinforcement learning for human-robot interaction in the real-world
Qureshi, Ahmed Hussain
Nakamura, Yutaka
Yoshikawa, Yuichiro
Ishiguro, Hiroshi
NEURAL NETWORKS, 2018, 107 : 23 - 33
[38] Bayesian Reinforcement Learning for Adaptive Balancing in an Assembly Line With Human-Robot Collaboration
Lee, Hyun-Rok
Park, Sanghyun
Lee, Jimin
IEEE ACCESS, 2024, 12 : 172256 - 172265
[39] An adaptive reinforcement learning-based multimodal data fusion framework for human-robot confrontation gaming
Qi, Wen
Fan, Haoyu
Karimi, Hamid Reza
Su, Hang
NEURAL NETWORKS, 2023, 164 : 489 - 496
[40] LED Strip Based Robot Movement Intention Signs for Human-Robot Interactions
Domonkos, Mark
Dombi, Zoltan
Botzheim, Janos
2020 IEEE 20TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS (CINTI), 2020,

← 1 2 3 4 5 →