Reinforcement-learning based dialogue system for human-robot interactions with socially-inspired rewards

被引：31

作者：

Ferreira, Emmanuel ^{[1
]}

Lefevre, Fabrice ^{[1
]}

机构：

[1] Univ Avignon, LIA CERI, Avignon, France

来源：

COMPUTER SPEECH AND LANGUAGE | 2015年 / 34卷 / 01期

关键词：

Human-robot interaction; POMDP-based dialogue management; Reinforcement learning; Reward shaping; FRAMEWORK;

D O I：

10.1016/j.csl.2015.03.007

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper investigates some conditions under which polarized user appraisals gathered throughout the course of a vocal interaction between a machine and a human can be integrated in a reinforcement learning-based dialogue manager. More specifically, we discuss how this information can be cast into socially-inspired rewards for speeding up the policy optimisation for both efficient task completion and user adaptation in an online learning setting. For this purpose a potential-based reward shaping method is combined with a sample efficient reinforcement learning algorithm to offer a principled framework to cope with these potentially noisy interim rewards. The proposed scheme will greatly facilitate the system's development by allowing the designer to teach his system through explicit positive/negative feedbacks given as hints about task progress, in the early stage of training. At a later stage, the approach will be used as a way to ease the adaptation of the dialogue policy to specific user profiles. Experiments carried out using a state-of-the-art goal-oriented dialogue management framework, the Hidden Information State (HIS), support our claims in two configurations: firstly, with a user simulator in the tourist information domain (and thus simulated appraisals), and secondly, in the context of man-robot dialogue with real user trials. (C) 2015 Elsevier Ltd. All rights reserved.

引用

页码：256 / 274

页数：19

共 50 条

[21] The Role of Simulated Emotions in Reinforcement Learning: Insights from a Human-Robot Interaction Experiment
Nijeholt, Floortje Lycklama A.
Broekens, Joost
2023 11TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS, ACIIW, 2023,
[22] "Who's a Good Robot?!" Designing Human-Robot Teaching Interactions Inspired by Dog Training
Paci, Patrizia
Tiddi, Ilaria
Preciado, Daniel
Baraka, Kim
HHAI 2023: AUGMENTING HUMAN INTELLECT, 2023, 368 : 310 - 319
[23] Learning on the Job: Long-Term Behavioural Adaptation in Human-Robot Interactions
Del Duchetto, Francesco
Hanheide, Marc
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 6934 - 6941
[24] A graph-based reinforcement learning-enabled approach for adaptive human-robot collaborative assembly operations
Zhang, Rong
Lv, Jianhao
Li, Jie
Bao, Jinsong
Zheng, Pai
Peng, Tao
JOURNAL OF MANUFACTURING SYSTEMS, 2022, 63 : 491 - 503
[25] Role Dynamic Allocation of Human-Robot Cooperation Based on Reinforcement Learning in an Installation of Curtain Wall
Liu, Zhiguang
Wang, Shilin
Zhao, Jian
Hao, Jianhong
Yu, Fei
CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 138 (01): : 473 - 487
[26] Toward an Argumentation-based Dialogue Framework for Human-Robot Collaboration
Azhar, Mohammad Q.
ICMI '12: PROCEEDINGS OF THE ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2012, : 305 - 308
[27] FML-Based Reinforcement Learning Agent with Fuzzy Ontology for Human-Robot Cooperative Edutainment
Lee, Chang-Shing
Wang, Mei-Hui
Tsai, Yi-Lin
Chang, Wei-Shan
Reformat, Marek
Acampora, Giovanni
Kubota, Naoyuki
INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2020, 28 (06) : 1023 - 1060
[28] Psychophysics-Based Cognitive Reinforcement Learning to Optimize Human-Robot Interaction in Power-Assisted Object Manipulation
Rahman, S. M. Mizanoor
INTELLIGENT HUMAN SYSTEMS INTEGRATION 2021, 2021, 1322 : 56 - 62
[29] The Impact of Human-Robot Interface Design on the Use of a Learning Robot System
Doisy, Guillaume
Meyer, Joachim
Edan, Yael
IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2014, 44 (06) : 788 - 795
[30] Autonomous human-robot proxemics: socially aware navigation based on interaction potential
Mead, Ross
Mataric, Maja J.
AUTONOMOUS ROBOTS, 2017, 41 (05) : 1189 - 1201

← 1 2 3 4 5 →