Reinforcement-learning based dialogue system for human-robot interactions with socially-inspired rewards

Cited by: 31
Authors
Ferreira, Emmanuel [1]
Lefevre, Fabrice [1]
Affiliations
[1] Univ Avignon, LIA CERI, Avignon, France
Keywords
Human-robot interaction; POMDP-based dialogue management; Reinforcement learning; Reward shaping; FRAMEWORK;
DOI
10.1016/j.csl.2015.03.007
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper investigates some conditions under which polarized user appraisals gathered throughout the course of a vocal interaction between a machine and a human can be integrated in a reinforcement learning-based dialogue manager. More specifically, we discuss how this information can be cast into socially-inspired rewards for speeding up the policy optimisation for both efficient task completion and user adaptation in an online learning setting. For this purpose a potential-based reward shaping method is combined with a sample efficient reinforcement learning algorithm to offer a principled framework to cope with these potentially noisy interim rewards. The proposed scheme will greatly facilitate the system's development by allowing the designer to teach his system through explicit positive/negative feedbacks given as hints about task progress, in the early stage of training. At a later stage, the approach will be used as a way to ease the adaptation of the dialogue policy to specific user profiles. Experiments carried out using a state-of-the-art goal-oriented dialogue management framework, the Hidden Information State (HIS), support our claims in two configurations: firstly, with a user simulator in the tourist information domain (and thus simulated appraisals), and secondly, in the context of man-robot dialogue with real user trials. (C) 2015 Elsevier Ltd. All rights reserved.
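The abstract names potential-based reward shaping without spelling it out. As a minimal sketch of the general technique (not the authors' implementation), the shaping term added to the environment reward has the form F(s, s') = gamma * Phi(s') - Phi(s). The potential used here, a running sum of +1/-1 user appraisals, and all function names are assumptions made for illustration only; the paper's actual mapping from appraisals to potentials is not given in this record.

```python
GAMMA = 0.95  # discount factor (illustrative value, not from the paper)


def shaping_term(phi_s, phi_next, gamma=GAMMA):
    """Potential-based shaping term F(s, s') = gamma * Phi(s') - Phi(s)."""
    return gamma * phi_next - phi_s


def potentials_from_appraisals(appraisals, scale=1.0):
    """Hypothetical potential: scaled running sum of +1/-1 appraisals.

    Phi(s_0) is set to 0; each appraisal moves the potential up or down.
    This mapping is an assumption for the sketch, not the paper's method.
    """
    potentials = [0.0]
    for appraisal in appraisals:
        potentials.append(potentials[-1] + scale * appraisal)
    return potentials


def discounted_shaping_sum(potentials, gamma=GAMMA):
    """Discounted sum of shaping terms along one dialogue trajectory."""
    return sum(
        gamma ** t * shaping_term(potentials[t], potentials[t + 1], gamma)
        for t in range(len(potentials) - 1)
    )


# One simulated dialogue with four turns of polarized feedback.
appraisals = [+1, -1, +1, +1]
potentials = potentials_from_appraisals(appraisals)
total = discounted_shaping_sum(potentials)
```

Because the terms telescope, the discounted sum of shaping rewards depends only on the first and last potentials (gamma^T * Phi(s_T) - Phi(s_0)). This is why shaping of this form can add interim hints without changing which policy is optimal.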
Pages: 256-274
Number of pages: 19
Related papers
50 in total
  • [21] The Role of Simulated Emotions in Reinforcement Learning: Insights from a Human-Robot Interaction Experiment
    Nijeholt, Floortje Lycklama A.
    Broekens, Joost
    2023 11TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS, ACIIW, 2023
  • [22] "Who's a Good Robot?!" Designing Human-Robot Teaching Interactions Inspired by Dog Training
    Paci, Patrizia
    Tiddi, Ilaria
    Preciado, Daniel
    Baraka, Kim
    HHAI 2023: AUGMENTING HUMAN INTELLECT, 2023, 368 : 310 - 319
  • [23] Learning on the Job: Long-Term Behavioural Adaptation in Human-Robot Interactions
    Del Duchetto, Francesco
    Hanheide, Marc
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (03) : 6934 - 6941
  • [24] A graph-based reinforcement learning-enabled approach for adaptive human-robot collaborative assembly operations
    Zhang, Rong
    Lv, Jianhao
    Li, Jie
    Bao, Jinsong
    Zheng, Pai
    Peng, Tao
    JOURNAL OF MANUFACTURING SYSTEMS, 2022, 63 : 491 - 503
  • [25] Role Dynamic Allocation of Human-Robot Cooperation Based on Reinforcement Learning in an Installation of Curtain Wall
    Liu, Zhiguang
    Wang, Shilin
    Zhao, Jian
    Hao, Jianhong
    Yu, Fei
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 138 (01) : 473 - 487
  • [26] Toward an Argumentation-based Dialogue Framework for Human-Robot Collaboration
    Azhar, Mohammad Q.
    ICMI '12: PROCEEDINGS OF THE ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2012 : 305 - 308
  • [27] FML-Based Reinforcement Learning Agent with Fuzzy Ontology for Human-Robot Cooperative Edutainment
    Lee, Chang-Shing
    Wang, Mei-Hui
    Tsai, Yi-Lin
    Chang, Wei-Shan
    Reformat, Marek
    Acampora, Giovanni
    Kubota, Naoyuki
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2020, 28 (06) : 1023 - 1060
  • [28] Psychophysics-Based Cognitive Reinforcement Learning to Optimize Human-Robot Interaction in Power-Assisted Object Manipulation
    Rahman, S. M. Mizanoor
    INTELLIGENT HUMAN SYSTEMS INTEGRATION 2021, 2021, 1322 : 56 - 62
  • [29] The Impact of Human-Robot Interface Design on the Use of a Learning Robot System
    Doisy, Guillaume
    Meyer, Joachim
    Edan, Yael
    IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2014, 44 (06) : 788 - 795
  • [30] Autonomous human-robot proxemics: socially aware navigation based on interaction potential
    Mead, Ross
    Mataric, Maja J.
    AUTONOMOUS ROBOTS, 2017, 41 (05) : 1189 - 1201