Reinforcement-learning based dialogue system for human-robot interactions with socially-inspired rewards

Cited by: 31
Authors
Ferreira, Emmanuel [1]
Lefevre, Fabrice [1]
Affiliations
[1] Univ Avignon, LIA CERI, Avignon, France
Keywords
Human-robot interaction; POMDP-based dialogue management; Reinforcement learning; Reward shaping; Framework
DOI
10.1016/j.csl.2015.03.007
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper investigates some conditions under which polarized user appraisals, gathered throughout a vocal interaction between a machine and a human, can be integrated into a reinforcement learning-based dialogue manager. More specifically, we discuss how this information can be cast into socially-inspired rewards that speed up policy optimisation, both for efficient task completion and for user adaptation in an online learning setting. For this purpose, a potential-based reward shaping method is combined with a sample-efficient reinforcement learning algorithm to offer a principled framework for coping with these potentially noisy interim rewards. The proposed scheme is meant to facilitate system development by allowing the designer to teach the system, in the early stage of training, through explicit positive/negative feedback given as hints about task progress. At a later stage, the approach can be used to ease the adaptation of the dialogue policy to specific user profiles. Experiments carried out using a state-of-the-art goal-oriented dialogue management framework, the Hidden Information State (HIS), support our claims in two configurations: first, with a user simulator in the tourist information domain (and thus simulated appraisals), and second, in the context of human-robot dialogue with real user trials. © 2015 Elsevier Ltd. All rights reserved.
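The abstract names potential-based reward shaping without giving its form. As a rough illustration only, the standard formulation adds a term gamma * Phi(s') - Phi(s) to each environment reward. The sketch below is not the authors' implementation: the state labels, the potential table, the reward values and the discount factor are all hypothetical, and the potential here is a plain lookup table rather than anything derived from real user appraisals.

```python
from typing import Callable, Hashable, List, Sequence

State = Hashable


def shaping_term(phi: Callable[[State], float], s: State, s_next: State, gamma: float) -> float:
    """Standard potential-based shaping term: gamma * Phi(s') - Phi(s)."""
    return gamma * phi(s_next) - phi(s)


def shaped_rewards(
    states: Sequence[State],
    rewards: Sequence[float],
    phi: Callable[[State], float],
    gamma: float,
) -> List[float]:
    """Add the shaping term to every reward of one episode.

    states holds s_0 .. s_T, rewards holds r_0 .. r_{T-1}.
    """
    if len(states) != len(rewards) + 1:
        raise ValueError("expected exactly one more state than rewards")
    return [
        r + shaping_term(phi, s, s_next, gamma)
        for r, s, s_next in zip(rewards, states, states[1:])
    ]


def discounted_return(rewards: Sequence[float], gamma: float) -> float:
    """Sum of gamma^t * r_t over the episode."""
    return sum((gamma ** t) * r for t, r in enumerate(rewards))


# Hypothetical toy dialogue: potentials stand in for hints about task progress.
POTENTIAL = {"start": 0.0, "one_slot": 1.0, "two_slots": 2.0, "done": 0.0}
GAMMA = 0.95
episode_states = ["start", "one_slot", "two_slots", "done"]
episode_rewards = [-1.0, -1.0, 20.0]  # per-turn penalty, final success bonus

episode_shaped = shaped_rewards(episode_states, episode_rewards, POTENTIAL.__getitem__, GAMMA)
return_gap = discounted_return(episode_shaped, GAMMA) - discounted_return(episode_rewards, GAMMA)
```

Over an episode the shaping terms telescope, so the discounted return changes only by gamma^T * Phi(s_T) - Phi(s_0). With the start and terminal potentials both set to zero, as in this toy table, `return_gap` is zero up to floating-point error: the interim hints alter individual rewards but not the episode return.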
Pages: 256-274
Page count: 19