Reinforcement-learning based dialogue system for human-robot interactions with socially-inspired rewards

被引:31
|
作者
Ferreira, Emmanuel [1 ]
Lefevre, Fabrice [1 ]
机构
[1] Univ Avignon, LIA CERI, Avignon, France
关键词
Human-robot interaction; POMDP-based dialogue management; Reinforcement learning; Reward shaping; FRAMEWORK;
D O I
10.1016/j.csl.2015.03.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper investigates some conditions under which polarized user appraisals gathered throughout the course of a vocal interaction between a machine and a human can be integrated in a reinforcement learning-based dialogue manager. More specifically, we discuss how this information can be cast into socially-inspired rewards for speeding up the policy optimisation for both efficient task completion and user adaptation in an online learning setting. For this purpose a potential-based reward shaping method is combined with a sample efficient reinforcement learning algorithm to offer a principled framework to cope with these potentially noisy interim rewards. The proposed scheme will greatly facilitate the system's development by allowing the designer to teach his system through explicit positive/negative feedbacks given as hints about task progress, in the early stage of training. At a later stage, the approach will be used as a way to ease the adaptation of the dialogue policy to specific user profiles. Experiments carried out using a state-of-the-art goal-oriented dialogue management framework, the Hidden Information State (HIS), support our claims in two configurations: firstly, with a user simulator in the tourist information domain (and thus simulated appraisals), and secondly, in the context of man-robot dialogue with real user trials. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:256 / 274
页数:19
相关论文
共 50 条
  • [41] Argumentation-Based Dialogue Games for Shared Control in Human-Robot Systems
    Sklar, Elizabeth I.
    Azhar, M. Q.
    JOURNAL OF HUMAN-ROBOT INTERACTION, 2015, 4 (03): : 120 - 148
  • [42] Human-Robot Interaction System with Quantum-Inspired Bidirectional Associative Memory
    Masuyama, Naoki
    Loo, Chu Kiong
    Kubota, Naoyuki
    2013 SECOND INTERNATIONAL CONFERENCE ON ROBOT, VISION AND SIGNAL PROCESSING (RVSP), 2013, : 66 - 71
  • [43] Asynchronous federated learning system for human-robot touch interaction
    Gamboa-Montero, Juan Jose
    Alonso-Martin, Fernando
    Marques-Villarroya, Sara
    Sequeira, Joao
    Salichs, Miguel A.
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 211
  • [44] Improving Workers' Musculoskeletal Health During Human-Robot Collaboration Through Reinforcement Learning
    Xie, Ziyang
    Lu, Lu
    Wang, Hanwen
    Su, Bingyi
    Liu, Yunan
    Xu, Xu
    HUMAN FACTORS, 2024, 66 (06) : 1754 - 1769
  • [45] Task-oriented Dialogue System Based on Reinforcement Learning
    Song, Meina
    Chen, Zhongfu
    Niu, Peiqing
    Haihong, E.
    PROCEEDINGS OF 2019 IEEE 10TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2019), 2019, : 93 - 98
  • [46] Safety-constrained Deep Reinforcement Learning control for human-robot collaboration in construction
    Duan, Kangkang
    Zou, Zhengbo
    AUTOMATION IN CONSTRUCTION, 2025, 174
  • [47] A FRAMEWORK FOR HUMAN-ROBOT TEAMING PERFORMANCE PREDICTION: REINFORCEMENT LEARNING AND EYE MOVEMENT ANALYSIS
    Galvani, Gustavo
    Korivand, Soroush
    Ajoudani, Arash
    Gong, Jiaqi
    Jalili, Nader
    PROCEEDINGS OF ASME 2023 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2023, VOL 3, 2023,
  • [48] Adaptive Robot Behavior Based on Human Comfort Using Reinforcement Learning
    Gonzalez-Santocildes, Asier
    Vazquez, Juan-Ignacio
    Eguiluz, Andoni
    IEEE ACCESS, 2024, 12 : 122289 - 122299
  • [49] Human-to-Robot Handover Based on Reinforcement Learning
    Kim, Myunghyun
    Yang, Sungwoo
    Kim, Beomjoon
    Kim, Jinyeob
    Kim, Donghan
    SENSORS, 2024, 24 (19)
  • [50] Reinforcement Learning for Dynamic Trajectory Adjustment in Human-Robot Interaction Within Virtual Simulations
    Gonzalez-Santocildes, Asier
    Vazquez, Juan-Ignacio
    Eguiluz, Andoni
    Garcia Bringas, P.
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, PT II, HAIS 2024, 2025, 14858 : 239 - 251