Learning Goal Conditioned Socially Compliant Navigation From Demonstration Using Risk-Based Features

被引:13
|
作者
Konar, Abhisek [1 ]
Baghi, Bobak H. [1 ]
Dudek, Gregory [1 ,2 ]
机构
[1] McGill Univ, Sch Comp Sci, Montreal, PQ H3A 2K6, Canada
[2] Samsung Elect AI Ctr Montrea1, Montreal, PQ H3B 4K4, Canada
关键词
Navigation; Trajectory; Reinforcement learning; Entropy; Computational modeling; Two dimensional displays; Robot sensing systems; Inverse reinforcement learning; learning from demonstration; motion and path planning; robot navigation; social navigation; ROBOT;
D O I
10.1109/LRA.2020.3048657
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
One of the main challenges of operating mobile robots in social environments is the safe and fluid navigation therein, specifically the ability to share a space with other human inhabitants by complying with the explicit and implicit rules that we humans follow during navigation. While these rules come naturally to us, they resist simple and explicit definitions. In this letter, we present a learning-based solution to address the question of socially compliant navigation, which is to navigate while maintaining adherence to the navigational policies a person might use. We infer these policies by learning from human examples using inverse reinforcement learning techniques. In particular, this letter contributes an efficient sampling-based approximation to enable model-free deep inverse reinforcement learning, and a goal conditioned risk-based feature representation that adequately captures local information surrounding the agent. We validate our approach by comparing against a classical algorithm and a reinforcement learning agent and evaluate our feature representation against similar feature representations from the literature. We find that the combination of our proposed method and our feature representation produce higher quality trajectories and that our proposed feature representation plays a critical role in successful navigation.
引用
收藏
页码:651 / 658
页数:8
相关论文
共 6 条
  • [1] Socially Compliant Robot Navigation in Crowded Environment by Human Behavior Resemblance Using Deep Reinforcement Learning
    Samsani, Sunil Srivatsav
    Muhammad, Mannan Saeed
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (03) : 5223 - 5230
  • [2] Risk-based implementation of COLREGs for autonomous surface vehicles using deep reinforcement learning
    Heiberg, Amalie
    Larsen, Thomas Nakken
    Meyer, Eivind
    Rasheed, Adil
    San, Omer
    Varagnolo, Damiano
    NEURAL NETWORKS, 2022, 152 : 17 - 33
  • [3] Towards Goal Based Architecture Design for Learning High-Level Representation of Behaviors from Demonstration
    Fonooni, Benjamin
    Hellstrom, Thomas
    Janlert, Lars-Erik
    2013 IEEE INTERNATIONAL MULTI-DISCIPLINARY CONFERENCE ON COGNITIVE METHODS IN SITUATION AWARENESS AND DECISION SUPPORT (COGSIMA), 2013, : 67 - 74
  • [4] Implementation of a Surface Electromyography-Based Upper Extremity Exoskeleton Controller Using Learning from Demonstration
    Siu, Ho Chit
    Arenas, Ana M.
    Sun, Tingxiao
    Stirling, Leia A.
    SENSORS, 2018, 18 (02):
  • [5] Trajectory learning and reproduction for differential drive mobile robots based on GMM/HMM and dynamic time warping using learning from demonstration framework
    Vukovic, Najdan
    Mitic, Marko
    Miljkovic, Zoran
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 45 : 388 - 404
  • [6] Tropical Cyclogenesis Detection From Remotely Sensed Sea Surface Winds Using Graphical and Statistical Features-Based Broad Learning System
    Wang, Sheng
    Yuen, Ka-Veng
    Yang, Xiaofeng
    Zhang, Yang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61