Learning Goal Conditioned Socially Compliant Navigation From Demonstration Using Risk-Based Features

被引：13

作者：

Konar, Abhisek ^{[1
]}

Baghi, Bobak H. ^{[1
]}

Dudek, Gregory ^{[1
,2
]}

机构：

[1] McGill Univ, Sch Comp Sci, Montreal, PQ H3A 2K6, Canada

[2] Samsung Elect AI Ctr Montrea1, Montreal, PQ H3B 4K4, Canada

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2021年 / 6卷 / 02期

关键词：

Navigation; Trajectory; Reinforcement learning; Entropy; Computational modeling; Two dimensional displays; Robot sensing systems; Inverse reinforcement learning; learning from demonstration; motion and path planning; robot navigation; social navigation; ROBOT;

D O I：

10.1109/LRA.2020.3048657

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

One of the main challenges of operating mobile robots in social environments is the safe and fluid navigation therein, specifically the ability to share a space with other human inhabitants by complying with the explicit and implicit rules that we humans follow during navigation. While these rules come naturally to us, they resist simple and explicit definitions. In this letter, we present a learning-based solution to address the question of socially compliant navigation, which is to navigate while maintaining adherence to the navigational policies a person might use. We infer these policies by learning from human examples using inverse reinforcement learning techniques. In particular, this letter contributes an efficient sampling-based approximation to enable model-free deep inverse reinforcement learning, and a goal conditioned risk-based feature representation that adequately captures local information surrounding the agent. We validate our approach by comparing against a classical algorithm and a reinforcement learning agent and evaluate our feature representation against similar feature representations from the literature. We find that the combination of our proposed method and our feature representation produce higher quality trajectories and that our proposed feature representation plays a critical role in successful navigation.

引用

页码：651 / 658

页数：8

共 6 条

[1] Socially Compliant Robot Navigation in Crowded Environment by Human Behavior Resemblance Using Deep Reinforcement Learning
Samsani, Sunil Srivatsav
Muhammad, Mannan Saeed
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (03) : 5223 - 5230
[2] Risk-based implementation of COLREGs for autonomous surface vehicles using deep reinforcement learning
Heiberg, Amalie
Larsen, Thomas Nakken
Meyer, Eivind
Rasheed, Adil
San, Omer
Varagnolo, Damiano
NEURAL NETWORKS, 2022, 152 : 17 - 33
[3] Towards Goal Based Architecture Design for Learning High-Level Representation of Behaviors from Demonstration
Fonooni, Benjamin
Hellstrom, Thomas
Janlert, Lars-Erik
2013 IEEE INTERNATIONAL MULTI-DISCIPLINARY CONFERENCE ON COGNITIVE METHODS IN SITUATION AWARENESS AND DECISION SUPPORT (COGSIMA), 2013, : 67 - 74
[4] Implementation of a Surface Electromyography-Based Upper Extremity Exoskeleton Controller Using Learning from Demonstration
Siu, Ho Chit
Arenas, Ana M.
Sun, Tingxiao
Stirling, Leia A.
SENSORS, 2018, 18 (02):
[5] Trajectory learning and reproduction for differential drive mobile robots based on GMM/HMM and dynamic time warping using learning from demonstration framework
Vukovic, Najdan
Mitic, Marko
Miljkovic, Zoran
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 45 : 388 - 404
[6] Tropical Cyclogenesis Detection From Remotely Sensed Sea Surface Winds Using Graphical and Statistical Features-Based Broad Learning System
Wang, Sheng
Yuen, Ka-Veng
Yang, Xiaofeng
Zhang, Yang
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61

← 1 →