A human-centered safe robot reinforcement learning framework with interactive behaviors

被引：3

作者：

Gu, Shangding ^{[1
]}

Kshirsagar, Alap ^{[2
]}

Du, Yali ^{[3
]}

Chen, Guang ^{[4
]}

Peters, Jan ^{[2
]}

Knoll, Alois ^{[1
]}

机构：

[1] Tech Univ Munich, Dept Comp Sci, Munich, Germany

[2] Tech Univ Darmstadt, Dept Comp Sci, Darmstadt, Germany

[3] Kings Coll London, Dept Informat, London, England

[4] Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China

来源：

FRONTIERS IN NEUROROBOTICS | 2023年 / 17卷

基金：

欧盟地平线“2020”;

关键词：

interactive behaviors; safe exploration; value alignment; safe collaboration; bi-direction information;

D O I：

10.3389/fnbot.2023.1280341

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deployment of Reinforcement Learning (RL) algorithms for robotics applications in the real world requires ensuring the safety of the robot and its environment. Safe Robot RL (SRRL) is a crucial step toward achieving human-robot coexistence. In this paper, we envision a human-centered SRRL framework consisting of three stages: safe exploration, safety value alignment, and safe collaboration. We examine the research gaps in these areas and propose to leverage interactive behaviors for SRRL. Interactive behaviors enable bi-directional information transfer between humans and robots, such as conversational robot ChatGPT. We argue that interactive behaviors need further attention from the SRRL community. We discuss four open challenges related to the robustness, efficiency, transparency, and adaptability of SRRL with interactive behaviors.

引用

页数：8

共 50 条

[41] Framework for formalizing inconsistencies and deviations in human-centered systems
Cugola, Gianpaolo
Di Nitto, Elisabetta
Fuggetta, Alfonso
Ghezzi, Carlo
ACM Transactions on Software Engineering and Methodology, 1996, 5 (03): : 191 - 230
[42] A human-centered framework for innovation in conservation incentive programs
Sorice, Michael G.
Donlan, C. Josh
AMBIO, 2015, 44 (08) : 788 - 792
[43] A framework for human-centered provisioning of ambient media services
Hossain, M. Anwar
Parra, Jorge
Atrey, Pradeep K.
El Saddik, Abdulmotaleb
MULTIMEDIA TOOLS AND APPLICATIONS, 2009, 44 (03) : 407 - 431
[44] A framework for human-centered provisioning of ambient media services
M. Anwar Hossain
Jorge Parra
Pradeep K. Atrey
Abdulmotaleb El Saddik
Multimedia Tools and Applications, 2009, 44 : 407 - 431
[45] Personalizing Human-Robot Workplace Parameters in Human-Centered Manufacturing
Ojstersek, Robert
Buchmeister, Borut
Javernik, Aljaz
MACHINES, 2024, 12 (08)
[46] Human-Centered AI using Ethical Causality and Learning Representation for Multi-Agent Deep Reinforcement Learning
Ho, Joshua
Wang, Chien-Min
PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON HUMAN-MACHINE SYSTEMS (ICHMS), 2021, : 143 - 148
[47] Safe mobile robot navigation in human-centered environments using a heat map-based path planner
Ravankar, Abhijeet
Ravankar, Ankit A.
Hoshino, Yohei
Watanabe, Michiko
Kobayashi, Yukinori
ARTIFICIAL LIFE AND ROBOTICS, 2020, 25 (02) : 264 - 272
[48] Human-centered Benchmarking for Socially-compliant Robot Navigation
Okunevich, Iaroslav
Hilaire, Vincent
Galland, Stephane
Lamotte, Olivier
Shilova, Liubov
Ruichek, Yassine
Yan, Zhi
2023 EUROPEAN CONFERENCE ON MOBILE ROBOTS, ECMR, 2023, : 125 - 131
[49] Interactive Reinforcement Learning for Table Balancing Robot
Jeon, Haein
Kim, Yewon
Kang, Boyeong
SPLU-ROBONLP 2021: THE 2ND INTERNATIONAL COMBINED WORKSHOP ON SPATIAL LANGUAGE UNDERSTANDING AND GROUNDED COMMUNICATION FOR ROBOTICS, 2021, : 71 - 78
[50] Safe mobile robot navigation in human-centered environments using a heat map-based path planner
Abhijeet Ravankar
Ankit A. Ravankar
Yohei Hoshino
Michiko Watanabe
Yukinori Kobayashi
Artificial Life and Robotics, 2020, 25 : 264 - 272

← 1 2 3 4 5 →