A human-centered safe robot reinforcement learning framework with interactive behaviors

被引:3
|
作者
Gu, Shangding [1 ]
Kshirsagar, Alap [2 ]
Du, Yali [3 ]
Chen, Guang [4 ]
Peters, Jan [2 ]
Knoll, Alois [1 ]
机构
[1] Tech Univ Munich, Dept Comp Sci, Munich, Germany
[2] Tech Univ Darmstadt, Dept Comp Sci, Darmstadt, Germany
[3] Kings Coll London, Dept Informat, London, England
[4] Tongji Univ, Coll Elect & Informat Engn, Shanghai, Peoples R China
基金
欧盟地平线“2020”;
关键词
interactive behaviors; safe exploration; value alignment; safe collaboration; bi-direction information;
D O I
10.3389/fnbot.2023.1280341
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deployment of Reinforcement Learning (RL) algorithms for robotics applications in the real world requires ensuring the safety of the robot and its environment. Safe Robot RL (SRRL) is a crucial step toward achieving human-robot coexistence. In this paper, we envision a human-centered SRRL framework consisting of three stages: safe exploration, safety value alignment, and safe collaboration. We examine the research gaps in these areas and propose to leverage interactive behaviors for SRRL. Interactive behaviors enable bi-directional information transfer between humans and robots, such as conversational robot ChatGPT. We argue that interactive behaviors need further attention from the SRRL community. We discuss four open challenges related to the robustness, efficiency, transparency, and adaptability of SRRL with interactive behaviors.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Framework for formalizing inconsistencies and deviations in human-centered systems
    Cugola, Gianpaolo
    Di Nitto, Elisabetta
    Fuggetta, Alfonso
    Ghezzi, Carlo
    ACM Transactions on Software Engineering and Methodology, 1996, 5 (03): : 191 - 230
  • [42] A human-centered framework for innovation in conservation incentive programs
    Sorice, Michael G.
    Donlan, C. Josh
    AMBIO, 2015, 44 (08) : 788 - 792
  • [43] A framework for human-centered provisioning of ambient media services
    Hossain, M. Anwar
    Parra, Jorge
    Atrey, Pradeep K.
    El Saddik, Abdulmotaleb
    MULTIMEDIA TOOLS AND APPLICATIONS, 2009, 44 (03) : 407 - 431
  • [44] A framework for human-centered provisioning of ambient media services
    M. Anwar Hossain
    Jorge Parra
    Pradeep K. Atrey
    Abdulmotaleb El Saddik
    Multimedia Tools and Applications, 2009, 44 : 407 - 431
  • [45] Personalizing Human-Robot Workplace Parameters in Human-Centered Manufacturing
    Ojstersek, Robert
    Buchmeister, Borut
    Javernik, Aljaz
    MACHINES, 2024, 12 (08)
  • [46] Human-Centered AI using Ethical Causality and Learning Representation for Multi-Agent Deep Reinforcement Learning
    Ho, Joshua
    Wang, Chien-Min
    PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON HUMAN-MACHINE SYSTEMS (ICHMS), 2021, : 143 - 148
  • [47] Safe mobile robot navigation in human-centered environments using a heat map-based path planner
    Ravankar, Abhijeet
    Ravankar, Ankit A.
    Hoshino, Yohei
    Watanabe, Michiko
    Kobayashi, Yukinori
    ARTIFICIAL LIFE AND ROBOTICS, 2020, 25 (02) : 264 - 272
  • [48] Human-centered Benchmarking for Socially-compliant Robot Navigation
    Okunevich, Iaroslav
    Hilaire, Vincent
    Galland, Stephane
    Lamotte, Olivier
    Shilova, Liubov
    Ruichek, Yassine
    Yan, Zhi
    2023 EUROPEAN CONFERENCE ON MOBILE ROBOTS, ECMR, 2023, : 125 - 131
  • [49] Interactive Reinforcement Learning for Table Balancing Robot
    Jeon, Haein
    Kim, Yewon
    Kang, Boyeong
    SPLU-ROBONLP 2021: THE 2ND INTERNATIONAL COMBINED WORKSHOP ON SPATIAL LANGUAGE UNDERSTANDING AND GROUNDED COMMUNICATION FOR ROBOTICS, 2021, : 71 - 78
  • [50] Safe mobile robot navigation in human-centered environments using a heat map-based path planner
    Abhijeet Ravankar
    Ankit A. Ravankar
    Yohei Hoshino
    Michiko Watanabe
    Yukinori Kobayashi
    Artificial Life and Robotics, 2020, 25 : 264 - 272