Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation

被引:0
|
作者
Guan, Lin [1 ]
Verma, Mudit [1 ]
Guo, Sihang [2 ]
Zhang, Ruohan [3 ]
Kambhampati, Subbarao [1 ]
机构
[1] Arizona State Univ, Sch Comp & AI, Tempe, AZ 85281 USA
[2] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
[3] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021) | 2021年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human explanation (e.g., in terms of feature importance) has been recently used to extend the communication channel between human and agent in interactive machine learning. Under this setting, human trainers provide not only the ground truth but also some form of explanation. However, this kind of human guidance was only investigated in supervised learning tasks, and it remains unclear how to best incorporate this type of human knowledge into deep reinforcement learning. In this paper, we present the first study of using human visual explanations in human-in-the-loop reinforcement learning (HIRL). We focus on the task of learning from feedback, in which the human trainer not only gives binary evaluative "good" or "bad" feedback for queried state-action pairs, but also provides a visual explanation by annotating relevant features in images. We propose EXPAND (EXPlanation AugmeNted feeDback) to encourage the model to encode task-relevant features through a context-aware data augmentation that only perturbs irrelevant features in human salient information. We choose five tasks, namely Pixel-Taxi and four Atari games, to evaluate the performance and sample efficiency of this approach. We show that our method significantly outperforms methods leveraging human explanation that are adapted from supervised learning, and Human-in-the-loop RL baselines that only utilize evaluative feedback.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Temporal prediction model with context-aware data augmentation for robust visual reinforcement learning
    Yue, Xinkai
    Ge, Hongwei
    He, Xin
    Hou, Yaqing
    Neural Computing and Applications, 2024, 36 (31) : 19337 - 19352
  • [2] Context-aware reinforcement learning for course recommendation
    Lin, Yuanguo
    Lin, Fan
    Yang, Lvqing
    Zeng, Wenhua
    Liu, Yong
    Wu, Pengcheng
    APPLIED SOFT COMPUTING, 2022, 125
  • [3] CAQ: Context-Aware Quantization via Reinforcement Learning
    Tu, Zhijun
    Ma, Jian
    Xia, Tian
    Zhao, Wenzhe
    Ren, Pengju
    Zheng, Nanning
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [4] Context-Aware Reinforcement Learning for Supporting WiFi Connectivity for Vehicles
    Hussain, Mushahid
    Franca, Felipe
    Aguiar, Ana
    2023 IEEE VEHICULAR NETWORKING CONFERENCE, VNC, 2023, : 65 - 68
  • [5] A context-aware approach to automated negotiation using reinforcement learning
    Krohling, Dan E.
    Chiotti, Omar J. A.
    Martinez, Ernesto C.
    ADVANCED ENGINEERING INFORMATICS, 2021, 47
  • [6] Context-Aware Mobility Management in HetNets: A Reinforcement Learning Approach
    Simsek, Meryem
    Bennis, Mehdi
    Guvenc, Ismail
    2015 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2015, : 1536 - 1541
  • [7] A context-aware approach to automated negotiation using reinforcement learning
    Kröhling, Dan E.
    Chiotti, Omar J.A.
    Martínez, Ernesto C.
    Advanced Engineering Informatics, 2021, 47
  • [8] Context-Aware Ranking by Constructing a Virtual Environment for Reinforcement Learning
    Zhang, Junqi
    Mao, Jiaxin
    Liu, Yiqun
    Zhang, Ruizhe
    Zhang, Min
    Ma, Shaoping
    Xu, Jun
    Tian, Qi
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 1603 - 1612
  • [9] Context-aware reinforcement learning for cooling operation of data centers with an Aquifer Thermal Energy Storage
    Leindals, Lukas
    Gronning, Peter
    Dominkovic, Dominik Franjo
    Junker, Rune Gronborg
    ENERGY AND AI, 2024, 17
  • [10] Context-Aware Data Augmentation for Efficient Object Detection by UAV Surveillance
    Gordienko, Yuri
    Rokovyi, Oleksandr
    Alienin, Oleg
    Stirenko, Sergii
    2022 10TH INTERNATIONAL SYMPOSIUM ON DIGITAL FORENSICS AND SECURITY (ISDFS), 2022,