Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation

被引：0

作者：

Guan, Lin ^{[1
]}

Verma, Mudit ^{[1
]}

Guo, Sihang ^{[2
]}

Zhang, Ruohan ^{[3
]}

Kambhampati, Subbarao ^{[1
]}

机构：

[1] Arizona State Univ, Sch Comp & AI, Tempe, AZ 85281 USA

[2] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA

[3] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021) | 2021年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Human explanation (e.g., in terms of feature importance) has been recently used to extend the communication channel between human and agent in interactive machine learning. Under this setting, human trainers provide not only the ground truth but also some form of explanation. However, this kind of human guidance was only investigated in supervised learning tasks, and it remains unclear how to best incorporate this type of human knowledge into deep reinforcement learning. In this paper, we present the first study of using human visual explanations in human-in-the-loop reinforcement learning (HIRL). We focus on the task of learning from feedback, in which the human trainer not only gives binary evaluative "good" or "bad" feedback for queried state-action pairs, but also provides a visual explanation by annotating relevant features in images. We propose EXPAND (EXPlanation AugmeNted feeDback) to encourage the model to encode task-relevant features through a context-aware data augmentation that only perturbs irrelevant features in human salient information. We choose five tasks, namely Pixel-Taxi and four Atari games, to evaluate the performance and sample efficiency of this approach. We show that our method significantly outperforms methods leveraging human explanation that are adapted from supervised learning, and Human-in-the-loop RL baselines that only utilize evaluative feedback.

引用

页数：13

共 50 条

[1] Temporal prediction model with context-aware data augmentation for robust visual reinforcement learning
Yue, Xinkai
Ge, Hongwei
He, Xin
Hou, Yaqing
Neural Computing and Applications, 2024, 36 (31) : 19337 - 19352
[2] Context-aware reinforcement learning for course recommendation
Lin, Yuanguo
Lin, Fan
Yang, Lvqing
Zeng, Wenhua
Liu, Yong
Wu, Pengcheng
APPLIED SOFT COMPUTING, 2022, 125
[3] CAQ: Context-Aware Quantization via Reinforcement Learning
Tu, Zhijun
Ma, Jian
Xia, Tian
Zhao, Wenzhe
Ren, Pengju
Zheng, Nanning
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[4] Context-Aware Reinforcement Learning for Supporting WiFi Connectivity for Vehicles
Hussain, Mushahid
Franca, Felipe
Aguiar, Ana
2023 IEEE VEHICULAR NETWORKING CONFERENCE, VNC, 2023, : 65 - 68
[5] A context-aware approach to automated negotiation using reinforcement learning
Krohling, Dan E.
Chiotti, Omar J. A.
Martinez, Ernesto C.
ADVANCED ENGINEERING INFORMATICS, 2021, 47
[6] Context-Aware Mobility Management in HetNets: A Reinforcement Learning Approach
Simsek, Meryem
Bennis, Mehdi
Guvenc, Ismail
2015 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2015, : 1536 - 1541
[7] A context-aware approach to automated negotiation using reinforcement learning
Kröhling, Dan E.
Chiotti, Omar J.A.
Martínez, Ernesto C.
Advanced Engineering Informatics, 2021, 47
[8] Context-Aware Ranking by Constructing a Virtual Environment for Reinforcement Learning
Zhang, Junqi
Mao, Jiaxin
Liu, Yiqun
Zhang, Ruizhe
Zhang, Min
Ma, Shaoping
Xu, Jun
Tian, Qi
PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 1603 - 1612
[9] Context-aware reinforcement learning for cooling operation of data centers with an Aquifer Thermal Energy Storage
Leindals, Lukas
Gronning, Peter
Dominkovic, Dominik Franjo
Junker, Rune Gronborg
ENERGY AND AI, 2024, 17
[10] Context-Aware Data Augmentation for Efficient Object Detection by UAV Surveillance
Gordienko, Yuri
Rokovyi, Oleksandr
Alienin, Oleg
Stirenko, Sergii
2022 10TH INTERNATIONAL SYMPOSIUM ON DIGITAL FORENSICS AND SECURITY (ISDFS), 2022,

← 1 2 3 4 5 →