Initial State Interventions for Deconfounded Imitation Learning

被引：0

作者：

Pfrommer, Samuel ^{[1
]}

Bai, Yatong ^{[1
]}

Lee, Hyunin ^{[1
]}

Sojoudi, Somayeh ^{[1
]}

机构：

[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA

来源：

2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC | 2023年

关键词：

D O I：

10.1109/CDC49753.2023.10383252

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Imitation learning suffers from causal confusion. This phenomenon occurs when learned policies attend to features that do not causally influence the expert actions but are instead spuriously correlated. Causally confused agents produce low open-loop supervised loss but poor closed-loop performance upon deployment. We consider the problem of masking observed confounders in a disentangled representation of the observation space. Our novel masking algorithm leverages the usual ability to intervene in the initial system state, avoiding any requirement involving expert querying, expert reward functions, or causal graph specification. Under certain assumptions, we theoretically prove that this algorithm is conservative in the sense that it does not incorrectly mask observations that causally influence the expert; furthermore, intervening on the initial state serves to strictly reduce excess conservatism. The masking algorithm is applied to behavior cloning for two illustrative control systems: CartPole and Reacher.

引用

页码：2312 / 2319

页数：8

共 50 条

[21] Adversarial Imitation Learning between Agents with Different Numbers of State Dimensions
Yoshida, Taketo
Kuniyoshi, Yasuo
2019 IEEE SECOND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND KNOWLEDGE ENGINEERING (AIKE), 2019, : 179 - 186
[22] Multi-instance embedding learning with deconfounded instance-level prediction
Zhang, Yu-Xuan
Yang, Mei
Zhou, Zhengchun
Min, Fan
INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2023, 16 (03) : 391 - 401
[23] Multi-instance embedding learning with deconfounded instance-level prediction
Yu-Xuan Zhang
Mei Yang
Zhengchun Zhou
Fan Min
International Journal of Data Science and Analytics, 2023, 16 : 391 - 401
[24] Adaptive scheduling for Internet of Vehicles using deconfounded graph transfer learning
Liu, Xiuwen
Wang, Shuo
Chen, Yanjiao
COMPUTER NETWORKS, 2025, 256
[25] Contrastive Initial State Buffer for Reinforcement Learning
Messikommer, Nico
Song, Yunlong
Scaramuzza, Davide
2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 2866 - 2872
[26] Imitation and the effort of learning
Williams, Justin H. G.
BEHAVIORAL AND BRAIN SCIENCES, 2008, 31 (01) : 40 - +
[27] Social Learning and Imitation
不详
BULLETIN OF THE MENNINGER CLINIC, 1943, 7 (02) : 86 - 86
[28] SOCIAL LEARNING AND IMITATION
Roheim, Geza
PSYCHOANALYTIC QUARTERLY, 1943, 12 (02) : 280 - 281
[29] Social Learning and Imitation
Sletto, Raymond F.
ANNALS OF THE AMERICAN ACADEMY OF POLITICAL AND SOCIAL SCIENCE, 1942, 220 : 267 - 268
[30] SOCIAL LEARNING AND IMITATION
WODTKE, KH
BROWN, BR
REVIEW OF EDUCATIONAL RESEARCH, 1967, 37 (05) : 514 - 538

← 1 2 3 4 5 →