Learning Latent Representations to Influence Multi-Agent Interaction

Cited by: 0

Authors:

Xie, Annie [1]

Losey, Dylan P. [2]

Tolsma, Ryan [1]

Finn, Chelsea [1]

Sadigh, Dorsa [1]

Affiliations:

[1] Stanford Univ, Stanford, CA USA

[2] Virginia Tech, Blacksburg, VA USA

Source:

CONFERENCE ON ROBOT LEARNING, VOL 155 | 2020 / Vol. 155

Keywords:

multi-agent systems; human-robot interaction; reinforcement learning; ROBOT

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory]

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Seamlessly interacting with humans or robots is hard because these agents are non-stationary. They update their policy in response to the ego agent's behavior, and the ego agent must anticipate these changes to co-adapt. Inspired by humans, we recognize that robots do not need to explicitly model every low-level action another agent will make; instead, we can capture the latent strategy of other agents through high-level representations. We propose a reinforcement learning-based framework for learning latent representations of an agent's policy, where the ego agent identifies the relationship between its behavior and the other agent's future strategy. The ego agent then leverages these latent dynamics to influence the other agent, purposely guiding them towards policies suitable for co-adaptation. Across several simulated domains and a real-world air hockey game, our approach outperforms the alternatives and learns to influence the other agent.


Pages: 575-588

Page count: 14
