Estimating Counterfactual Treatment Outcomes Over Time in Complex Multiagent Scenarios

被引：1

作者：

Fujii, Keisuke ^{[1
,2
]}

Takeuchi, Koh ^{[2
,3
]}

Kuribayashi, Atsushi ^{[1
]}

Takeishi, Naoya ^{[2
,4
]}

Kawahara, Yoshinobu ^{[2
,5
]}

Takeda, Kazuya ^{[1
]}

机构：

[1] Nagoya Univ, Grad Sch Informat, Nagoya, Aichi 4648601, Japan

[2] RIKEN, Ctr Adv Intelligence Project, Wako, Saitama 3510198, Japan

[3] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan

[4] Univ Tokyo, Grad Sch Engn, CH-3960 Sierre, Switzerland

[5] Osaka Univ, Grad Sch Informat Sci & Technol, Suita, Osaka 5650871, Japan

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2025年 / 36卷 / 02期

基金：

日本科学技术振兴机构; 日本学术振兴会;

关键词：

Multi-agent systems; Behavioral sciences; Autonomous vehicles; Animals; Timing; Sports; Informatics; Autonomous vehicle; causal inference; deep generative model; multiagent modeling; sports; trajectory data; MARGINAL STRUCTURAL MODELS; CAUSAL INFERENCE;

D O I：

10.1109/TNNLS.2024.3361166

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Evaluation of intervention in a multiagent system, for example, when humans should intervene in autonomous driving systems and when a player should pass to teammates for a good shot, is challenging in various engineering and scientific fields. Estimating the individual treatment effect (ITE) using counterfactual long-term prediction is practical to evaluate such interventions. However, most of the conventional frameworks did not consider the time-varying complex structure of multiagent relationships and covariate counterfactual prediction. This may lead to erroneous assessments of ITE and difficulty in interpretation. Here, we propose an interpretable, counterfactual recurrent network in multiagent systems to estimate the effect of the intervention. Our model leverages graph variational recurrent neural networks (GVRNNs) and theory-based computation with domain knowledge for the ITE estimation framework based on long-term prediction of multiagent covariates and outcomes, which can confirm the circumstances under which the intervention is effective. On simulated models of an automated vehicle and biological agents with time-varying confounders, we show that our methods achieved lower estimation errors in counterfactual covariates and the most effective treatment timing than the baselines. Furthermore, using real basketball data, our methods performed realistic counterfactual predictions and evaluated the counterfactual passes in shot scenarios.

引用

页码：2103 / 2117

页数：15

共 82 条

[1] Conservative Policy Construction Using Variational Autoencoders for Logged Data With Missing Values
Abroshan, Mahed
Yip, Kai Hou
Tekin, Cem
van der Schaar, Mihaela
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (09) : 6368 - 6378
[2] Alaa AM, 2018, PR MACH LEARN RES, V80
[3] Baum-Snow N., 2015, Handbook of regional and urban economics, V5, P3
[4] Bica I., 2020, P INT C LEARN REPR
[5] Bica I, 2020, PR MACH LEARN RES, V119
[6] Chen G., 2021, arXiv
[7] Cho KYHY, 2014, Arxiv, DOI arXiv:1409.1259
[8] Chung J, 2015, ADV NEUR IN, V28
[9] Collective memory and spatial sorting in animal groups
Couzin, ID
Krause, J
James, R
Ruxton, GD
Franks, NR
[J]. JOURNAL OF THEORETICAL BIOLOGY, 2002, 218 (01) : 1 - 11
[10] Actions Speak Louder than Goals: Valuing Player Actions in Soccer
Decroos, Tom
Bransen, Lotte
Van Haaren, Jan
Davis, Jesse
[J]. KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 1851 - 1861

← 1 2 3 4 5 6 7 8 9 →