Estimating Counterfactual Treatment Outcomes Over Time in Complex Multiagent Scenarios

被引:1
作者
Fujii, Keisuke [1 ,2 ]
Takeuchi, Koh [2 ,3 ]
Kuribayashi, Atsushi [1 ]
Takeishi, Naoya [2 ,4 ]
Kawahara, Yoshinobu [2 ,5 ]
Takeda, Kazuya [1 ]
机构
[1] Nagoya Univ, Grad Sch Informat, Nagoya, Aichi 4648601, Japan
[2] RIKEN, Ctr Adv Intelligence Project, Wako, Saitama 3510198, Japan
[3] Kyoto Univ, Grad Sch Informat, Kyoto 6068501, Japan
[4] Univ Tokyo, Grad Sch Engn, CH-3960 Sierre, Switzerland
[5] Osaka Univ, Grad Sch Informat Sci & Technol, Suita, Osaka 5650871, Japan
基金
日本科学技术振兴机构; 日本学术振兴会;
关键词
Multi-agent systems; Behavioral sciences; Autonomous vehicles; Animals; Timing; Sports; Informatics; Autonomous vehicle; causal inference; deep generative model; multiagent modeling; sports; trajectory data; MARGINAL STRUCTURAL MODELS; CAUSAL INFERENCE;
D O I
10.1109/TNNLS.2024.3361166
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Evaluation of intervention in a multiagent system, for example, when humans should intervene in autonomous driving systems and when a player should pass to teammates for a good shot, is challenging in various engineering and scientific fields. Estimating the individual treatment effect (ITE) using counterfactual long-term prediction is practical to evaluate such interventions. However, most of the conventional frameworks did not consider the time-varying complex structure of multiagent relationships and covariate counterfactual prediction. This may lead to erroneous assessments of ITE and difficulty in interpretation. Here, we propose an interpretable, counterfactual recurrent network in multiagent systems to estimate the effect of the intervention. Our model leverages graph variational recurrent neural networks (GVRNNs) and theory-based computation with domain knowledge for the ITE estimation framework based on long-term prediction of multiagent covariates and outcomes, which can confirm the circumstances under which the intervention is effective. On simulated models of an automated vehicle and biological agents with time-varying confounders, we show that our methods achieved lower estimation errors in counterfactual covariates and the most effective treatment timing than the baselines. Furthermore, using real basketball data, our methods performed realistic counterfactual predictions and evaluated the counterfactual passes in shot scenarios.
引用
收藏
页码:2103 / 2117
页数:15
相关论文
共 82 条
  • [1] Conservative Policy Construction Using Variational Autoencoders for Logged Data With Missing Values
    Abroshan, Mahed
    Yip, Kai Hou
    Tekin, Cem
    van der Schaar, Mihaela
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (09) : 6368 - 6378
  • [2] Alaa AM, 2018, PR MACH LEARN RES, V80
  • [3] Baum-Snow N., 2015, Handbook of regional and urban economics, V5, P3
  • [4] Bica I., 2020, P INT C LEARN REPR
  • [5] Bica I, 2020, PR MACH LEARN RES, V119
  • [6] Chen G., 2021, arXiv
  • [7] Cho KYHY, 2014, Arxiv, DOI arXiv:1409.1259
  • [8] Chung J, 2015, ADV NEUR IN, V28
  • [9] Collective memory and spatial sorting in animal groups
    Couzin, ID
    Krause, J
    James, R
    Ruxton, GD
    Franks, NR
    [J]. JOURNAL OF THEORETICAL BIOLOGY, 2002, 218 (01) : 1 - 11
  • [10] Actions Speak Louder than Goals: Valuing Player Actions in Soccer
    Decroos, Tom
    Bransen, Lotte
    Van Haaren, Jan
    Davis, Jesse
    [J]. KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, : 1851 - 1861