CAUSAL DISCOVERY IN HEAVY-TAILED MODELS

被引:26
作者
Gnecco, Nicola [1 ]
Meinshausen, Nicolai [2 ]
Peters, Jonas [3 ]
Engelke, Sebastian [1 ]
机构
[1] Univ Geneva, Res Ctr Stat, Geneva, Switzerland
[2] Swiss Fed Inst Technol, Dept Math, Zurich, Switzerland
[3] Univ Copenhagen, Dept Math Sci, Copenhagen, Denmark
基金
瑞士国家科学基金会;
关键词
Causality; extreme value theory; heavy-tailed distributions; nonparametric estimation; INFERENCE;
D O I
10.1214/20-AOS2021
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Causal questions are omnipresent in many scientific problems. While much progress has been made in the analysis of causal relationships between random variables, these methods are not well suited if the causal mechanisms only manifest themselves in extremes. This work aims to connect the two fields of causal inference and extreme value theory. We define the causal tail coefficient that captures asymmetries in the extremal dependence of two random variables. In the population case, the causal tail coefficient is shown to reveal the causal structure if the distribution follows a linear structural causal model. This holds even in the presence of latent common causes that have the same tail index as the observed variables. Based on a consistent estimator of the causal tail coefficient, we propose a computationally highly efficient algorithm that estimates the causal structure. We prove that our method consistently recovers the causal order and we compare it to other well-established and nonextremal approaches in causal discovery on synthetic and real data. The code is available as an open-access R package.
引用
收藏
页码:1755 / 1778
页数:24
相关论文
共 52 条
[1]  
[Anonymous], 1990, REGIONAL ESTIMATION
[2]  
[Anonymous], 1996, Graphical Models. Oxford Statistical Science Series
[3]   EXTREMES ON RIVER NETWORKS [J].
Asadi, Peiman ;
Davison, Anthony C. ;
Engelke, Sebastian .
ANNALS OF APPLIED STATISTICS, 2015, 9 (04) :2023-2050
[4]   Regularly varying multivariate time series [J].
Basrak, Bojan ;
Segers, Johan .
STOCHASTIC PROCESSES AND THEIR APPLICATIONS, 2009, 119 (04) :1055-1080
[5]  
Bollen KA., 1989, STRUCTURAL EQUATIONS, DOI DOI 10.1002/9781118619179
[6]   CAM: CAUSAL ADDITIVE MODELS, HIGH-DIMENSIONAL ORDER SEARCH AND PENALIZED REGRESSION [J].
Buehlmann, Peter ;
Peters, Jonas ;
Ernest, Jan .
ANNALS OF STATISTICS, 2014, 42 (06) :2526-2556
[7]   Estimation of the marginal expected shortfall: the mean when a related variable is extreme [J].
Cai, Juan-Juan ;
Einmahl, John H. J. ;
de Haan, Laurens ;
Zhou, Chen .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2015, 77 (02) :417-442
[8]  
Claassen T., 2013, P 20 9 C UNC ART INT, P172
[9]  
Coles S., 2001, INTRO STAT MODELING, V208, DOI [10.1007/978-1-4471-3675-0, DOI 10.1007/978-1-4471-3675-0]
[10]   INDEPENDENT COMPONENT ANALYSIS, A NEW CONCEPT [J].
COMON, P .
SIGNAL PROCESSING, 1994, 36 (03) :287-314