Detecting and quantifying causal associations in large nonlinear time series datasets

被引:475
|
作者
Runge, Jakob [1 ,2 ]
Nowack, Peer [2 ,3 ,4 ]
Kretschmer, Marlene [5 ,9 ]
Flaxman, Seth [4 ,6 ]
Sejdinovic, Dino [7 ,8 ]
机构
[1] German Aerosp Ctr, Inst Data Sci, D-07745 Jena, Germany
[2] Imperial Coll, Grantham Inst, London SW7 2AZ, England
[3] Imperial Coll, Dept Phys, Blackett Lab, London SW7 2AZ, England
[4] Imperial Coll, Data Sci Inst, London SW7 2AZ, England
[5] Potsdam Inst Climate Impact Res, D-14473 Potsdam, Germany
[6] Imperial Coll, Dept Math, London SW7 2AZ, England
[7] Alan Turing Inst Data Sci, London NW1 3DB, England
[8] Univ Oxford, Dept Stat, Oxford OX1 3LB, England
[9] Univ Reading, Dept Meteorol, Whiteknights Rd, Reading RG6 6BG, Berks, England
基金
英国工程与自然科学研究理事会;
关键词
GRANGER-CAUSALITY; SOUTHERN OSCILLATION; CONSISTENCY; REGRESSION; DISCOVERY; COUPLINGS; INFERENCE; SELECTION; FEEDBACK; LASSO;
D O I
10.1126/sciadv.aau4996
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Identifying causal relationships and quantifying their strength from observational time series data are key problems in disciplines dealing with complex dynamical systems such as the Earth system or the human body. Data-driven causal inference in such systems is challenging since datasets are often high dimensional and nonlinear with limited sample sizes. Here, we introduce a novel method that flexibly combines linear or nonlinear conditional independence tests with a causal discovery algorithm to estimate causal networks from large-scale time series datasets. We validate the method on time series of well-understood physical mechanisms in the climate system and the human heart and using large-scale synthetic datasets mimicking the typical properties of real-world data. The experiments demonstrate that our method outperforms state-of-the-art techniques in detection power, which opens up entirely new possibilities to discover and quantify causal networks from time series across a range of research fields.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Detecting Associations in Large Dataset on MapReduce
    Dai, Dong
    Li, Xi
    Wang, Chao
    Zhang, Junneng
    Zhou, Xuehai
    2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013), 2013, : 1788 - 1794
  • [42] Detecting Causality from Nonlinear Dynamics with Short-term Time Series
    Huanfei Ma
    Kazuyuki Aihara
    Luonan Chen
    Scientific Reports, 4
  • [43] Detecting Causality from Nonlinear Dynamics with Short-term Time Series
    Ma, Huanfei
    Aihara, Kazuyuki
    Chen, Luonan
    SCIENTIFIC REPORTS, 2014, 4
  • [44] Detecting and Quantifying the Nonlinear and Time-Variant Effects in FRF Measurements Using Periodic Excitations
    Pintelon, Rik
    Louarroudi, Ebrahim
    Lataire, John
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2013, 62 (12) : 3361 - 3373
  • [45] Mining Latent Sources of Causal Time Series Using Nonlinear State Space Modeling
    Chen, Wei-Shing
    Yu, Fong-Jung
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2011, PT I, 2011, 6591 : 137 - 148
  • [46] Unraveling complex temporal associations in cellular systems across multiple time-series microarray datasets
    Li, Wenyuan
    Xu, Min
    Zhou, Xianghong Jasmine
    JOURNAL OF BIOMEDICAL INFORMATICS, 2010, 43 (04) : 550 - 559
  • [47] Quantifying Organization of Atmospheric Turbulent Eddy Motion Using Nonlinear Time Series Analysis
    Karen H. Wesson
    Gabriel G. Katul
    Mario Siqueira
    Boundary-Layer Meteorology, 2003, 106 : 507 - 525
  • [48] Quantifying organization of atmospheric turbulent eddy motion using nonlinear time series analysis
    Wesson, KH
    Katul, GG
    Siqueira, M
    BOUNDARY-LAYER METEOROLOGY, 2003, 106 (03) : 507 - 525
  • [49] Nonlinear manipulation and analysis of large DNA datasets
    Cui, Meiying
    Zhao, Xueping
    Reddavide, Francesco, V
    Gaillez, Michelle Patino
    Heiden, Stephan
    Mannocci, Luca
    Thompson, Michael
    Zhang, Yixin
    NUCLEIC ACIDS RESEARCH, 2022, 50 (15) : 8974 - 8985
  • [50] On Detecting the Dependence of Time Series
    Dokuchaev, Nikolai
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2012, 41 (05) : 934 - 942