Detecting and quantifying causal associations in large nonlinear time series datasets

被引:475
|
作者
Runge, Jakob [1 ,2 ]
Nowack, Peer [2 ,3 ,4 ]
Kretschmer, Marlene [5 ,9 ]
Flaxman, Seth [4 ,6 ]
Sejdinovic, Dino [7 ,8 ]
机构
[1] German Aerosp Ctr, Inst Data Sci, D-07745 Jena, Germany
[2] Imperial Coll, Grantham Inst, London SW7 2AZ, England
[3] Imperial Coll, Dept Phys, Blackett Lab, London SW7 2AZ, England
[4] Imperial Coll, Data Sci Inst, London SW7 2AZ, England
[5] Potsdam Inst Climate Impact Res, D-14473 Potsdam, Germany
[6] Imperial Coll, Dept Math, London SW7 2AZ, England
[7] Alan Turing Inst Data Sci, London NW1 3DB, England
[8] Univ Oxford, Dept Stat, Oxford OX1 3LB, England
[9] Univ Reading, Dept Meteorol, Whiteknights Rd, Reading RG6 6BG, Berks, England
基金
英国工程与自然科学研究理事会;
关键词
GRANGER-CAUSALITY; SOUTHERN OSCILLATION; CONSISTENCY; REGRESSION; DISCOVERY; COUPLINGS; INFERENCE; SELECTION; FEEDBACK; LASSO;
D O I
10.1126/sciadv.aau4996
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Identifying causal relationships and quantifying their strength from observational time series data are key problems in disciplines dealing with complex dynamical systems such as the Earth system or the human body. Data-driven causal inference in such systems is challenging since datasets are often high dimensional and nonlinear with limited sample sizes. Here, we introduce a novel method that flexibly combines linear or nonlinear conditional independence tests with a causal discovery algorithm to estimate causal networks from large-scale time series datasets. We validate the method on time series of well-understood physical mechanisms in the climate system and the human heart and using large-scale synthetic datasets mimicking the typical properties of real-world data. The experiments demonstrate that our method outperforms state-of-the-art techniques in detection power, which opens up entirely new possibilities to discover and quantify causal networks from time series across a range of research fields.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Discovering contemporaneous and lagged causal relations in autocorrelated nonlinear time series datasets
    Runge, Jakob
    CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 1388 - 1397
  • [2] Clustering of large time series datasets
    Aghabozorgi, Saeed
    Teh, Ying Wah
    INTELLIGENT DATA ANALYSIS, 2014, 18 (05) : 793 - 817
  • [3] Nonlinear Causal Discovery in Time Series
    Wu, Tianhao
    Wu, Xingyu
    Wang, Xin
    Liu, Shikang
    Chen, Huanhuan
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 4575 - 4579
  • [4] Methods for quantifying the causal structure of bivariate time series
    Lungarella, M.
    Ishiguro, K.
    Kuniyoshi, Y.
    Otsu, N.
    INTERNATIONAL JOURNAL OF BIFURCATION AND CHAOS, 2007, 17 (03): : 903 - 921
  • [5] Detecting and Quantifying Geometric Features in Large Series of Cluster Structures
    Joswig, Jan-Ole
    Lorenz, Tommy
    ZEITSCHRIFT FUR PHYSIKALISCHE CHEMIE-INTERNATIONAL JOURNAL OF RESEARCH IN PHYSICAL CHEMISTRY & CHEMICAL PHYSICS, 2016, 230 (5-7): : 1057 - 1066
  • [6] Causal Discovery for time series from multiple datasets with latent contexts
    Guenther, Wiebke
    Ninad, Urmi
    Runge, Jakob
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 766 - 776
  • [7] Detecting dynamical change in nonlinear time series
    Oak Ridge National Laboratory, P.O. Box 2009, Oak Ridge, TN 37831-8066, United States
    Phys Lett Sect A Gen At Solid State Phys, 2-3 (103-114):
  • [8] Detecting dynamical change in nonlinear time series
    Hively, LM
    Gailey, PC
    Protopopescu, VA
    PHYSICS LETTERS A, 1999, 258 (2-3) : 103 - 114
  • [9] FTSPlot: Fast Time Series Visualization for Large Datasets
    Riss, Michael
    PLOS ONE, 2014, 9 (04):
  • [10] Search for additive nonlinear time series causal models
    Chu, Tianjiao
    Glymour, Clark
    Journal of Machine Learning Research, 2008, 9 : 967 - 991