CausalFormer: An Interpretable Transformer for Temporal Causal Discovery

被引:0
作者
Kong, Lingbai [1 ]
Li, Wengen [1 ]
Yang, Hanchen [1 ]
Zhang, Yichao [1 ]
Guan, Jihong [1 ]
Zhou, Shuigeng [2 ]
机构
[1] Tongji Univ, Sch Comp Sci & Technol, Shanghai 201804, Peoples R China
[2] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Time series analysis; Cause effect analysis; Transformers; Predictive models; Deep learning; Convolution; Detectors; Mathematical models; Data models; Noise; Time series; temporal causal discovery; interpretability; transformer; TIME-SERIES; EXPLANATIONS;
D O I
10.1109/TKDE.2024.3484461
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Temporal causal discovery is a crucial task aimed at uncovering the causal relations within time series data. The latest temporal causal discovery methods usually train deep learning models on prediction tasks to uncover the causality between time series. They capture causal relations by analyzing the parameters of some components of the trained models, e.g., attention weights and convolution weights. However, this is an incomplete mapping process from the model parameters to the causality and fails to investigate the other components, e.g., fully connected layers and activation functions, that are also significant for causal discovery. To facilitate the utilization of the whole deep learning models in temporal causal discovery, we proposed an interpretable transformer-based causal discovery model termed CausalFormer, which consists of the causality-aware transformer and the decomposition-based causality detector. The causality-aware transformer learns the causal representation of time series data using a prediction task with the designed multi-kernel causal convolution which aggregates each input time series along the temporal dimension under the temporal priority constraint. Then, the decomposition-based causality detector interprets the global structure of the trained causality-aware transformer with the proposed regression relevance propagation to identify potential causal relations and finally construct the causal graph. Experiments on synthetic, simulated, and real datasets demonstrate the state-of-the-art performance of CausalFormer on discovering temporal causality.
引用
收藏
页码:102 / 115
页数:14
相关论文
共 51 条
  • [1] From attribution maps to human-understandable explanations through Concept Relevance Propagation
    Achtibat, Reduan
    Dreyer, Maximilian
    Eisenbraun, Ilona
    Bosse, Sebastian
    Wiegand, Thomas
    Samek, Wojciech
    Lapuschkin, Sebastian
    [J]. NATURE MACHINE INTELLIGENCE, 2023, 5 (09) : 1006 - +
  • [2] Assaad CK, 2022, J ARTIF INTELL RES, V73, P767
  • [3] On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation
    Bach, Sebastian
    Binder, Alexander
    Montavon, Gregoire
    Klauschen, Frederick
    Mueller, Klaus-Robert
    Samek, Wojciech
    [J]. PLOS ONE, 2015, 10 (07):
  • [4] Multivariate Granger causality and generalized variance
    Barrett, Adam B.
    Barnett, Lionel
    Seth, Anil K.
    [J]. PHYSICAL REVIEW E, 2010, 81 (04):
  • [5] Large-scale chemical process causal discovery from big data with transformer-based deep learning
    Bi, Xiaotian
    Wu, Deyang
    Xie, Daoxiong
    Ye, Huawei
    Zhao, Jinsong
    [J]. PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2023, 173 : 163 - 177
  • [6] Castri L, 2023, PR MACH LEARN RES, V208, P85
  • [7] Transformer Interpretability Beyond Attention Visualization
    Chefer, Hila
    Gur, Shir
    Wolf, Lior
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 782 - 791
  • [8] Cheng Y., 2023, P 11 INT C LEARN REP, P1
  • [9] Causal Inference from Noise
    Climenhaga, Nevin
    DesAutels, Lane
    Ramsey, Grant
    [J]. NOUS, 2021, 55 (01): : 152 - 170
  • [10] A Continual Learning Survey: Defying Forgetting in Classification Tasks
    De Lange, Matthias
    Aljundi, Rahaf
    Masana, Marc
    Parisot, Sarah
    Jia, Xu
    Leonardis, Ales
    Slabaugh, Greg
    Tuytelaars, Tinne
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (07) : 3366 - 3385