CausalFormer: An Interpretable Transformer for Temporal Causal Discovery

被引：0

作者：

Kong, Lingbai ^{[1
]}

Li, Wengen ^{[1
]}

Yang, Hanchen ^{[1
]}

Zhang, Yichao ^{[1
]}

Guan, Jihong ^{[1
]}

Zhou, Shuigeng ^{[2
]}

机构：

[1] Tongji Univ, Sch Comp Sci & Technol, Shanghai 201804, Peoples R China

[2] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2025年 / 37卷 / 01期

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Time series analysis; Cause effect analysis; Transformers; Predictive models; Deep learning; Convolution; Detectors; Mathematical models; Data models; Noise; Time series; temporal causal discovery; interpretability; transformer; TIME-SERIES; EXPLANATIONS;

D O I：

10.1109/TKDE.2024.3484461

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Temporal causal discovery is a crucial task aimed at uncovering the causal relations within time series data. The latest temporal causal discovery methods usually train deep learning models on prediction tasks to uncover the causality between time series. They capture causal relations by analyzing the parameters of some components of the trained models, e.g., attention weights and convolution weights. However, this is an incomplete mapping process from the model parameters to the causality and fails to investigate the other components, e.g., fully connected layers and activation functions, that are also significant for causal discovery. To facilitate the utilization of the whole deep learning models in temporal causal discovery, we proposed an interpretable transformer-based causal discovery model termed CausalFormer, which consists of the causality-aware transformer and the decomposition-based causality detector. The causality-aware transformer learns the causal representation of time series data using a prediction task with the designed multi-kernel causal convolution which aggregates each input time series along the temporal dimension under the temporal priority constraint. Then, the decomposition-based causality detector interprets the global structure of the trained causality-aware transformer with the proposed regression relevance propagation to identify potential causal relations and finally construct the causal graph. Experiments on synthetic, simulated, and real datasets demonstrate the state-of-the-art performance of CausalFormer on discovering temporal causality.

引用

页码：102 / 115

页数：14

共 51 条

[1] From attribution maps to human-understandable explanations through Concept Relevance Propagation
Achtibat, Reduan
Dreyer, Maximilian
Eisenbraun, Ilona
Bosse, Sebastian
Wiegand, Thomas
Samek, Wojciech
Lapuschkin, Sebastian
[J]. NATURE MACHINE INTELLIGENCE, 2023, 5 (09) : 1006 - +
[2] Assaad CK, 2022, J ARTIF INTELL RES, V73, P767
[3] On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation
Bach, Sebastian
Binder, Alexander
Montavon, Gregoire
Klauschen, Frederick
Mueller, Klaus-Robert
Samek, Wojciech
[J]. PLOS ONE, 2015, 10 (07):
[4] Multivariate Granger causality and generalized variance
Barrett, Adam B.
Barnett, Lionel
Seth, Anil K.
[J]. PHYSICAL REVIEW E, 2010, 81 (04):
[5] Large-scale chemical process causal discovery from big data with transformer-based deep learning
Bi, Xiaotian
Wu, Deyang
Xie, Daoxiong
Ye, Huawei
Zhao, Jinsong
[J]. PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2023, 173 : 163 - 177
[6] Castri L, 2023, PR MACH LEARN RES, V208, P85
[7] Transformer Interpretability Beyond Attention Visualization
Chefer, Hila
Gur, Shir
Wolf, Lior
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 782 - 791
[8] Cheng Y., 2023, P 11 INT C LEARN REP, P1
[9] Causal Inference from Noise
Climenhaga, Nevin
DesAutels, Lane
Ramsey, Grant
[J]. NOUS, 2021, 55 (01): : 152 - 170
[10] A Continual Learning Survey: Defying Forgetting in Classification Tasks
De Lange, Matthias
Aljundi, Rahaf
Masana, Marc
Parisot, Sarah
Jia, Xu
Leonardis, Ales
Slabaugh, Greg
Tuytelaars, Tinne
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (07) : 3366 - 3385

← 1 2 3 4 5 6 →