Connectivity-Contrastive Learning: Combining Causal Discovery and Representation Learning for Multimodal Data

被引：0

作者：

Morioka, Hiroshi ^{[1
]}

Hyvarinen, Aapo ^{[2
,3
]}

机构：

[1] RIKEN AIP, Tokyo, Japan

[2] Univ Helsinki, Helsinki, Finland

[3] Univ Paris Saclay, INRIA, Paris, France

来源：

INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206 | 2023年 / 206卷

基金：

芬兰科学院;

关键词：

MODELS; ALGORITHMS; NETWORKS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Causal discovery methods typically extract causal relations between multiple nodes (variables) based on univariate observations of each node. However, one frequently encounters situations where each node is multivariate, i.e. has multiple observational modalities. Furthermore, the observed modalities may be generated through an unknown mixing process, so that some original latent variables are entangled inside the nodes. In such a multimodal case, the existing frameworks cannot be applied. To analyze such data, we propose a new causal representation learning framework called connectivity-contrastive learning (CCL). CCL disentangles the observational mixing and extracts a set of mutually independent latent components, each having a separate causal structure between the nodes. The actual learning proceeds by a novel self-supervised learning method in which the pretext task is to predict the label of a pair of nodes from the observations of the node pairs. We present theorems which show that CCL can indeed identify both the latent components and the multimodal causal structure under weak technical assumptions, up to some indeterminacy. Finally, we experimentally show its superior causal discovery performance compared to state-of-the-art baselines, in particular demonstrating robustness against latent confounders.

引用

页数：28

共 77 条

[1]

Andersson SA, 1997, ANN STAT, V25, P505

[2] DiSMEC - Distributed Sparse Machines for Extreme Multi-label Classification [J].

Babbar, Rohit ;

Schoelkopf, Bernhard .

WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, :721-729

[3] Uncovering the structure of clinical EEG signals with self-supervised learning [J].

Banville, Hubert ;

Chehab, Omar ;

Hyvarinen, Aapo ;

Engemann, Denis-Alexander ;

Gramfort, Alexandre .

JOURNAL OF NEURAL ENGINEERING, 2021, 18 (04)

[4]

Bollen K. A, 1989, Structural equations with latent variables

[5] CAM: CAUSAL ADDITIVE MODELS, HIGH-DIMENSIONAL ORDER SEARCH AND PENALIZED REGRESSION [J].

Buehlmann, Peter ;

Peters, Jonas ;

Ernest, Jan .

ANNALS OF STATISTICS, 2014, 42 (06) :2526-2556

[6]

Chen T, 2020, PMLR, P1597, DOI DOI 10.5555/3524938.3525087

[7]

Chen Xinshi, 2021, ADV NEURAL INFORM PR, V34, P11083

[8]

Chickering D. M., 2003, Journal of Machine Learning Research, V3, P507, DOI 10.1162/153244303321897717

[9]

Choi J., 2020, ADV NEURAL INFORM PR, V33, P5887

[10]

Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171

← 1 2 3 4 5 6 7 8 →