Out-of-Sample Tuning for Causal Discovery

Cited by: 7

Authors:

Biza, Konstantina [1]

Tsamardinos, Ioannis [1]

Triantafillou, Sofia [2]

Affiliations:

[1] Univ Crete, Dept Comp Sci, Iraklion 70013, Greece

[2] Univ Crete, Dept Math & Appl Math, Iraklion 70013, Greece

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2024 / Vol. 35 / No. 04

Funding:

European Research Council;

Keywords:

Tuning; Markov processes; Data models; Stars; Task analysis; Predictive models; Estimation; Causal-based simulation; causal discovery; out-of-sample; tuning; DIRECTED ACYCLIC GRAPHS; MODEL; NETWORKS; LATENT;

DOI:

10.1109/TNNLS.2022.3185842

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Causal discovery is continually being enriched with new algorithms for learning causal graphical probabilistic models. Each one of them requires a set of hyperparameters, creating a great number of combinations. Given that the true graph is unknown and the learning task is unsupervised, the challenge to a practitioner is how to tune these choices. We propose out-of-sample causal tuning (OCT) that aims to select an optimal combination. The method treats a causal model as a set of predictive models and uses out-of-sample protocols for supervised methods. This approach can handle general settings like latent confounders and nonlinear relationships. The method uses an information-theoretic approach to be able to generalize to mixed data types and a penalty for dense graphs to penalize for complexity. To evaluate OCT, we introduce a causal-based simulation method to create datasets that mimic the properties of real-world problems. We evaluate OCT against two other tuning approaches, based on stability and in-sample fitting. We show that OCT performs well in many experimental settings and it is an effective tuning method for causal discovery.


Pages: 4963-4973

Page count: 11
