Robustness of Algorithms for Causal Structure Learning to Hyperparameter Choice

被引：0

作者：

Machlanski, Damian ^{[1
]}

Samothrakis, Spyridon ^{[2
]}

Clarke, Paul ^{[3
]}

机构：

[1] Univ Essex, Dept Comp Sci & Elect Engn, Colchester, Essex, England

[2] Univ Essex, Inst Analyt & Data Sci, Colchester, Essex, England

[3] Univ Essex, Inst Social & Econ Res, Colchester, Essex, England

来源：

CAUSAL LEARNING AND REASONING, VOL 236 | 2024年 / 236卷

基金：

英国经济与社会研究理事会;

关键词：

Hyperparameters; model selection; causal discovery; structure learning; performance evaluation; misspecification; robustness; SELECTION; MODELS;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Hyperparameters play a critical role in machine learning. Hyperparameter tuning can make the difference between state-of-the-art and poor prediction performance for any algorithm, but it is particularly challenging for structure learning due to its unsupervised nature. As a result, hyperparameter tuning is often neglected in favour of using the default values provided by a particular implementation of an algorithm. While there have been numerous studies on performance evaluation of causal discovery algorithms, how hyperparameters affect individual algorithms, as well as the choice of the best algorithm for a specific problem, has not been studied in depth before. This work addresses this gap by investigating the influence of hyperparameters on causal structure learning tasks. Specifically, we perform an empirical evaluation of hyperparameter selection for some seminal learning algorithms on datasets of varying levels of complexity. We find that, while the choice of algorithm remains crucial to obtaining state-of-the-art performance, hyperparameter selection in ensemble settings strongly influences the choice of algorithm, in that a poor choice of hyperparameters can lead to analysts using algorithms which do not give state-of-the-art performance for their data.

引用

页码：703 / 739

页数：37

共 49 条

[1]

Assaad CK, 2022, J ARTIF INTELL RES, V73, P767

[2] Emergence of scaling in random networks [J].

Barabási, AL ;

Albert, R .

SCIENCE, 1999, 286 (5439) :509-512

[3]

BengioY KeglB, 2011, ADV NEURAL INFORM PR, P2546

[4] Out-of-Sample Tuning for Causal Discovery [J].

Biza, Konstantina ;

Tsamardinos, Ioannis ;

Triantafillou, Sofia .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) :4963-4973

[5] CAM: CAUSAL ADDITIVE MODELS, HIGH-DIMENSIONAL ORDER SEARCH AND PENALIZED REGRESSION [J].

Buehlmann, Peter ;

Peters, Jonas ;

Ernest, Jan .

ANNALS OF STATISTICS, 2014, 42 (06) :2526-2556

[6]

Chickering D. M., 2003, Journal of Machine Learning Research, V3, P507, DOI 10.1162/153244303321897717

[7]

Chobtham K, 2023, Arxiv, DOI arXiv:2306.13932

[8]

Curth A, 2023, Arxiv, DOI arXiv:2302.02923

[9]

Eberhardt F., 2017, INT J DATA SCI ANAL, V3, P81, DOI [DOI 10.1007/S41060-016-0038-6, 10.1007/s41060-016-0038-6]

[10] Parameter tuning for configuring and analyzing evolutionary algorithms [J].

Eiben, A. E. ;

Smit, S. K. .

SWARM AND EVOLUTIONARY COMPUTATION, 2011, 1 (01) :19-31

← 1 2 3 4 5 →