Robustness of Algorithms for Causal Structure Learning to Hyperparameter Choice

被引:0
作者
Machlanski, Damian [1 ]
Samothrakis, Spyridon [2 ]
Clarke, Paul [3 ]
机构
[1] Univ Essex, Dept Comp Sci & Elect Engn, Colchester, Essex, England
[2] Univ Essex, Inst Analyt & Data Sci, Colchester, Essex, England
[3] Univ Essex, Inst Social & Econ Res, Colchester, Essex, England
来源
CAUSAL LEARNING AND REASONING, VOL 236 | 2024年 / 236卷
基金
英国经济与社会研究理事会;
关键词
Hyperparameters; model selection; causal discovery; structure learning; performance evaluation; misspecification; robustness; SELECTION; MODELS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hyperparameters play a critical role in machine learning. Hyperparameter tuning can make the difference between state-of-the-art and poor prediction performance for any algorithm, but it is particularly challenging for structure learning due to its unsupervised nature. As a result, hyperparameter tuning is often neglected in favour of using the default values provided by a particular implementation of an algorithm. While there have been numerous studies on performance evaluation of causal discovery algorithms, how hyperparameters affect individual algorithms, as well as the choice of the best algorithm for a specific problem, has not been studied in depth before. This work addresses this gap by investigating the influence of hyperparameters on causal structure learning tasks. Specifically, we perform an empirical evaluation of hyperparameter selection for some seminal learning algorithms on datasets of varying levels of complexity. We find that, while the choice of algorithm remains crucial to obtaining state-of-the-art performance, hyperparameter selection in ensemble settings strongly influences the choice of algorithm, in that a poor choice of hyperparameters can lead to analysts using algorithms which do not give state-of-the-art performance for their data.
引用
收藏
页码:703 / 739
页数:37
相关论文
共 49 条
  • [1] Assaad CK, 2022, J ARTIF INTELL RES, V73, P767
  • [2] Emergence of scaling in random networks
    Barabási, AL
    Albert, R
    [J]. SCIENCE, 1999, 286 (5439) : 509 - 512
  • [3] Bergstra James, 2011, ADV NEURAL INFORM PR, V24, DOI DOI 10.5555/2986459.2986743
  • [4] Out-of-Sample Tuning for Causal Discovery
    Biza, Konstantina
    Tsamardinos, Ioannis
    Triantafillou, Sofia
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 4963 - 4973
  • [5] CAM: CAUSAL ADDITIVE MODELS, HIGH-DIMENSIONAL ORDER SEARCH AND PENALIZED REGRESSION
    Buehlmann, Peter
    Peters, Jonas
    Ernest, Jan
    [J]. ANNALS OF STATISTICS, 2014, 42 (06) : 2526 - 2556
  • [6] Chickering D. M., 2003, Journal of Machine Learning Research, V3, P507, DOI 10.1162/153244303321897717
  • [7] Chobtham K, 2023, Arxiv, DOI arXiv:2306.13932
  • [8] Curth A, 2023, Arxiv, DOI arXiv:2302.02923
  • [9] Eberhardt F., 2017, International Journal of Data Science and Analytics, V3, P81, DOI [10.1007/s41060-016-0038-6, DOI 10.1007/S41060-016-0038-6]
  • [10] Parameter tuning for configuring and analyzing evolutionary algorithms
    Eiben, A. E.
    Smit, S. K.
    [J]. SWARM AND EVOLUTIONARY COMPUTATION, 2011, 1 (01) : 19 - 31