Causal scientific explanations from machine learning

被引:0
作者
Stefan Buijsman
机构
[1] TU Delft,
来源
Synthese | / 202卷
关键词
Scientific explanation; Machine learning; Causal inference; Artificial intelligence;
D O I
暂无
中图分类号
学科分类号
摘要
Machine learning is used more and more in scientific contexts, from the recent breakthroughs with AlphaFold2 in protein fold prediction to the use of ML in parametrization for large climate/astronomy models. Yet it is unclear whether we can obtain scientific explanations from such models. I argue that when machine learning is used to conduct causal inference we can give a new positive answer to this question. However, these ML models are purpose-built models and there are technical results showing that standard machine learning models cannot be used for the same type of causal inference. Instead, there is a pathway to causal explanations from predictive ML models through new explainability techniques; specifically, new methods to extract structural equation models from such ML models. The extracted models are likely to suffer from issues though: they will often fail to account for confounders and colliders, as well as deliver simply incorrect causal graphs due to ML models tendency to violate physical laws such as the conservation of energy. In this case, extracted graphs are a starting point for new explanations, but predictive accuracy is no guarantee for good explanations.
引用
收藏
相关论文
共 59 条
[1]  
Batterman RW(1992)Explanatory instability Nous 26 325-348
[2]  
Buijsman S(2022)Defining explanation and explanatory depth in XAI Minds and Machines 32 563-584
[3]  
Cao Y(2022)Machine learning-aided causal inference for unraveling chemical dispersant and salinity effects on crude oil biodegradation Bioresource Technology 345 261-265
[4]  
Kang Q(2017)Double/debiased/Neyman machine learning of treatment effects American Economic Review 107 5-32
[5]  
Zhang B(2017)Ontological distinctions between hardware and software Applied Ontology 12 9574-9586
[6]  
Zhu Z(2021)Causal abstractions of neural networks Advances in Neural Information Processing Systems 34 524-1897
[7]  
Dong G(2019)Review of causal discovery methods based on graphical models Frontiers in Genetics 10 1877-589
[8]  
Cai Q(2021)Understanding climate change with statistical downscaling and machine learning Synthese 199 583-1020
[9]  
Lee K(2021)Highly accurate protein structure prediction with alphafold Nature 596 1008-56
[10]  
Chen B(2021)Can machines learn how clouds work? The epistemic implications of machine learning methods in climate science Philosophy of Science 88 46-3156