The importance of expert knowledge in big data and machine learning

被引:0
作者
Jens Ulrik Hansen
Paula Quinon
机构
[1] Roskilde University,Department of People and Technology
[2] Warsaw University of Technology,Faculty of Administration and Social Sciences
来源
Synthese | / 201卷
关键词
Big data; Machine learning; Expert knowledge; Agnostic sciences; Inductive method; Role of theory; Paradigm shift;
D O I
暂无
中图分类号
学科分类号
摘要
According to popular belief, big data and machine learning provide a wholly novel approach to science that has the potential to revolutionise scientific progress and will ultimately lead to the ‘end of theory’. Proponents of this view argue that advanced algorithms are able to mine vast amounts of data relating to a given problem without any prior knowledge and that we do not need to concern ourselves with causality, as correlation is sufficient for handling complex issues. Consequently, the human contribution to scientific progress is deemed to be non-essential and replaceable. We, however, following the position most commonly represented in the philosophy of science, argue that the need for human expertise remains. Based on an analysis of big data and machine learning methods in two case studies—skin cancer detection and protein folding—we show that expert knowledge is essential and inherent in the application of these methods. Drawing on this analysis, we establish a classification of the different kinds of expert knowledge that are involved in the application of big data and machine learning in scientific contexts. We address the ramifications of a human-driven expert knowledge approach to big data and machine learning for scientific practice and the discussion about the role of theory. Finally, we show that the ways in which big data and machine learning both influence and are influenced by scientific methodology involve continuous conceptual shifts rather than a rigid paradigm change.
引用
收藏
相关论文
共 47 条
  • [1] Ankeny R(2011)Introduction: Philosophy of science in practice European Journal for Philosophy of Science 1 303-307
  • [2] Chang H(2007)What is scientific progress? Noûs 41 64-89
  • [3] Boumans M(2017)The deluge of spurious correlations in big data Foundations of Science 22 595-612
  • [4] Boon M(2017)Dermatologist-level classification of skin cancer with deep neural networks Nature 542 115-118
  • [5] Bird A(2021)Highly accurate protein structure prediction with AlphaFold Nature 596 583-589
  • [6] Calude CS(2014)Big data, new epistemologies and paradigm shifts Big Data & Society 16 1-20
  • [7] Longo G(2011)Agnostic science: Towards a philosophy of data analysis Foundations of Science 61 485-490
  • [8] Esteva A(2014)Is big data enough? A reflection on the changing role of mathematics in applications Notices of the AMS 81 96-104
  • [9] Kuprel B(2020)Big data and prediction: Four case studies Studies in History and Philosophy of Science Part A 82 905-916
  • [10] Novoa RA(2015)Aspects of theory-ladenness in data-intensive science Philosophy of Science 29 137-171