Big Data and Its Epistemology

Cited by: 85
Authors
Fricke, Martin [1]
Affiliation
[1] Univ Arizona, SIRLS, Tucson, AZ 85719 USA
Keywords
HYPOTHESIS; INDUCTION; KNOWLEDGE; CRITIQUE
DOI
10.1002/asi.23212
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The article considers whether Big Data, in the form of data-driven science, will enable the discovery, or appraisal, of universal scientific theories, instrumentalist tools, or inductive inferences. It points out, initially, that such aspirations are similar to the now-discredited inductivist approach to science. On the positive side, Big Data may permit larger sample sizes, cheaper and more extensive testing of theories, and the continuous assessment of theories. On the negative side, data-driven science encourages passive data collection, as opposed to experimentation and testing, and hornswoggling (unsound statistical fiddling). The roles of theory and data in inductive algorithms, statistical modeling, and scientific discoveries are analyzed, and it is argued that theory is needed at every turn. Data-driven science is a chimera.
Pages: 651-661
Page count: 11