Retrospective selection bias (or the benefit of hindsight)

被引:18
|
作者
Mulargia, F [1 ]
机构
[1] Univ Bologna, Dipartmento Fis, Settore Geofis, I-40127 Bologna, Italy
关键词
earthquake physics; selection bias;
D O I
10.1046/j.1365-246x.2001.01458.x
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The complexity of geophysical systems makes modelling them a formidable task, and in many cases research studies are still in the phenomenological stage. In earthquake physics, long timescales and the lack of any natural laboratory restrict research to retrospective analysis of data. Such 'fishing expedition' approaches lead to optimal selection of data, albeit not always consciously. This introduces significant biases, which are capable of falsely representing simple statistical fluctuations as significant anomalies requiring fundamental explanations. This paper identifies three different strategies for discriminating real issues from artefacts generated retrospectively. The first attempts to identify ab initio each optimal choice and account for it. Unfortunately, a satisfactory solution can only be achieved in particular cases. The second strategy acknowledges this difficulty as well as the unavoidable existence of bias, and classifies all 'anomalous' observations as artefacts unless their retrospective probability of occurrence is exceedingly low (for instance, beyond six standard deviations). However, such a strategy is also likely to reject some scientifically important anomalies. The third strategy relies on two separate steps with learning and validation performed on effectively independent sets of data. This approach appears to be preferable in the case of small samples, such as are frequently encountered in geophysics, but the requirement for forward validation implies long waiting times before credible conclusions can be reached. A practical application to pattern recognition, which is the prototype of retrospective 'fishing expeditions', is presented, illustrating that valid conclusions are hard to find.
引用
收藏
页码:489 / 496
页数:8
相关论文
共 50 条
  • [21] A Potential Outcomes Approach to Selection Bias
    Kenah, Eben
    EPIDEMIOLOGY, 2023, 34 (06) : 865 - 872
  • [22] Selection bias in semen studies due to self-selection of volunteers
    Muller, A
    De La Rochebrochard, E
    Labbé-Declèves, C
    Jouannet, P
    Bujan, L
    Mieusset, R
    Le Lannou, D
    Guerin, JF
    Benchaib, M
    Slama, R
    Spira, A
    HUMAN REPRODUCTION, 2004, 19 (12) : 2838 - 2844
  • [23] Probability Transform of Kriging Estimates and Its Effects on Selection Bias and Conditional Bias
    Bourgault, Gilles
    MATHEMATICAL GEOSCIENCES, 2022, 54 (02) : 345 - 362
  • [24] Sensitivity analysis of selection bias: a graphical display by bias-correction index
    Chung, Ping-Chen
    Lin, I-Feng
    PEERJ, 2023, 11
  • [25] Probability Transform of Kriging Estimates and Its Effects on Selection Bias and Conditional Bias
    Gilles Bourgault
    Mathematical Geosciences, 2022, 54 : 345 - 362
  • [26] Selection bias in a gravity model of agrifood trade
    Ul Haq, Zahoor
    Meilke, Karl
    Cranfield, John
    EUROPEAN REVIEW OF AGRICULTURAL ECONOMICS, 2013, 40 (02) : 331 - 360
  • [27] Small business survival and sample selection bias
    Cader, Hanas A.
    Leatherman, John C.
    SMALL BUSINESS ECONOMICS, 2011, 37 (02) : 155 - 165
  • [28] Selection bias in college admissions test scores
    Clark, Melissa
    Rothstein, Jesse
    Schanzenbach, Diane Whitmore
    ECONOMICS OF EDUCATION REVIEW, 2009, 28 (03) : 295 - 307
  • [29] Avoiding selection bias in metabolomics studies: a tutorial
    S. C. Boone
    S. le Cessie
    K. Willems van Dijk
    R. de Mutsert
    D. O. Mook-Kanamori
    Metabolomics, 2019, 15
  • [30] Learning to Rank with Selection Bias in Personal Search
    Wang, Xuanhui
    Bendersky, Michael
    Metzler, Donald
    Najork, Marc
    SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 115 - 124