When power analyses based on pilot data are biased: Inaccurate effect size estimators and follow-up bias

被引:223
作者
Albers, Casper [1 ,3 ]
Lakens, Daniel [2 ,4 ]
机构
[1] Univ Groningen, Groningen, Netherlands
[2] Eindhoven Univ, Eindhoven, Netherlands
[3] Univ Groningen, Dept Psychol, Groningen, Netherlands
[4] Eindhoven Univ Technol, Sch Innovat Sci, Eindhoven, Netherlands
关键词
Effect size; Power analysis; Follow-up bias; Eta-squared; Omega-squared; Epsilon-squared; STATISTICAL POWER;
D O I
10.1016/j.jesp.2017.09.004
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
When designing a study, the planned sample size is often based on power analyses. One way to choose an effect size for power analyses is by relying on pilot data. A-priori power analyses are only accurate when the effect size estimate is accurate. In this paper we highlight two sources of bias when performing a-priori power analyses for between-subject designs based on pilot data. First, we examine how the choice of the effect size index (eta(2), omega(2) and epsilon(2)) affects the sample size and power of the main study. Based on our observations, we recommend against the use of eta(2) in a-priori power analyses. Second, we examine how the maximum sample size researchers are willing to collect in a main study (e.g. due to time or financial constraints) leads to overestimated effect size estimates in the studies that are performed. Determining the required sample size exclusively based on the effect size estimates from pilot data, and following up on pilot studies only when the sample size estimate for the main study is considered feasible, creates what we term follow-up bias. We explain how follow-up bias leads to underpowered main studies. Our simulations show that designing main studies based on effect sizes estimated from small pilot studies does not yield desired levels of power due to accuracy bias and follow-up bias, even when publication bias is not an issue. We urge researchers to consider alternative approaches to determining the sample size of their studies, and discuss several options.
引用
收藏
页码:187 / 195
页数:9
相关论文
共 41 条
  • [1] Estimating the reproducibility of psychological science
    Aarts, Alexander A.
    Anderson, Joanna E.
    Anderson, Christopher J.
    Attridge, Peter R.
    Attwood, Angela
    Axt, Jordan
    Babel, Molly
    Bahnik, Stepan
    Baranski, Erica
    Barnett-Cowan, Michael
    Bartmess, Elizabeth
    Beer, Jennifer
    Bell, Raoul
    Bentley, Heather
    Beyan, Leah
    Binion, Grace
    Borsboom, Denny
    Bosch, Annick
    Bosco, Frank A.
    Bowman, Sara D.
    Brandt, Mark J.
    Braswell, Erin
    Brohmer, Hilmar
    Brown, Benjamin T.
    Brown, Kristina
    Bruening, Jovita
    Calhoun-Sauls, Ann
    Callahan, Shannon P.
    Chagnon, Elizabeth
    Chandler, Jesse
    Chartier, Christopher R.
    Cheung, Felix
    Christopherson, Cody D.
    Cillessen, Linda
    Clay, Russ
    Cleary, Hayley
    Cloud, Mark D.
    Cohn, Michael
    Cohoon, Johanna
    Columbus, Simon
    Cordes, Andreas
    Costantini, Giulio
    Alvarez, Leslie D. Cramblet
    Cremata, Ed
    Crusius, Jan
    DeCoster, Jamie
    DeGaetano, Michelle A.
    Della Penna, Nicolas
    den Bezemer, Bobby
    Deserno, Marie K.
    [J]. SCIENCE, 2015, 349 (6251)
  • [2] Albers C., 2015, COMMENT WHY YOU SHOU
  • [3] Addressing the "Replication Crisis": Using Original Studies to Design Replication Studies with Appropriate Statistical Power
    Anderson, Samantha F.
    Maxwell, Scott E.
    [J]. MULTIVARIATE BEHAVIORAL RESEARCH, 2017, 52 (03) : 305 - 324
  • [4] [Anonymous], 2013, SIGNIFICANCE TESTING
  • [5] [Anonymous], 1963, STAT PSYCHOLOGISTS
  • [6] SAMPLING CHARACTERISTICS OF KELLEYS EPSILON-2 AND HAYS OMEGA-2
    CARROLL, RM
    NORDHOLM, LA
    [J]. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1975, 35 (03) : 541 - 554
  • [7] Cohen J., 1988, STAT POWER ANAL BEHA, DOI [10.4324/9780203771587, DOI 10.4324/9780203771587]
  • [8] A method of sampling inspection
    Dodge, HF
    Romig, HG
    [J]. BELL SYSTEM TECHNICAL JOURNAL, 1929, 8 : 613 - 631
  • [9] G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences
    Faul, Franz
    Erdfelder, Edgar
    Lang, Albert-Georg
    Buchner, Axel
    [J]. BEHAVIOR RESEARCH METHODS, 2007, 39 (02) : 175 - 191
  • [10] Fisher R. A., 1946, Statistical methods for research workers.