When power analyses based on pilot data are biased: Inaccurate effect size estimators and follow-up bias

被引：233

作者：

Albers, Casper ^{[1
,3
]}

Lakens, Daniel ^{[2
,4
]}

机构：

[1] Univ Groningen, Groningen, Netherlands

[2] Eindhoven Univ, Eindhoven, Netherlands

[3] Univ Groningen, Dept Psychol, Groningen, Netherlands

[4] Eindhoven Univ Technol, Sch Innovat Sci, Eindhoven, Netherlands

来源：

JOURNAL OF EXPERIMENTAL SOCIAL PSYCHOLOGY | 2018年 / 74卷

关键词：

Effect size; Power analysis; Follow-up bias; Eta-squared; Omega-squared; Epsilon-squared; STATISTICAL POWER;

D O I：

10.1016/j.jesp.2017.09.004

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

When designing a study, the planned sample size is often based on power analyses. One way to choose an effect size for power analyses is by relying on pilot data. A-priori power analyses are only accurate when the effect size estimate is accurate. In this paper we highlight two sources of bias when performing a-priori power analyses for between-subject designs based on pilot data. First, we examine how the choice of the effect size index (eta(2), omega(2) and epsilon(2)) affects the sample size and power of the main study. Based on our observations, we recommend against the use of eta(2) in a-priori power analyses. Second, we examine how the maximum sample size researchers are willing to collect in a main study (e.g. due to time or financial constraints) leads to overestimated effect size estimates in the studies that are performed. Determining the required sample size exclusively based on the effect size estimates from pilot data, and following up on pilot studies only when the sample size estimate for the main study is considered feasible, creates what we term follow-up bias. We explain how follow-up bias leads to underpowered main studies. Our simulations show that designing main studies based on effect sizes estimated from small pilot studies does not yield desired levels of power due to accuracy bias and follow-up bias, even when publication bias is not an issue. We urge researchers to consider alternative approaches to determining the sample size of their studies, and discuss several options.

引用

页码：187 / 195

页数：9

共 41 条

[1] Estimating the reproducibility of psychological science [J].

Aarts, Alexander A. ;

Anderson, Joanna E. ;

Anderson, Christopher J. ;

Attridge, Peter R. ;

Attwood, Angela ;

Axt, Jordan ;

Babel, Molly ;

Bahnik, Stepan ;

Baranski, Erica ;

Barnett-Cowan, Michael ;

Bartmess, Elizabeth ;

Beer, Jennifer ;

Bell, Raoul ;

Bentley, Heather ;

Beyan, Leah ;

Binion, Grace ;

Borsboom, Denny ;

Bosch, Annick ;

Bosco, Frank A. ;

Bowman, Sara D. ;

Brandt, Mark J. ;

Braswell, Erin ;

Brohmer, Hilmar ;

Brown, Benjamin T. ;

Brown, Kristina ;

Bruening, Jovita ;

Calhoun-Sauls, Ann ;

Callahan, Shannon P. ;

Chagnon, Elizabeth ;

Chandler, Jesse ;

Chartier, Christopher R. ;

Cheung, Felix ;

Christopherson, Cody D. ;

Cillessen, Linda ;

Clay, Russ ;

Cleary, Hayley ;

Cloud, Mark D. ;

Cohn, Michael ;

Cohoon, Johanna ;

Columbus, Simon ;

Cordes, Andreas ;

Costantini, Giulio ;

Alvarez, Leslie D. Cramblet ;

Cremata, Ed ;

Crusius, Jan ;

DeCoster, Jamie ;

DeGaetano, Michelle A. ;

Della Penna, Nicolas ;

den Bezemer, Bobby ;

Deserno, Marie K. .

SCIENCE, 2015, 349 (6251)

[2]

Albers C., 2015, COMMENT WHY YOU SHOU

[3] Addressing the "Replication Crisis": Using Original Studies to Design Replication Studies with Appropriate Statistical Power [J].

Anderson, Samantha F. ;

Maxwell, Scott E. .

MULTIVARIATE BEHAVIORAL RESEARCH, 2017, 52 (03) :305-324

[4]

[Anonymous], 2013, SIGNIFICANCE TESTING

[5]

[Anonymous], 1963, STAT PSYCHOLOGISTS

[6] SAMPLING CHARACTERISTICS OF KELLEYS EPSILON-2 AND HAYS OMEGA-2 [J].

CARROLL, RM ;

NORDHOLM, LA .

EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1975, 35 (03) :541-554

[7]

Cohen J., 1988, STAT POWER ANAL BEHA, DOI [10.4324/9780203771587, DOI 10.4324/9780203771587]

[8] A method of sampling inspection [J].

Dodge, HF ;

Romig, HG .

BELL SYSTEM TECHNICAL JOURNAL, 1929, 8 :613-631

[9] G*Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences [J].

Faul, Franz ;

Erdfelder, Edgar ;

Lang, Albert-Georg ;

Buchner, Axel .

BEHAVIOR RESEARCH METHODS, 2007, 39 (02) :175-191

[10]

Fisher R. A., 1946, Statistical methods for research workers.

← 1 2 3 4 5 →