Simulation-Based Design Optimization for Statistical Power: Utilizing Machine Learning

被引：3

作者：

Zimmer, Felix ^{[1
]}

Debelak, Rudolf ^{[1
]}

机构：

[1] Univ Zurich, Dept Psychol, Div Psychol Methods Evaluat & Stat, Binzmuehlestr 14,Box 27, CH-8050 Zurich, Switzerland

来源：

PSYCHOLOGICAL METHODS | 2023年

基金：

瑞士国家科学基金会;

关键词：

simulation; sample size; power analysis; machine learning; SAMPLE-SIZE; EFFICIENT; PACKAGE; TRIALS;

D O I：

10.1037/met0000611

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

The planning of adequately powered research designs increasingly goes beyond determining a suitable sample size. More challenging scenarios demand simultaneous tuning of multiple design parameter dimensions and can only be addressed using Monte Carlo simulation if no analytical approach is available. In addition, cost considerations, for example, in terms of monetary costs, are a relevant target for optimization. In this context, optimal design parameters can imply a desired level of power at minimum cost or maximum power at a cost threshold. We introduce a surrogate modeling framework based on machine learning predictions to solve these optimization tasks. In a simulation study, we demonstrate the efficiency for a wide range of hypothesis testing scenarios with single- and multidimensional design parameters, including t tests, analysis of variance, item response theory models, multilevel models, and multiple imputations. Our framework provides an algorithmic solution for optimizing study designs when no analytic power analysis is available, handling multiple design dimensions and cost considerations. Our implementation is publicly available in the R package mlpwr.

引用

页数：25

共 53 条

[1] When power analyses based on pilot data are biased: Inaccurate effect size estimators and follow-up bias [J].