Aiming for a representative sample: Simulating random versus purposive strategies for hospital selection

被引:25
作者
van Hoeven, Loan R. [1 ,2 ]
Janssen, Mart P. [1 ,2 ]
Roes, Kit C. B. [1 ]
Koffijberg, Hendrik [1 ,3 ]
机构
[1] Univ Med Ctr Utrecht, Julius Ctr Hlth Sci & Primary Care, NL-3584 CG Utrecht, Netherlands
[2] Sanquin Res, Transfus Technol Assessment Dept, Sanquin Blood Supply, NL-3584 CG Utrecht, Netherlands
[3] Univ Twente, MIRA Inst Biomed Technol & Tech Med, Dept Hlth Technol & Serv Res, NL-7522 NB Enschede, Netherlands
关键词
Sampling strategy; Hospital selection; Representativeness; Random vs. purposive sampling; Maximum variation; Simulation; Model-based inference; DESIGN; HEALTH;
D O I
10.1186/s12874-015-0089-8
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: A ubiquitous issue in research is that of selecting a representative sample from the study population. While random sampling strategies are the gold standard, in practice, random sampling of participants is not always feasible nor necessarily the optimal choice. In our case, a selection must be made of 12 hospitals (out of 89 Dutch hospitals in total). With this selection of 12 hospitals, it should be possible to estimate blood use in the remaining hospitals as well. In this paper, we evaluate both random and purposive strategies for the case of estimating blood use in Dutch hospitals. Methods: Available population-wide data on hospital blood use and number of hospital beds are used to simulate five sampling strategies: (1) select only the largest hospitals, (2) select the largest and the smallest hospitals ('maximum variation'), (3) select hospitals randomly, (4) select hospitals from as many different geographic regions as possible, (5) select hospitals from only two regions. Simulations of each strategy result in different selections of hospitals, that are each used to estimate blood use in the remaining hospitals. The estimates are compared to the actual population values; the subsequent prediction errors are used to indicate the quality of the sampling strategy. Results: The strategy leading to the lowest prediction error in the case study was maximum variation sampling, followed by random, regional variation and two-region sampling, with sampling the largest hospitals resulting in the worst performance. Maximum variation sampling led to a hospital level prediction error of 15 %, whereas random sampling led to a prediction error of 19 % (95 % CI 17 %-26 %). While lowering the sample size reduced the differences between maximum variation and the random strategies, increasing sample size to n = 18 did not change the ranking of the strategies and led to only slightly better predictions. Conclusions: The optimal strategy for estimating blood use was maximum variation sampling. When proxy data are available, it is possible to evaluate random and purposive sampling strategies using simulations before the start of the study. The results enable researchers to make a more educated choice of an appropriate sampling strategy.
引用
收藏
页数:9
相关论文
共 23 条
[1]   Inference on Polychotomous Responses in Finite Populations [J].
Adhya, Sumanta ;
Banerjee, Tathagata ;
Chattopadhyay, Gaurangadeb .
SCANDINAVIAN JOURNAL OF STATISTICS, 2011, 38 (04) :788-800
[2]   Sampling in ecology and evolution - bridging the gap between theory and practice [J].
Albert, Cecile H. ;
Yoccoz, Nigel G. ;
Edwards, Thomas C., Jr. ;
Graham, Catherine H. ;
Zimmermann, Niklaus E. ;
Thuiller, Wilfried .
ECOGRAPHY, 2010, 33 (06) :1028-1037
[3]   A Class of Unbiased Estimators of the Average Treatment Effect in Randomized Experiments [J].
Aronow, Peter M. ;
Middleton, Joel A. .
JOURNAL OF CAUSAL INFERENCE, 2013, 1 (01) :135-154
[4]  
Banerjee Amitav, 2010, Ind Psychiatry J, V19, P60, DOI 10.4103/0972-6748.77642
[5]  
Buelens B, 2012, 201218 STAT NETH
[6]   Effects of sample survey design on the accuracy of classification tree models in species distribution models [J].
Edwards, Thomas C., Jr. ;
Cutler, D. Richard ;
Zimmermann, Niklaus E. ;
Geiser, Linda ;
Moisen, Gretchen G. .
ECOLOGICAL MODELLING, 2006, 199 (02) :132-141
[7]   Which is the optimal sampling strategy for habitat suitability modelling [J].
Hirzel, A ;
Guisan, A .
ECOLOGICAL MODELLING, 2002, 157 (2-3) :331-341
[8]   Improving Population Health Measurement in National Household Surveys: A Simulation Study of the Sample Design of the Comprehensive Survey of Living Conditions of the People on Health and Welfare in Japan [J].
Ikeda, Nayu ;
Shibuya, Kenji ;
Hashimoto, Hideki .
JOURNAL OF EPIDEMIOLOGY, 2011, 21 (05) :385-390
[9]   A Pragmatic Framework for Single-site and Multisite Data Quality Assessment in Electronic Health Record-based Clinical Research [J].
Kahn, Michael G. ;
Raebel, Marsha A. ;
Glanz, Jason M. ;
Riedlinger, Karen ;
Steiner, John F. .
MEDICAL CARE, 2012, 50 (07) :S21-S29
[10]  
Kish L., 2002, SURV METHODOL, V1, P31