Comparisons of statistical methods for handling attrition in a follow-up visit with complex survey sampling

被引:4
作者
Cai, Jianwen [1 ,3 ]
Zeng, Donglin [1 ]
Li, Haolin [1 ]
Butera, Nicole M. [2 ]
Baldoni, Pedro L. [1 ]
Maitra, Poulami [1 ]
Dong, Li [1 ]
机构
[1] Univ N Carolina, Dept Biostat, Chapel Hill, NC USA
[2] George Washington Univ, Milken Inst Sch Publ Hlth, Biostat Ctr, Dept Biostat & Bioinformat, Rockville, MD USA
[3] Univ N Carolina, Dept Biostat, 3101 McGavran Greenberg Hall, Chapel Hill, NC 27599 USA
基金
美国国家卫生研究院;
关键词
attrition; complex survey sampling design; design-based analysis; missing data; MAXIMUM-LIKELIHOOD-ESTIMATION; MISSING-DATA; MULTIPLE-IMPUTATION; VARIANCE; DESIGN; MODELS;
D O I
10.1002/sim.9692
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Design-based analysis, which accounts for the design features of the study, is commonly used to conduct data analysis in studies with complex survey sampling, such as the Hispanic Community Health Study/Study of Latinos (HCHS/SOL). In this type of longitudinal study, attrition has often been a problem. Although there have been various statistical approaches proposed to handle attrition, such as inverse probability weighting (IPW), non-response cell weighting (NRCW), multiple imputation (MI), and full information maximum likelihood (FIML) approach, there has not been a systematic assessment of these methods to compare their performance in design-based analyses. In this article, we perform extensive simulation studies and compare the performance of different missing data methods in linear and generalized linear population models, and under different missing data mechanism. We find that the design-based analysis is able to produce valid estimation and statistical inference when the missing data are handled appropriately using IPW, NRCW, MI, or FIML approach under missing-completely-at-random or missing-at-random missing mechanism and when the missingness model is correctly specified or over-specified. We also illustrate the use of these methods using data from HCHS/SOL.
引用
收藏
页码:1641 / 1668
页数:28
相关论文
共 29 条
[3]  
Brick J M, 1996, Stat Methods Med Res, V5, P215, DOI 10.1177/096228029600500302
[4]   The impact of nonnormality on full information maximum-likelihood estimation for structural equation models with missing data [J].
Enders, CK .
PSYCHOLOGICAL METHODS, 2001, 6 (04) :352-370
[5]   A Primer on Maximum Likelihood Algorithms Available for Use With Missing Data [J].
Enders, Craig K. .
STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2001, 8 (01) :128-141
[6]   Consequences of not interpreting structure coefficients in published CFA research: A reminder [J].
Graham, JM ;
Guthrie, AC ;
Thompson, B .
STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2003, 10 (01) :142-153
[7]  
Grau E., 2006, SURVEY RES METHODS S, P3073
[8]  
Groves R.M., 2002, Survey Nonresponse
[9]   ANALYSIS OF INCOMPLETE DATA [J].
HARTLEY, HO ;
HOCKING, RR .
BIOMETRICS, 1971, 27 (04) :783-&
[10]   On the bias of the multiple-imputation variance estimator in survey sampling [J].
Kim, Jae Kwang ;
Brick, J. Michael ;
Fuller, Wayne A. ;
Kalton, Graham .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2006, 68 :509-521