Attrition in longitudinal studies: How to deal with missing data

被引：377

作者：

Twisk, J ^{[1
]}

de Vente, W ^{[1
]}

机构：

[1] Free Univ Amsterdam, Inst Res Extramural Med, NL-1081 BT Amsterdam, Netherlands

来源：

JOURNAL OF CLINICAL EPIDEMIOLOGY | 2002年 / 55卷 / 04期

关键词：

attrition; longitudinal studies; missing data;

D O I：

10.1016/S0895-4356(01)00476-0

中图分类号：

R19 [保健组织与事业（卫生事业管理）];

学科分类号：

摘要：

The purpose of this paper was to illustrate the influence of missing data on the results of longitudinal statistical analyses [i.e., MANOVA for repeated measurements and Generalised Estimating Equations (GEE)] and to illustrate the influence of using different imputation methods to replace missing data. Besides a complete dataset, four incomplete datasets were considered: two datasets with 10% missing data and two datasets with 25% missing data. In both situations missingness was considered independent and dependent on observed data. Imputation methods were divided into cross-sectional methods (i.e., mean of series, hot deck, and cross-sectional regression) and longitudinal methods (i.e., last value carried forward, longitudinal interpolation, and longitudinal regression). Besides these, also the multiple imputation method was applied and discussed. The analyses were performed on a particular (observational) longitudinal dataset, with particular missing data patterns and imputation methods. The results of this illustration shows that when MANOVA for repeated measurements is used, imputation methods are highly recommendable (because MANOVA as implemented in the software used, uses listwise deletion of cases with a missing value). Applying GEE analysis, imputation methods were not necessary. When imputation methods were used, longitudinal imputation methods were often preferable ab9ove cross-sectional imputation methods, in a way that the point estimates and standard errors were closer to the estimates derived from the complete dataset. Furthermore, this study showed that the theoretically more valid multiple imputation method did not lead to different point estimates than the more simple (longitudinal) imputation methods, However, the estimated standard errors appeared to be theoretically more adequate, because they reflect the uncertainty in estimation caused by missing values. (C) 2002 Elsevier Science Inc. All rights reserved.

引用

页码：329 / 337

页数：9

共 18 条

[1] TESTING FOR RANDOM DROPOUTS IN REPEATED MEASUREMENT DATA [J].

DIGGLE, PJ .

BIOMETRICS, 1989, 45 (04) :1255-1258

[2]

GEBSKI V, 1992, SPIDA USERS MANUAL 6

[3]

KEMPER HCG, 1995, SK SPORTS SCI MONOGR, V6

[4]

Kleinbaum DG., 1988, STUDENTS PARTIAL SOL, Vvol 601

[5] REGRESSION-ANALYSIS FOR CORRELATED DATA [J].

LIANG, KY ;

ZEGER, SL .

ANNUAL REVIEW OF PUBLIC HEALTH, 1993, 14 :43-68

[6] MODELING THE DROP-OUT MECHANISM IN REPEATED-MEASURES STUDIES [J].

LITTLE, RJA .

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1995, 90 (431) :1112-1121

[7]

Little RJA, 1987, Statistical Analysis With Missing Data

[8]

Magnusson D., 1990, DATA QUALITY LONGITU

[9] Multiple imputation after 18+ years [J].

Rubin, DB .

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (434) :473-489

[10]

Rubin DonaldB., 1987, MULTIPLE IMPUTATIONS

← 1 2 →