Advanced statistics: Missing data in clinical research - Part 1: An introduction and conceptual framework

被引:179
作者
Haukoos, Jason S.
Newgard, Craig D.
机构
[1] Univ Colorado, Hlth Sci Ctr, Denver Hlth Med Ctr, Dept Emergency Med, Denver, CO 80262 USA
[2] Univ Colorado, Hlth Sci Ctr, Dept Prevent Med, Denver, CO USA
[3] Oregon Hlth & Sci Univ, Ctr Policy Res Emergency Med, Dept Emergency Med, Portland, OR 97201 USA
关键词
missing data; bias; clinical research; statistical analysis; complete-case analysis; imputation; single imputation; mean imputation; regression imputation; hot deck imputation; last observation carried forward; worst case analysis;
D O I
10.1197/j.aem.2006.11.037
中图分类号
R4 [临床医学];
学科分类号
1002 ; 100602 ;
摘要
Missing data are commonly encountered in clinical research. Unfortunately, they are often neglected or not properly handled during analytic procedures, and this may substantially bias the results of the study, reduce study power, and lead to invalid conclusions. In this two-part series, the authors will introduce key concepts regarding missing data in clinical research, provide a conceptual framework for how to approach missing data in this setting, describe typical mechanisms and patterns of censoring of data and their relationships to specific methods of handling incomplete data, and describe in detail several simple and more complex methods of handling such data. In part 1, the authors will describe relatively simple approaches to handling missing data, including complete-case analysis, available-case analysis, and several forms of single imputation, including mean imputation, regression imputation, hot and cold deck imputation, last observation carried forward, and worst case analysis. In part 2, the authors will describe in detail multiple imputation, a more sophisticated and valid method for handling missing data.
引用
收藏
页码:662 / 668
页数:7
相关论文
共 37 条
[1]   The methods for handling missing data in clinical trials influence sample size requirements [J].
Auleley, GR ;
Giraudeau, B ;
Baron, G ;
Maillefert, JF ;
Dougados, M ;
Ravaud, P .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2004, 57 (05) :447-453
[2]   Direct likelihood analysis versus simple forms of imputation for missing data in randomized clinical trials [J].
Beunckens, C ;
Molenberghs, G ;
Kenward, MG .
CLINICAL TRIALS, 2005, 2 (05) :379-386
[3]  
Cochran WG., 1963, SAMPLING TECHNIQUES
[4]   A comparison of inclusive and restrictive strategies in modern missing data procedures [J].
Collins, LM ;
Schafer, JL ;
Kam, CM .
PSYCHOLOGICAL METHODS, 2001, 6 (04) :330-351
[5]   A COMPARISON OF ANALYTIC METHODS FOR NONRANDOM MISSINGNESS OF OUTCOME DATA [J].
CRAWFORD, SL ;
TENNSTEDT, SL ;
MCKINLAY, JB .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 1995, 48 (02) :209-219
[6]   Review: A gentle introduction to imputation of missing values [J].
Donders, A. Rogier T. ;
van der Heijden, Geert J. M. G. ;
Stijnen, Theo ;
Moons, Karel G. M. .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2006, 59 (10) :1087-1091
[7]   Uncertainty method improved on best-worst case analysis in a binary meta-analysis [J].
Gamble, C ;
Hollis, S .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2005, 58 (06) :579-588
[8]   A critical look at methods for handling missing covariates in epidemiologic regression analyses [J].
Greenland, S ;
Finkle, WD .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1995, 142 (12) :1255-1264
[9]   Imputing cross-sectional missing data: comparison of common techniques [J].
Hawthorne, G ;
Elliott, P .
AUSTRALIAN AND NEW ZEALAND JOURNAL OF PSYCHIATRY, 2005, 39 (07) :583-590
[10]   The mouse ortholog of the neuronal ceroid lipofuscinosis CLN5 gene encodes a soluble lysosomal glycoprotein expressed in the developing brain [J].
Holmberg, V ;
Jalanko, A ;
Isosomppi, J ;
Fabritius, AL ;
Peltonen, L ;
Kopra, O .
NEUROBIOLOGY OF DISEASE, 2004, 16 (01) :29-40