Imputation for incomplete high-dimensional multivariate normal data using a common factor model

被引:11
作者
Song, JW
Belin, TR
机构
[1] Univ Texas, MD Anderson Canc Ctr, Dept Biostat & Appl Math, Houston, TX 77030 USA
[2] Univ Calif Los Angeles, Sch Publ Hlth, Ctr Hlth Sci 51 267, Dept Biostat, Los Angeles, CA 90095 USA
关键词
multiple imputation; missing data; factor analysis; ridge prior; suicide attempt; psychological outcomes;
D O I
10.1002/sim.1867
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
It is common in applied research to have large numbers of variables measured on a modest number of cases. Even with low rates of missingness on individual variables, such data sets can have a large number of incomplete cases. Here we present a new method for handling missing continuously scaled items in multivariate data, based on extracting common factors to reduce the number of covariance parameters to be estimated in a multivariate normal model. The technique is compared in several simulation settings to available-case analysis and to a multivariate normal model with a ridge prior. The method is also illustrated on a study with over 100 variables evaluating an emergency room intervention for adolescents who attempted suicide. Copyright (C) 2004 John Wiley Sons, Ltd.
引用
收藏
页码:2827 / 2843
页数:17
相关论文
共 40 条
[1]   VALIDATING REGRESSION PROCEDURES WITH NEW DATA [J].
BERK, KN .
TECHNOMETRICS, 1984, 26 (04) :331-338
[2]  
CONERLY MD, 1990, J STATISTICAL COMPUT, V34, P75
[3]   SIMULATION STUDY OF ALTERNATIVES TO ORDINARY LEAST-SQUARES [J].
DEMPSTER, AP ;
SCHATZOFF, M ;
WERMUTH, N .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1977, 72 (357) :77-93
[4]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[5]   A STATISTICAL VIEW OF SOME CHEMOMETRICS REGRESSION TOOLS [J].
FRANK, IE ;
FRIEDMAN, JH .
TECHNOMETRICS, 1993, 35 (02) :109-135
[6]   SAMPLING-BASED APPROACHES TO CALCULATING MARGINAL DENSITIES [J].
GELFAND, AE ;
SMITH, AFM .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1990, 85 (410) :398-409
[7]  
Gelman A., 1992, STAT SCI, V7, P457, DOI DOI 10.1214/SS/1177011136
[8]   STOCHASTIC RELAXATION, GIBBS DISTRIBUTIONS, AND THE BAYESIAN RESTORATION OF IMAGES [J].
GEMAN, S ;
GEMAN, D .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1984, 6 (06) :721-741
[9]  
Gilks W., 1995, Markov Chain Monte Carlo in Practice, DOI 10.1201/b14835
[10]   GENERALIZED MEAN SQUARED ERROR PROPERTIES OF REGRESSION ESTIMATORS [J].
GUNST, RF ;
MASON, RL .
COMMUNICATIONS IN STATISTICS PART A-THEORY AND METHODS, 1976, A 5 (15) :1501-1508