Mixture models of missing data

被引:5
作者
Rudas, T [1 ]
机构
[1] Eotvos Lorand Univ, Fac Social Sci, Dept Stat, H-1117 Budapest, Hungary
基金
匈牙利科学研究基金会;
关键词
missing data; mixture index of fit; model diagnostics; no-fit rate; no-observation rate;
D O I
10.1007/s11135-004-5945-2
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
This paper proposes a general framework for the analysis of survey data with missing observations. The approach presented here treats missing data as an unavoidable feature of any survey of the human population and aims at incorporating the unobserved part of the data into the analysis rather than trying to avoid it or make up for it. To handle coverage error and unit non-response, the true distribution is modeled as a mixture of an observable and of an unobservable component. Generally, for the unobserved component, its relative size (the no-observation rate) and its distribution are not known. It is assumed that the goal of the analysis is to assess the fit of a statistical model, and for this purpose the mixture index of fit is used. The mixture index of fit does not postulate that the statistical model of interest is able to account for the entire population rather, that it may only describe a fraction of it. This leads to another mixture representation of the true distribution, with one component from the statistical model of interest and another unrestricted one. Inference with respect to the fit of the model, with missing data taken into account, is obtained by equating these two mixtures and asking, for different no-observation rates, what is the largest fraction of the population where the statistical model may hold. A statistical model is deemed relevant for the population, if it may account for a large enough fraction of the population, assuming the true (if known) or a sufficiently small or a realistic no-observation rate.
引用
收藏
页码:19 / 36
页数:18
相关论文
共 17 条
[1]  
[Anonymous], 1967, AM OCCUPATIONAL STRU
[2]  
Clogg C.C., 1998, VISUALIZATION CATEGO, P425, DOI [10.1016/B978-012299045-8/50033-4, DOI 10.1016/B978-012299045-8/50033-4]
[3]   A new index of structure for the analysis of models for mobility tables and other cross-classifications [J].
Clogg, CC ;
Rudas, T ;
Xi, LW .
SOCIOLOGICAL METHODOLOGY 1995, VOL 25, 1995, 25 :197-222
[4]  
DAYTON M, 2003, IN PRESS BRIT J MATH
[5]  
Formann AK, 2000, STAT MED, V19, P1881, DOI 10.1002/1097-0258(20000730)19:14<1881::AID-SIM495>3.0.CO
[6]  
2-I
[7]   Latent class model diagnostics - a review and some proposals [J].
Formann, AK .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2003, 41 (3-4) :549-559
[8]  
Knoke D., 1980, Log-linear models
[9]  
KNOTT M, 2001, 63 LOND SCH EC DEP S
[10]  
Little R.J., 1987, Statistical Analysis With Missing Data