Bootstrapping goodness-of-fit measures in categorical data analysis

被引:155
作者
Langeheine, R
Pannekoek, J
VandePol, F
机构
关键词
D O I
10.1177/0049124196024004004
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
When sparse data save to be fitted to a log-linear or latent class model, one cannot use the theoretical chi-square distribution to evaluate model fit, because with sparse data the observed cross-table has too many cells in relation to the number of observations to use a distribution that only holds asymptotically. The choice of a theoretical distribution is also difficult when model-expected frequencies 0 or when model probabilities are estimated 0 or 1. The authors propose to solve these problems by estimating the distribution of a fit measure, using bootstrap methods. An algorithm is presented for estimating this distribution by drawing bootstrap samples from the model-expected proportions, the so-called nonnaive bootstrap method. For the first time the method is applied to empirical data of varying sparseness, from five different data sets. Results show that the asymptotic chi-square distribution is not at all valid for sparse data.
引用
收藏
页码:492 / 516
页数:25
相关论文
共 43 条
[11]   LATENT STRUCTURE-ANALYSIS OF A SET OF MULTIDIMENSIONAL CONTINGENCY-TABLES [J].
CLOGG, CC ;
GOODMAN, LA .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1984, 79 (388) :762-771
[12]  
CLOGG CC, 1981, FACTOR ANAL MEASUREM, P214
[13]  
Clogg CC, 1995, HDB STAT MODELING SO, P311, DOI DOI 10.1007/978-1-4899-1292-3_6
[14]   GOODNESS-OF-FIT TESTING FOR LATENT CLASS MODELS [J].
COLLINS, LM ;
FIDLER, PL ;
WUGALTER, SE ;
LONG, JD .
MULTIVARIATE BEHAVIORAL RESEARCH, 1993, 28 (03) :375-389
[16]   THE COMPUTER-GENERATION OF MULTINOMIAL RANDOM VARIATES [J].
DAVIS, CS .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1993, 16 (02) :205-217
[17]  
Efron B., 1980, JACKKNIFE BOOTSTRAP
[18]   LONGITUDINAL ANALYSIS OF CATEGORICAL EPIDEMIOLOGICAL DATA - A STUDY OF THREE-MILE-ISLAND [J].
FIENBERG, SE ;
BROMET, EJ ;
FOLLMANN, D ;
LAMBERT, D ;
MAY, SM .
ENVIRONMENTAL HEALTH PERSPECTIVES, 1985, 63 (NOV) :241-248
[19]   ANALYSIS OF SYSTEMS OF QUALITATIVE VARIABLES WHEN SOME OF VARIABLES ARE UNOBSERVABLE .1. MODIFIED LATENT STRUCTURE APPROACH [J].
GOODMAN, LA .
AMERICAN JOURNAL OF SOCIOLOGY, 1974, 79 (05) :1179-1259
[20]  
Haberman S. J., 1979, ANAL QUALITATIVE DAT, V2