An essay on measurement and factorial invariance

被引:529
作者
Meredith, William
Teresi, Jeanne A.
机构
[1] Columbia Univ, Stroud Ctr, New York, NY 10032 USA
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
[3] New York State Psychiat Inst & Hosp, Fac Med, Riverdale, NY USA
[4] Hebrew Home Aged, Div Res, Riverdale, NY USA
关键词
measurement invariance; factorial invariance; differential item functioning; item response theory; fairness and equity;
D O I
10.1097/01.mlr.0000245438.73837.89
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Analysis of subgroups such as different ethnic, language; or education groups selected from among a parent population is common in health disparities research. One goal of such analyses is to examine measurement equivalence, which includes both qualitative review of the meaning of items as well as quantitative examination of different levels of factorial invariance and differential item functioning. Objectives: The purpose of this essay is to review the definitions and assumptions associated with factorial invariance, placing this formulation in the context of bias, fairness, and equity. The connection between the concept;, of factorial invariance and item bias (differential item functioning) using a variant of item response theory is discussed. The situations under which different forms of invariance (weak, strong, and strict) are required are discussed. Methods: Establishing factorial invariance involves a hierarchy of levels that include tests of weak, strong, and strict invariance. Pattern (metric or weak) factorial invariance implies that the regression slopes are invariant across groups. Pattern invariance requires only invariant factor loadings. Strong factorial invariance implies that the conditional expectation of the response, given the common and specific factors, is invariant across groups. Strong factorial invariance requires that specific factor means (represented as invariant intercepts) also be identical across groups. Strict factorial invariance implies that, in addition, the conditional variance of the response, given the common and specific factors, is invariant across groups. Strict factorial invariance requires that, in addition to equal factor loadings and intercepts, the residual (specific factor plus error variable) variances are equivalent across groups. The concept of measurement invariance that is most closely aligned to that of item response theory considers the latent variable as a common factor measured by manifest variables; the specific factors can be characterized as nuisance variables. Conclusions: Invariance of factor loadings across studied groups is required for valid comparisons of scale score or latent variable means. Strong and strict invariance may be less important in the context of basic research in which group differences in specific factors are indicative of individual differences that are important for scientific exploration. However, for most applications in which the aim is to ensure fairness and equity, strict factorial invariance is required. Health disparities research often focuses on self-reported clinical outcomes such as quality of life that are not observed directly. Latent variable models such as factor analyses are central to establishing valid assessment of such outcomes.
引用
收藏
页码:S69 / S77
页数:9
相关论文
共 26 条
[1]  
[Anonymous], 1988, TEST VALIDITY
[2]   TESTING FOR THE EQUIVALENCE OF FACTOR COVARIANCE AND MEAN STRUCTURES - THE ISSUE OF PARTIAL MEASUREMENT INVARIANCE [J].
BYRNE, BM ;
SHAVELSON, RJ ;
MUTHEN, B .
PSYCHOLOGICAL BULLETIN, 1989, 105 (03) :456-466
[3]   FACTOR-ANALYSIS OF DICHOTOMIZED VARIABLES [J].
CHRISTOFFERSSON, A .
PSYCHOMETRIKA, 1975, 40 (01) :5-32
[4]   Differential item functioning analysis with ordinal logistic regression techniques - DIFdetect and difwithpar [J].
Crane, Paul K. ;
Gibbons, Laura E. ;
Jolley, Lance ;
van Belle, Gerald .
MEDICAL CARE, 2006, 44 (11) :S115-S123
[5]   Differential item functioning on the Mini-Mental State Examination - An application of the Mantel-Haenszel and standardization procedures [J].
Dorans, Neil J. ;
Kulick, Edward .
MEDICAL CARE, 2006, 44 (11) :S107-S114
[6]   Identification of differential item functioning using item response theory and the likelihood-based model comparison approach - Application to the Mini-Mental State Examination [J].
Edelen, Maria Orlando ;
Thissen, David ;
Teresi, Jeanne A. ;
Kleinman, Marjorie ;
Ocepek-Welikson, Katja .
MEDICAL CARE, 2006, 44 (11) :S134-S142
[7]   Do self-report instruments allow meaningful comparisons across diverse population groups? Testing measurement invariance using the confirmatory factor analysis framework [J].
Gregorich, Steven E. .
MEDICAL CARE, 2006, 44 (11) :S78-S94
[8]  
GREGORY KL, 1991, THESIS U CALIFORNIA
[9]   THE DETERMINACY OF FACTOR SCORE MATRICES WITH IMPLICATIONS FOR FIVE OTHER BASIC PROBLEMS OF COMMON-FACTOR THEORY [J].
Guttman, Louis .
BRITISH JOURNAL OF STATISTICAL PSYCHOLOGY, 1955, 8 :65-81
[10]   Identification of measurement differences between English and Spanish language versions of the Mini-Mental State Examination - Detecting differential item functioning using MIMIC modeling [J].
Jones, Richard N. .
MEDICAL CARE, 2006, 44 (11) :S124-S133