The Crit coefficient in Mokken scale analysis: a simulation study and an application in quality-of-life research

被引:5
作者
Crisan, Daniela R. [1 ]
Tendeiro, Jorge N. [1 ,2 ]
Meijer, Rob R. [1 ]
机构
[1] Univ Groningen, Fac Behav & Social Sci, Dept Psychometr & Stat, Groningen, Netherlands
[2] Hiroshima Univ, Educ & Res Ctr Artificial Intelligence & Data Inn, Higashihiroshima, Japan
关键词
Mokken scaling; MSA; Crit; Monotonicity; IIO; Item fit; ITEM RESPONSE THEORY; MANIFEST MONOTONICITY;
D O I
10.1007/s11136-021-02924-z
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Purpose In Mokken scaling, the Crit index was proposed and is sometimes used as evidence (or lack thereof) of violations of some common model assumptions. The main goal of our study was twofold: To make the formulation of the Crit index explicit and accessible, and to investigate its distribution under various measurement conditions. Methods We conducted two simulation studies in the context of dichotomously scored item responses. We manipulated the type of assumption violation, the proportion of violating items, sample size, and quality. False positive rates and power to detect assumption violations were our main outcome variables. Furthermore, we used the Crit coefficient in a Mokken scale analysis to a set of responses to the General Health Questionnaire (GHQ-12), a self-administered questionnaire for assessing current mental health. Results We found that the false positive rates of Crit were close to the nominal rate in most conditions, and that power to detect misfit depended on the sample size, type of violation, and number of assumption-violating items. Overall, in small samples Crit lacked the power to detect misfit, and in larger samples power differed considerably depending on the type of violation and proportion of misfitting items. Furthermore, we also found in our empirical example that even in large samples the Crit index may fail to detect assumption violations. Discussion Even in large samples, the Crit coefficient showed limited usefulness for detecting moderate and severe violations of monotonicity. Our findings are relevant to researchers and practitioners who use Mokken scaling for scale and questionnaire construction and revision.
引用
收藏
页码:49 / 59
页数:11
相关论文
共 34 条
[1]  
Cavalini P.M., 1992, Its an ill wind that brings no good. Studies on odour annoyance and the dispersion of odorant concentrations from industries
[2]  
Goldberg D., 1988, A user's guide to the General Health Questionnaire
[3]  
Junker BW, 2000, APPL PSYCH MEAS, V24, P63
[4]   A two-step, test-guided Mokken scale analysis, for nonclustered and clustered data [J].
Koopman, Letty ;
Zijlstra, Bonne J. H. ;
van der Ark, L. Andries .
QUALITY OF LIFE RESEARCH, 2022, 31 (01) :25-36
[5]   STANDARD ERRORS AND CONFIDENCE INTERVALS FOR SCALABILITY COEFFICIENTS IN MOKKEN SCALE ANALYSIS USING MARGINAL MODELS [J].
Kuijpers, Renske E. ;
Van der Ark, L. Andries ;
Croon, Marcel A. .
SOCIOLOGICAL METHODOLOGY 2013, VOL 43, 2013, 43 :42-69
[7]  
Meijer R.R., 2018, The Wiley handbook of psychometric testing: A multidisciplinary reference on survey, scale and test development, P413, DOI [10.1002/9781118489772.ch15, DOI 10.1002/9781118489772.CH15]
[8]   Detection and validation of unscalable item score patterns using item response theory: An illustration with Harter's self-perception profile for children [J].
Meijer, Rob R. ;
Egberink, Iris J. L. ;
Emons, Wilco H. M. ;
Sljtsma, Klaas .
JOURNAL OF PERSONALITY ASSESSMENT, 2008, 90 (03) :227-238
[9]   Investigating Invariant Item Ordering in Personality and Clinical Scales: Some Empirical Findings and a Discussion [J].
Meijer, Rob R. ;
Egberink, Iris J. L. .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2012, 72 (04) :589-607
[10]   An Evaluation of the Brief Symptom Inventory-18 Using Item Response Theory: Which Items Are Most Strongly Related to Psychological Distress? [J].
Meijer, Rob R. ;
de Vries, Rivka M. ;
van Bruggen, Vincent .
PSYCHOLOGICAL ASSESSMENT, 2011, 23 (01) :193-202