How to compare scores from different depression scales: equating the Patient Health Questionnaire (PHQ) and the ICD-10-Symptom Rating (ISR) using Item Response Theory

Cited by: 35

Authors:

Fischer, H. Felix [1,2]

Tritt, Karin [3,4]

Klapp, Burghard F. [2]

Fliege, Herbert [2]

Affiliations:

[1] Charite, Inst Sozialmed Epidemiol & Gesundheitsokon, D-10117 Berlin, Germany

[2] Charite, Med Klin Schwerpunkt Psychosomat, D-10117 Berlin, Germany

[3] Inst Qualitatsentwicklung Psychotherapie & Psycho, Munich, Germany

[4] Univ Regensburg, Inst Epidemiol & Pravent Med, Regensburg, Germany

Source:

INTERNATIONAL JOURNAL OF METHODS IN PSYCHIATRIC RESEARCH | 2011, Vol. 20, Issue 4

Keywords:

Item Response Theory; equating; PHQ; ISR; depression self-rating scales; COMPUTER-ADAPTIVE TEST; MONITORING DEPRESSION; VALIDATION; VALIDITY; SCL-90-R;

DOI:

10.1002/mpr.350

Chinese Library Classification (CLC) number:

R749 [Psychiatry]

Discipline classification code:

100205

Abstract:

A wide range of questionnaires for measuring depression is available. Item Response Theory (IRT) models can help to evaluate such questionnaires beyond the boundaries of Classical Test Theory and provide an opportunity to equate them. In this study, after checking for unidimensionality, a Generalized Partial Credit Model was applied to data from two depression scales [Patient Health Questionnaire (PHQ-9) and ICD-10-Symptom Rating (ISR)] obtained in clinical settings from a consecutive sample comprising 4517 observations from a total of 2999 inpatients and outpatients of a psychosomatic clinic. The precision of the two questionnaires was compared, and the model was used to transform scores via the assumed underlying latent trait. Both instruments were constructed to measure the same construct, and their estimates of depression severity are highly correlated. Our analysis showed that the predicted scores provided by the conversion tables are similar to the observed scores in a validation sample. The PHQ-9 and ISR depression scales measure depression severity across a broad range with similar precision. While the PHQ-9 shows advantages in measuring low or high depression severity, the ISR is more parsimonious and also suitable for clinical purposes. Furthermore, the conversion tables derived in this study enhance the comparability of studies using either instrument, but because of substantial statistical spread, the comparison of individual scores is imprecise. Copyright (C) 2011 John Wiley & Sons, Ltd.
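As a rough illustration of the approach the abstract describes, the sketch below computes category probabilities under a Generalized Partial Credit Model and converts a sum score on one scale into an expected sum score on another by passing through the shared latent trait. It is not the authors' procedure or their conversion table: the item parameters, the two toy scales (`SCALE_A`, `SCALE_B`), the helper names, and the choice of matching expected scores on a grid of trait values are all assumptions made for the example.

```python
import numpy as np

def gpcm_probs(theta, a, b):
    """Category probabilities for one item under the GPCM.

    theta: latent trait value(s); a: discrimination; b: step thresholds
    (m thresholds give m + 1 response categories 0..m).
    Returns an array of shape (len(theta), m + 1).
    """
    theta = np.atleast_1d(np.asarray(theta, dtype=float))
    steps = a * (theta[:, None] - np.asarray(b, dtype=float)[None, :])
    # Category k has logit sum_{j<=k} a * (theta - b_j); category 0 has logit 0.
    logits = np.concatenate(
        [np.zeros((theta.size, 1)), np.cumsum(steps, axis=1)], axis=1
    )
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    p = np.exp(logits)
    return p / p.sum(axis=1, keepdims=True)

def expected_sum_score(theta, items):
    """Expected sum score of a scale (list of (a, b) items) at each theta."""
    theta = np.atleast_1d(np.asarray(theta, dtype=float))
    total = np.zeros(theta.shape)
    for a, b in items:
        p = gpcm_probs(theta, a, b)
        total += p @ np.arange(p.shape[1])
    return total

def convert_score(score, items_from, items_to, grid=np.linspace(-6, 6, 2401)):
    """Map a sum score on one scale to an expected sum score on another."""
    curve_from = expected_sum_score(grid, items_from)
    curve_to = expected_sum_score(grid, items_to)
    # The expected score rises with theta when a > 0, so it can be inverted.
    theta_hat = np.interp(score, curve_from, grid)
    return float(np.interp(theta_hat, grid, curve_to))

# Hypothetical parameters for two toy scales (NOT estimates from the study).
SCALE_A = [(1.4, [-0.8, 0.3, 1.5]), (1.1, [-0.2, 0.9, 2.0]), (1.7, [-1.0, 0.1, 1.2])]
SCALE_B = [(1.5, [-1.2, -0.1, 0.8, 1.9]), (1.2, [-0.6, 0.4, 1.3, 2.4])]

converted = convert_score(4.0, SCALE_A, SCALE_B)
```

A conversion of this kind returns a single expected value per score. It says nothing about the spread around that value, which is why the abstract cautions against comparing individual scores.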


Pages: 203-214

Page count: 12
