How to compare scores from different depression scales: equating the Patient Health Questionnaire (PHQ) and the ICD-10-Symptom Rating (ISR) using Item Response Theory

Cited by: 33
Authors
Fischer, H. Felix [1, 2]
Tritt, Karin [3, 4]
Klapp, Burghard F. [2]
Fliege, Herbert [2]
Affiliations
[1] Charite, Inst Sozialmed Epidemiol & Gesundheitsokon, D-10117 Berlin, Germany
[2] Charite, Med Klin Schwerpunkt Psychosomat, D-10117 Berlin, Germany
[3] Inst Qualitatsentwicklung Psychotherapie & Psycho, Munich, Germany
[4] Univ Regensburg, Inst Epidemiol & Pravent Med, Regensburg, Germany
Keywords
Item Response Theory; equating; PHQ; ISR; depression self-rating scales; COMPUTER-ADAPTIVE TEST; MONITORING DEPRESSION; VALIDATION; VALIDITY; SCL-90-R;
DOI
10.1002/mpr.350
Chinese Library Classification (CLC) number
R749 [Psychiatry]
Discipline classification code
100205
Abstract
A wide range of questionnaires for measuring depression is available. Item Response Theory (IRT) models allow questionnaires to be evaluated beyond the limits of Classical Test Theory and provide an opportunity to equate them. In this study, after checking for unidimensionality, a Generalized Partial Credit Model was applied to data from two depression scales [Patient Health Questionnaire (PHQ-9) and ICD-10-Symptom Rating (ISR)] obtained in clinical settings from a consecutive sample comprising 4517 observations from a total of 2999 inpatients and outpatients of a psychosomatic clinic. The precision of each questionnaire was compared, and the model was used to transform scores between the instruments via the assumed underlying latent trait. Both instruments were constructed to measure the same construct, and their estimates of depression severity are highly correlated. Our analysis showed that the scores predicted by the conversion tables are similar to the observed scores in a validation sample. The PHQ-9 and ISR depression scales measure depression severity across a broad range with similar precision. While the PHQ-9 shows advantages in measuring low or high depression severity, the ISR is more parsimonious and also suitable for clinical purposes. Furthermore, the conversion tables derived in this study enhance the comparability of studies using either instrument, but because of substantial statistical spread, the comparison of individual scores is imprecise. Copyright (C) 2011 John Wiley & Sons, Ltd.
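The idea of transforming scores through a shared latent trait can be illustrated with a minimal Python sketch. It is not the authors' procedure: it assumes a Generalized Partial Credit Model with already-estimated item parameters, maps a sum score to the latent trait by inverting the expected-score curve with bisection, and then reads off the expected score on the other scale. All function names and all item parameters below are hypothetical and chosen only for illustration; they are not estimates from the study.

```python
import math


def gpcm_probs(theta, a, thresholds):
    """Category probabilities of one item under the Generalized Partial Credit Model.

    theta: latent trait value; a: item discrimination (> 0);
    thresholds: step parameters b_1..b_m for response categories 1..m.
    """
    # Cumulative sums of a * (theta - b_v); category 0 has an empty sum (0.0).
    z = [0.0]
    for b in thresholds:
        z.append(z[-1] + a * (theta - b))
    m = max(z)  # subtract the maximum for numerical stability
    expz = [math.exp(v - m) for v in z]
    total = sum(expz)
    return [e / total for e in expz]


def expected_sum_score(theta, items):
    """Expected sum score of a scale at theta; items is a list of (a, thresholds)."""
    return sum(
        sum(k * p for k, p in enumerate(gpcm_probs(theta, a, bs)))
        for a, bs in items
    )


def theta_for_score(score, items, lo=-6.0, hi=6.0, tol=1e-6):
    """Find theta whose expected sum score equals `score` (bisection).

    Relies on the expected score increasing with theta when all a > 0.
    """
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if expected_sum_score(mid, items) < score:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


def convert_score(score, source_items, target_items):
    """Map a sum score on the source scale to an expected score on the target scale."""
    theta = theta_for_score(score, source_items)
    return expected_sum_score(theta, target_items)


# Hypothetical item parameters for two short scales (NOT from the study).
scale_a = [(1.0, [-1.0, 0.0, 1.0]), (1.5, [-0.5, 0.5, 1.5])]
scale_b = [(1.2, [-0.8, 0.2, 1.1, 2.0])]

if __name__ == "__main__":
    for s in (1, 3, 5):
        print(s, round(convert_score(s, scale_a, scale_b), 2))
```

A conversion built this way yields only an expected score on the target scale; as the abstract notes, individual scores scatter substantially around such predictions.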
Pages: 203-214
Number of pages: 12