State of the psychometric methods: patient-reported outcome measure development and refinement using item response theory

被引:56
作者
Stover, Angela M. [1 ,2 ]
McLeod, Lori D. [3 ]
Langer, Michelle M. [2 ,4 ,5 ]
Chen, Wen-Hung [3 ]
Reeve, Bryce B. [1 ,6 ]
机构
[1] Univ N Carolina, Dept Hlth Policy & Management, 1101-G McGavran Greenberg Hall CB 7411, Chapel Hill, NC 27599 USA
[2] Univ N Carolina, Lineberger Comprehens Canc Ctr, Sch Med, 101 Manning Dr, Chapel Hill, NC 27599 USA
[3] RTI Hlth Solut, 3040 Cornwallis Rd, Res Triangle Pk, NC 27709 USA
[4] Northwestern Univ, Med Social Sci, 625 N Michigan Ave Suite 2700, Chicago, IL 60611 USA
[5] Northwestern Univ, Feinberg Sch Med, 625 N Michigan Ave Suite 2700, Chicago, IL 60611 USA
[6] Duke Univ, Sch Med, Dept Populat Hlth Sci & Pediat, Ctr Hlth Measurement, 2200 West Main St,Suite 720A, Durham, NC 27707 USA
关键词
Item response theory; Scale construction; Scale evaluation; Measurement; PROMIS (R); GOODNESS-OF-FIT; INFORMATION-SYSTEM PROMIS(R); INSTRUMENT DEVELOPMENT; LIMITED-INFORMATION; LATENT ABILITY; IRT MODEL; DEPRESSION; IMPACT; ORGANIZATION; VALIDATION;
D O I
10.1186/s41687-019-0130-5
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: This paper is part of a series comparing different psychometric approaches to evaluate patient-reported outcome (PRO) measures using the same items and dataset. We provide an overview and example application to demonstrate 1) using item response theory (IRT) to identify poor and well performing items; 2) testing if items perform differently based on demographic characteristics (differential item functioning, DIF); and 3) balancing IRT and content validity considerations to select items for short forms. Methods: Model fit, local dependence, and DIF were examined for 51 items initially considered for the Patient-Reported Outcomes Measurement Information System (R) (PROMIS (R)) Depression item bank. Samejima's graded response model was used to examine how well each item measured severity levels of depression and how well it distinguished between individuals with high and low levels of depression. Two short forms were constructed based on psychometric properties and consensus discussions with instrument developers, including psychometricians and content experts. Calibrations presented here are for didactic purposes and are not intended to replace official PROMIS parameters or to be used for research. Results: Of the 51 depression items, 14 exhibited local dependence, 3 exhibited DIF for gender, and 9 exhibited misfit, and these items were removed from consideration for short forms. Short form 1 prioritized content, and thus items were chosen to meet DSM-V criteria rather than being discarded for lower discrimination parameters. Short form 2 prioritized well performing items, and thus fewer DSM-V criteria were satisfied. Short forms 1-2 performed similarly for model fit statistics, but short form 2 provided greater item precision. Conclusions: IRT is a family of flexible models providing item- and scale-level information, making it a powerful tool for scale construction and refinement. Strengths of IRT models include placing respondents and items on the same metric, testing DIF across demographic or clinical subgroups, and facilitating creation of targeted short forms. Limitations include large sample sizes to obtain stable item parameters, and necessary familiarity with measurement methods to interpret results. Combining psychometric data with stakeholder input (including people with lived experiences of the health condition and clinicians) is highly recommended for scale development and evaluation.
引用
收藏
页数:16
相关论文
共 50 条
[21]   Developing a Valid Patient-Reported Outcome Measure [J].
Rothrock, N. E. ;
Kaiser, K. A. ;
Cella, D. .
CLINICAL PHARMACOLOGY & THERAPEUTICS, 2011, 90 (05) :737-742
[22]   Development of a patient-reported outcome measure for the foot affected by rheumatoid arthritis [J].
Walmsley, Steven ;
Ravey, Mike ;
Graham, Andrea ;
Teh, Lee S. ;
Williams, Anita E. .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2012, 65 (04) :413-422
[23]   A comparison of methods to address item non-response when testing for differential item functioning in multidimensional patient-reported outcome measures [J].
Olawale F. Ayilara ;
Tolulope T. Sajobi ;
Ruth Barclay ;
Eric Bohm ;
Mohammad Jafari Jozani ;
Lisa M. Lix .
Quality of Life Research, 2022, 31 :2837-2848
[24]   Psychometric Properties of the Triarchic Psychopathy Measure: An Item Response Theory Approach [J].
Shou, Yiyun ;
Sellbom, Martin ;
Xu, Jing .
PERSONALITY DISORDERS-THEORY RESEARCH AND TREATMENT, 2018, 9 (03) :217-227
[25]   The Tardive Dyskinesia Impact Scale (TDIS), a novel patient-reported outcome measure in tardive dyskinesia: development and psychometric validation [J].
Farber, Robert H. ;
Stull, Donald E. ;
Witherspoon, Brooke ;
Evans, Christopher J. ;
Yonan, Charles ;
Bron, Morgan ;
Dhanda, Rahul ;
Jen, Eric ;
Brien, Christopher O. .
JOURNAL OF PATIENT-REPORTED OUTCOMES, 2024, 8 (01)
[26]   The Tardive Dyskinesia Impact Scale (TDIS), a novel patient-reported outcome measure in tardive dyskinesia: development and psychometric validation [J].
Robert H. Farber ;
Donald E. Stull ;
Brooke Witherspoon ;
Christopher J. Evans ;
Charles Yonan ;
Morgan Bron ;
Rahul Dhanda ;
Eric Jen ;
Christopher O.’ Brien .
Journal of Patient-Reported Outcomes, 8
[27]   Development and Psychometric Validation of a Patient-Reported Outcome Measure for Arm Lymphedema: The LYMPH-Q Upper Extremity Module [J].
Klassen, Anne F. ;
Tsangaris, Elena ;
Kaur, Manraj N. ;
Poulsen, Lotte ;
Beelen, Louise M. ;
Jacobsen, Amalie Lind ;
Jorgensen, Mads Gustaf ;
Sorensen, Jens Ahm ;
Vasilic, Dalibor ;
Dayan, Joseph ;
Mehrara, Babak ;
Pusic, Andrea L. .
ANNALS OF SURGICAL ONCOLOGY, 2021, 28 (09) :5166-5182
[28]   Psychometric evaluation of the Urgency NRS as a new patient-reported outcome measure for patients with ulcerative colitis [J].
Marla C. Dubinsky ;
Mingyang Shan ;
Laure Delbecque ;
Trevor Lissoos ;
Theresa Hunter ;
Gale Harding ;
Larissa Stassek ;
David Andrae ;
James D. Lewis .
Journal of Patient-Reported Outcomes, 6
[29]   Psychometric evaluation of the Urgency NRS as a new patient-reported outcome measure for patients with ulcerative colitis [J].
Dubinsky, Marla C. ;
Shan, Mingyang ;
Delbecque, Laure ;
Lissoos, Trevor ;
Hunter, Theresa ;
Harding, Gale ;
Stassek, Larissa ;
Andrae, David ;
Lewis, James D. .
JOURNAL OF PATIENT-REPORTED OUTCOMES, 2022, 6 (01)
[30]   RespOnse Shift ALgorithm in Item response theory (ROSALI) for response shift detection with missing data in longitudinal patient-reported outcome studies [J].
Alice Guilleux ;
Myriam Blanchin ;
Antoine Vanier ;
Francis Guillemin ;
Bruno Falissard ;
Carolyn E. Schwartz ;
Jean-Benoit Hardouin ;
Véronique Sébille .
Quality of Life Research, 2015, 24 :553-564