Combining EQ-5D-5L items into a level summary score: demonstrating feasibility using non-parametric item response theory using an international dataset

被引:26
作者
Feng, You-Shan [1 ,2 ]
Jiang, Ruixuan [3 ]
Pickard, A. Simon [4 ]
Kohlmann, Thomas [2 ]
机构
[1] Med Univ Tubingen, Inst Clin Epidemiol & Appl Biometr, Silcherstr 5, D-72076 Tubingen, Germany
[2] Univ Greifswald, Inst Community Med, Greifswald, Germany
[3] Merck & Co Inc, Ctr Observat & Real World Evidence, Kenilworth, NJ USA
[4] Univ Illinois, Coll Pharm, Chicago, IL USA
关键词
EQ-5D-5L; Non-economic scoring approaches; Non-parametric item response theory; Mokken scaling; Unweighted summary score; MOKKEN SCALE ANALYSIS; QUALITY-OF-LIFE; DIMENSIONS; VERSION; MODEL; VALUATION; VALUES; CAUSAL;
D O I
10.1007/s11136-021-02922-1
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background The EQ-5D-5L is a well-established health questionnaire that estimates health utilities by applying preference-based weights. Limited work has been done to examine alternative scoring approaches when utility weights are unavailable or inapplicable. We examined whether the Mokken scaling approach can elucidate 1) if the level summary score is appropriate for the EQ-5D-5L and 2) an interpretation of such a score. Methods The R package "mokken" was used to assess monotonicity (scaling coefficients H, automated item selection procedure) and manifest invariant item ordering (MIIO: paired item response functions [IRF], H-T). We used a rich dataset (the Multiple Instrument Comparison, MIC) which includes EQ-5D-5L data from six Western countries. Results While all EQ-5D-5L items demonstrated monotonicity, the anxiety/depression (AD) item had weak scalability (H-i = 0.377). Without AD, scalability improved from H-s = 0.559 to H-s = 0.714. MIIO revealed that the 5 items can be ordered, and the ordering is moderately accurate in the MIC data (H-T = 0.463). Excluding AD, H-T improves to 0.743. Results were largely consistent across disease and country subgroups. Discussion The 5 items of the EQ-5D-5L form a moderate to strong Mokken scale, enabling persons to be ordered using the level summary score. Item ordering suggests that the lower range of the score represents mainly problems with pain and anxiety/depression, the mid-range indicates additional problems with mobility and usual activities, and middle to higher range of scores reveals additional limitations with self-care. Scalability and item ordering are even stronger when the anxiety/depression item is not included in the scale.
引用
收藏
页码:11 / 23
页数:13
相关论文
共 54 条
[51]   An Instructional Module on Mokken Scale Analysis [J].
Wind, Stefanie A. .
EDUCATIONAL MEASUREMENT-ISSUES AND PRACTICE, 2017, 36 (02) :50-66
[52]  
Wittrup-Jensenm KL., 2008, ASSESSMENT 2 GENERIC
[53]   Effect of Health State Sampling Methods on Model Predictions of EQ-5D-5L Values: Small Designs Can Suffice [J].
Yang, Zhihao ;
Luo, Nan ;
Bonsel, Gouke ;
Busschbach, Jan ;
Stolk, Elly .
VALUE IN HEALTH, 2019, 22 (01) :38-44
[54]   Summarizing health-related quality of life (HRQOL): development and testing of a one-factor model [J].
Yin, Shaoman ;
Njai, Rashid ;
Barker, Lawrence ;
Siegel, Paul Z. ;
Liao, Youlian .
POPULATION HEALTH METRICS, 2016, 14