Psychometric validation of PROM instruments. Article four in a series of ten

Times cited: 31
Authors
Christensen, Karl B. [1]
Comins, Jonathan D. [2, 3, 4]
Krogsgaard, Michael R. [2]
Brodersen, John [3, 4, 5]
Jensen, Jonas [2]
Hansen, Christian Fugl [2]
Kreiner, Svend [1]
Affiliations
[1] Univ Copenhagen, Sect Biostat, Dept Publ Hlth, Copenhagen, Denmark
[2] Bispebjerg & Frederiksberg Hosp, Sect Sports Traumatol M51, Bispebjerg Bakke 23, DK-2400 Copenhagen NV, Denmark
[3] Univ Copenhagen, Dept Publ Hlth, Res Unit Gen Practice, Copenhagen, Denmark
[4] Univ Copenhagen, Dept Publ Hlth, Sect Gen Practice, Copenhagen, Denmark
[5] Primary Hlth Care Res Unit, Region Zealand, Denmark
Keywords
classical test theory; confirmatory factor analyses; construct validity; differential item functioning; modern test theory; patient-reported outcome measures; psychometric validation; Rasch analyses; OUTCOME SCORE KOOS; RASCH MODEL; KNEE INJURY; STATISTICS; INDEXES; ALPHA; BIAS
DOI
10.1111/sms.13908
Chinese Library Classification number
G8 [Sports]
Discipline classification code
04; 0403
Abstract
The aim was to provide an overview of the different statistical methods for validation of patient-reported outcome measures, ranging from simple statistical methods available in all software packages to advanced statistical models that require specialized software. A non-technical summary of classical test theory (CTT) and modern test theory (MTT) is provided. Specifically, confirmatory factor analysis, item response theory, and Rasch analysis are outlined. One CTT method and three MTT methods were used to validate two subscales (Symptoms and Quality of Life) from the Knee Injury and Osteoarthritis Outcome Score (KOOS). For each methodology, two analyses were considered: (i) a unidimensional analysis ignoring the pre-specified dimensionality, and (ii) a two-dimensional analysis using the pre-specified dimensionality. While CTT did not adequately address central issues regarding the validity of the KOOS subscales, the three MTT methods yielded very similar results. In conclusion, MTT methods offer analysis of all relevant properties related to the validity of patient-reported outcome measures, while this is not the case for CTT. Claims about sufficient validity based on CTT methods are inadequate and should not be trusted.
Pages: 1225-1238
Number of pages: 14