Reliability of 95% confidence interval revealed by expected quality-of-life scores: an example of nasopharyngeal carcinoma patients after radiotherapy using EORTC QLQ-C 30

被引:11
作者
Chien, Tsair-Wei [1 ,2 ]
Lin, Shun-Jin [3 ]
Wang, Wen-Chung [6 ]
Leung, Henry W. C. [5 ]
Lai, Wen-Pin [4 ]
Chan, Agnes L. F. [1 ,3 ]
机构
[1] Chi Mei Med Ctr, Dept Pharm, Tainan, Taiwan
[2] Chia Nan Univ Pharm & Sci, Dept Hosp & Hlth Care Adm, Tainan, Taiwan
[3] Kaohsiung Med Univ, Sch Pharm, Kaohsiung, Taiwan
[4] Chi Mei Med Ctr, Dept Emergency, Tainan, Taiwan
[5] Taipei Med Univ, Shuang Ho Hosp, Dept Radiat Oncol, Taipei, Taiwan
[6] Hong Kong Inst Educ, Assessment Res Ctr, Tai Po, Hong Kong, Peoples R China
关键词
ITEM RESPONSE THEORY; COEFFICIENT-ALPHA; MONTE-CARLO; RASCH; SCALES; KIDMAP;
D O I
10.1186/1477-7525-8-68
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Many researchers use observed questionnaire scores to evaluate score reliability and to make conclusions and inferences regarding quality-of-life outcomes. The amount of false alarms from medical diagnoses that would be avoided if observed scores were substituted with expected scores is interesting, and understanding these differences is important for the care of cancer patients. Using expected scores to estimate the reliability of 95% confidence intervals (CIs) is rarely reported in published papers. We investigated the reliability of patient responses to a quality-of-life questionnaire and made recommendations for future studies of the quality of life of patients. Methods: A total of 115 patients completed the EORTC core questionnaire QLQ-C30 (version 3) after radiotherapy. The observed response scores, assumed to be one-dimensional, were summed and transformed into expected scores using the Rasch rating scale model with WINSTEPS software. A series of simulations was performed using a unified bootstrap procedure after manipulating scenarios with different questionnaire lengths and patient numbers to estimate the reliability at 95% confidence intervals. Skewness analyses of the 95% CIs were compared to detect different effects between groups according to the two data sets of observed and expected response scores. Results: We found that (1) it is necessary to report CIs for reliability and skewness coefficients in papers; (2) data derived from expected response scores are preferable to making inferences; and (3) visual representations displaying the 95% CIs of skewness values applied to item-by-item analyses can provide a useful interpretation of quality-of-life outcomes. Conclusion: Reliability coefficients can be reported with 95% CIs by statistical software to evaluate the internal consistency of respondent scores on questionnaire items. The SPSS syntax procedures for estimating the reliability of the 95% CI, expected score generation and visual skewness analyses are demonstrated in this study. We recommend that effect sizes such as a 95% CI be reported along with p values reporting significant differences in quality-of-life studies.
引用
收藏
页数:8
相关论文
共 32 条
[11]   Score reliability in Web- or Internet-based surveys: Unnumbered graphic rating scales versus likert-type scales [J].
Cook, C ;
Heath, F ;
Thompson, RL ;
Thompson, B .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2001, 61 (04) :697-706
[12]  
Cronbach LJ, 1951, PSYCHOMETRIKA, V16, P297
[13]   My current thoughts on coefficient alpha and successor procedures [J].
Cronbach, LJ .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2004, 64 (03) :391-418
[14]   Reliability: Arguments for multiple perspectives and potential problems with generalization across studies [J].
Dimitrov, DM .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2002, 62 (05) :783-801
[15]   1977 RIETZ LECTURE - BOOTSTRAP METHODS - ANOTHER LOOK AT THE JACKKNIFE [J].
EFRON, B .
ANNALS OF STATISTICS, 1979, 7 (01) :1-26
[16]   Confidence intervals about score reliability coefficients, please:: An EPM guidelines editorial [J].
Fan, XT ;
Thompson, B .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2001, 61 (04) :517-531
[17]   Monte Carlo studies in item response theory [J].
Harwell, M ;
Stone, CA ;
Hsu, TC ;
Kirisci, L .
APPLIED PSYCHOLOGICAL MEASUREMENT, 1996, 20 (02) :101-125
[18]   A reliability generalization study of the geriatric depression scale [J].
Kieffer, KM ;
Reese, RJ .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2002, 62 (06) :969-994
[19]   Practical significance: A concept whose time has come [J].
Kirk, RE .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1996, 56 (05) :746-759
[20]  
Linacre J.M., 2009, Winsteps (Computer Program and Manual)