Getting serious about test-retest reliability: a critique of retest research and some recommendations

Cited by: 294
Author
Polit, Denise F. [1, 2]
Affiliations
[1] Humanalysis Inc, Saratoga Springs, NY 12866 USA
[2] Griffith Univ, Ctr Hlth Practice Innovat, Brisbane, Qld 4111, Australia
Keywords
COSMIN; Instrument development; Measurement; Patient-reported outcome; Psychometrics; Reliability; Test-retest reliability; QUESTIONNAIRE; STABILITY; SCALE; REPRODUCIBILITY; PERSONALITY; VALIDATION; QUALITY; ERROR; ALPHA
DOI
10.1007/s11136-014-0632-9
Chinese Library Classification (CLC) number
R19 [Health care organizations and services (health services administration)]
Abstract
This paper seeks to focus attention on the need for rigorous, carefully designed test-retest reliability assessments of new patient-reported outcomes, and to encourage retest researchers to be thoughtful, ambitious, and creative in their retest efforts. The paper outlines key challenges that confront retest researchers, calls attention to some limitations in meeting those challenges, and describes some strategies to improve retest research. Modest retest coefficients are often reported as acceptable, and many important decisions, such as the choice of retest interval, appear not to be evidence-based. Retest assessments are seldom undertaken before a measure has been finalized, which rules out using retest data to select strong, reproducible items. Strategies for improving retest research include seeking input from patients or experts regarding the stability of the construct to support decisions about the retest interval; analyzing item-level retest data to identify items to revise or discard; establishing a priori standards of acceptability for reliability coefficients; using large, heterogeneous, and representative retest samples; and collecting follow-up data to better understand consistent and inconsistent responses over time.
Pages: 1713-1720
Number of pages: 8
Related papers
29 in total
  • [1] Initial psychometric evaluation of the Arm Activity Measure (ArmA): a measure of activity in the hemiparetic arm
    Ashford, Stephen
    Turner-Stokes, Lynne
    Siegert, Richard
    Slade, Mike
    [J]. CLINICAL REHABILITATION, 2013, 27 (08) : 728 - 740
  • [2] Patient-reported outcomes in randomized clinical trials: development of ISOQOL reporting standards
    Brundage, Michael
    Blazeby, Jane
    Revicki, Dennis
    Bass, Brenda
    de Vet, Henrica
    Duffy, Helen
    Efficace, Fabio
    King, Madeleine
    Lam, Cindy L. K.
    Moher, David
    Scott, Jane
    Sloan, Jeff
    Snyder, Claire
    Yount, Susan
    Calvert, Melanie
    [J]. QUALITY OF LIFE RESEARCH, 2013, 22 (06) : 1161 - 1175
  • [3] Validation of a 10-item Care-related Regret Intensity Scale (RIS-10) for Health Care Professionals
    Courvoisier, Delphine S.
    Cullati, Stephane
    Haller, Chiara S.
    Schmidt, Ralph E.
    Haller, Guy
    Agoritsas, Thomas
    Perneger, Thomas V.
    [J]. MEDICAL CARE, 2013, 51 (03) : 285 - 291
  • [4] TEST "RELIABILITY": ITS MEANING AND DETERMINATION
    Cronbach, Lee J.
    [J]. PSYCHOMETRIKA, 1947, 12 (01) : 1 - 16
  • [5] de Vet H. W., 2011, MEASUREMENT MED, DOI 10.1017/CBO9780511996214
  • [6] DeVellis RF, 2012, Scale development: Theory and application, 3rd ed.
  • [7] REPRODUCIBILITY AND RESPONSIVENESS OF HEALTH-STATUS MEASURES - STATISTICS AND STRATEGIES FOR EVALUATION
    DEYO, RA
    DIEHR, P
    PATRICK, DL
    [J]. CONTROLLED CLINICAL TRIALS, 1991, 12 (04) : S142 - S158
  • [8] Meta-analysis identifies Back Pain Questionnaire reliability influenced more by instrument than study design or population
    Geere, Jonathan H.
    Geere, Jo-Anne L.
    Hunter, Paul R.
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2013, 66 (03) : 261 - 267
  • [9] Giraudeau B, 2001, STAT MED, V20, P3205, DOI 10.1002/sim.935.abs
  • [10] INTERRELATIONSHIPS AMONG PERSONALITY SCALE PARAMETERS - ITEM RESPONSE STABILITY AND SCALE RELIABILITY
    JONES, RR
    GOLDBERG, LR
    [J]. EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1967, 27 (02) : 323 - &