Inter-rater agreement and reliability of the COSMIN (COnsensus-based Standards for the selection of health status Measurement Instruments) Checklist

被引:180
作者
Mokkink, Lidwine B. [1 ,2 ]
Terwee, Caroline B. [1 ,2 ]
Gibbons, Elizabeth [3 ]
Stratford, Paul W. [4 ,5 ]
Alonso, Jordi [6 ]
Patrick, Donald L. [7 ]
Knol, Dirk L. [1 ,2 ]
Bouter, Lex M. [1 ,2 ,8 ]
de Vet, Henrica C. W. [1 ,2 ]
机构
[1] Vrije Univ Amsterdam Med Ctr, Dept Epidemiol & Biostat, Amsterdam, Netherlands
[2] Vrije Univ Amsterdam Med Ctr, EMGO Inst Hlth & Care Res, Amsterdam, Netherlands
[3] Univ Oxford, Patient Reported Outcome Measurement Grp, Dept Publ Hlth, Oxford, England
[4] McMaster Univ, Dept Clin Epidemiol & Biostat, Hamilton, ON, Canada
[5] McMaster Univ, Sch Rehabil Sci, Hamilton, ON, Canada
[6] CIBER Epidemiol & Salud Publ CIBERESP, Barcelona, Spain
[7] Univ Washington, Dept Hlth Serv, Seattle, WA 98195 USA
[8] Vrije Univ Amsterdam, Execut Board, Amsterdam, Netherlands
关键词
QUALITY; KAPPA;
D O I
10.1186/1471-2288-10-82
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: The COSMIN checklist is a tool for evaluating the methodological quality of studies on measurement properties of health-related patient-reported outcomes. The aim of this study is to determine the inter-rater agreement and reliability of each item score of the COSMIN checklist (n = 114). Methods: 75 articles evaluating measurement properties were randomly selected from the bibliographic database compiled by the Patient-Reported Outcome Measurement Group, Oxford, UK. Raters were asked to assess the methodological quality of three articles, using the COSMIN checklist. In a one-way design, percentage agreement and intraclass kappa coefficients or quadratic-weighted kappa coefficients were calculated for each item. Results: 88 raters participated. Of the 75 selected articles, 26 articles were rated by four to six participants, and 49 by two or three participants. Overall, percentage agreement was appropriate (68% was above 80% agreement), and the kappa coefficients for the COSMIN items were low (61% was below 0.40, 6% was above 0.75). Reasons for low inter-rater agreement were need for subjective judgement, and accustom to different standards, terminology and definitions. Conclusions: Results indicated that raters often choose the same response option, but that it is difficult on item level to distinguish between articles. When using the COSMIN checklist in a systematic review, we recommend getting some training and experience, completing it by two independent raters, and reaching consensus on one final rating. Instructions for using the checklist are improved.
引用
收藏
页数:11
相关论文
共 12 条
  • [1] Fleiss JL., 1981, STAT METHODS RATES P
  • [2] Kappa coefficients in medical research
    Kraemer, HC
    Periyakoil, VS
    Noda, A
    [J]. STATISTICS IN MEDICINE, 2002, 21 (14) : 2109 - 2129
  • [3] ONE-WAY COMPONENTS OF VARIANCE MODEL FOR CATEGORICAL DATA
    LANDIS, JR
    KOCH, GG
    [J]. BIOMETRICS, 1977, 33 (04) : 671 - 679
  • [4] A unified approach for assessing agreement for continuous and categorical data
    Lin, Lawrence
    Hedayat, A. S.
    Wu, Wenting
    [J]. JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2007, 17 (04) : 629 - 652
  • [5] Evaluating the quality of reporting occupational therapy randomized controlled trials by expanding the CONSORT criteria
    Moberg-Mogren, E
    Nelson, DL
    [J]. AMERICAN JOURNAL OF OCCUPATIONAL THERAPY, 2006, 60 (02) : 226 - 235
  • [6] Mokkink LB., COSMIN checklist manual
  • [7] The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes
    Mokkink, Lidwine B.
    Terwee, Caroline B.
    Patrick, Donald L.
    Alonso, Jordi
    Stratford, Paul W.
    Knol, Dirk L.
    Bouter, Lex M.
    de Vet, Henrica C. W.
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 2010, 63 (07) : 737 - 745
  • [8] The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study
    Mokkink, Lidwine B.
    Terwee, Caroline B.
    Patrick, Donald L.
    Alonso, Jordi
    Stratford, Paul W.
    Knol, Dirk L.
    Bouter, Lex M.
    de Vet, Henrica C. W.
    [J]. QUALITY OF LIFE RESEARCH, 2010, 19 (04) : 539 - 549
  • [9] The COSMIN checklist for evaluating the methodological quality of studies on measurement properties: A clarification of its content
    Mokkink, Lidwine B.
    Terwee, Caroline B.
    Knol, Dirk L.
    Stratford, Paul W.
    Alonso, Jordi
    Patrick, Donald L.
    Bouter, Lex M.
    de Vet, Henrica C. W.
    [J]. BMC MEDICAL RESEARCH METHODOLOGY, 2010, 10
  • [10] Reproducibility of the STARD checklist: An instrument to assess the quality of reporting of diagnostic accuracy studies
    Smidt N.
    Rutjes A.W.S.
    Van Der Windt D.A.W.M.
    Ostelo R.W.J.G.
    Bossuyt P.M.
    Reitsma J.B.
    Bouter L.M.
    De Vet H.C.W.
    [J]. BMC Medical Research Methodology, 6 (1)