Measurement of observer agreement

被引:930
作者
Kundel, HL
Polansky, M
机构
[1] Univ Penn, Ctr Med, Dept Radiol, Philadelphia, PA 19104 USA
[2] Univ Penn, Ctr Med, MCP Hahnemann Sch Publ Hlth, Philadelphia, PA 19104 USA
关键词
diagnostic radiology; observer performance; statistical analysis;
D O I
10.1148/radiol.2282011860
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Statistical measures are described that are used in diagnostic imaging for expressing observer agreement in regard to categorical data. The measures are used to characterize the reliability of imaging methods and the reproducibility of disease classifications and, occasionally with great care, as the surrogate for accuracy. The review concentrates on the chance-corrected indices, kappa and weighted kappa. Examples from the imaging literature illustrate the method of calculation and the effects of both disease prevalence and the number of rating categorie. Other measures of agreement that are used less frequently, including multiple-rater kappa, are referenced and described briefly. (C) RSNA 2003.
引用
收藏
页码:303 / 308
页数:6
相关论文
共 26 条
  • [1] AGRESTI A, 1990, CATEGORICAL DATA ANA, P366
  • [2] Breast imaging reporting and data system standardized mammography lexicon: Observer variability in lesion description
    Baker, JA
    Kornguth, PJ
    Floyd, CE
    [J]. AMERICAN JOURNAL OF ROENTGENOLOGY, 1996, 166 (04) : 773 - 778
  • [3] TUBERCULOSIS CASE FINDING - A COMPARISON OF THE EFFECTIVENESS OF VARIOUS ROENTGENOGRAPHIC AND PHOTOFLUOROGRAPHIC METHODS
    BIRKELO, CC
    CHAMBERLAIN, WE
    PHELPS, PS
    SCHOOLS, PE
    ZACKS, D
    YERUSHALMY, J
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1947, 133 (06): : 359 - 366
  • [4] HIGH AGREEMENT BUT LOW KAPPA .2. RESOLVING THE PARADOXES
    CICCHETTI, DV
    FEINSTEIN, AR
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 1990, 43 (06) : 551 - 558
  • [5] VARIABILITY IN RADIOLOGISTS INTERPRETATIONS OF MAMMOGRAMS
    ELMORE, JG
    WELLS, CK
    LEE, CH
    HOWARD, DH
    FEINSTEIN, AR
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 1994, 331 (22) : 1493 - 1499
  • [6] OBSERVER VARIATION IN THE DETECTION OF OSTEOPENIA
    EPSTEIN, DM
    DALINKA, MK
    KAPLAN, FS
    ARONCHICK, JM
    MARINELLI, DL
    KUNDEL, HL
    [J]. SKELETAL RADIOLOGY, 1986, 15 (05) : 347 - 349
  • [7] HIGH AGREEMENT BUT LOW KAPPA .1. THE PROBLEMS OF 2 PARADOXES
    FEINSTEIN, AR
    CICCHETTI, DV
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 1990, 43 (06) : 543 - 549
  • [8] Fleiss J. L, 1981, STAT METHODS RATES P, P212
  • [9] RECEIVER OPERATOR CHARACTERISTIC (ROC) ANALYSIS WITHOUT TRUTH
    HENKELMAN, RM
    KAY, I
    BRONSKILL, MJ
    [J]. MEDICAL DECISION MAKING, 1990, 10 (01) : 24 - 29
  • [10] LIMITED CORRELATION OF LEFT-VENTRICULAR END-DIASTOLIC PRESSURE WITH RADIOGRAPHIC ASSESSMENT OF PULMONARY HEMODYNAMICS
    HERMAN, PG
    KHAN, A
    KALLMAN, CE
    ROJAS, KA
    CARMODY, DP
    BODENHEIMER, MM
    [J]. RADIOLOGY, 1990, 174 (03) : 721 - 724