A graphical method for assessing agreement with the mean between multiple observers using continuous measures

Cited by: 84
Authors
Jones, Mark [1]
Dobson, Annette [1]
O'Brian, Sue [2]
Affiliations
[1] Univ Queensland, Sch Populat Hlth, Herston, Qld 4006, Australia
[2] Univ Sydney, Australian Stuttering Res Ctr, Sydney, NSW 2006, Australia
Keywords
Accuracy; precision; reliability; reproducibility; agreement
DOI
10.1093/ije/dyr109
Chinese Library Classification (CLC) number
R1 [Preventive medicine, hygiene]
Subject classification codes
1004; 120402
Abstract
Methods: We aimed to develop a simple graphical method to assess agreement between multiple observers using continuous measurements. The Bland-Altman graphical method for assessing agreement between two observers using continuous measures was modified and extended to accommodate multiple observers. Mathematical formulae are derived and real data examples are used to illustrate the proposed method.
Results: The examples show that the proposed graphical method of assessing agreement provides clinically useful information. This information includes estimates of the limits of agreement with the mean and a visual means for determining these limits over the range of measurements. In a data example that included five readers' measurements of 40 lung lesions, the intra-class correlation (ICC) was 0.84, indicating that readers can reliably measure the lesions. However, the estimated limits of agreement with the mean were -1.1 to 1.1 cm, implying that the readers' measurements can plausibly differ from the mean estimated tumour size by more than 1 cm. This is a clinically significant difference according to the study authors. In addition, a plot of the limits of agreement with the mean by mean tumour size shows heterogeneous agreement, presumably due to the varying degrees of definition at the edge of the lesions.
Conclusions: The proposed graphical method of assessing agreement can be used alongside other measures such as ICC for reporting on reproducibility in studies of multiple observers making continuous measurements.
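The idea of limits of agreement with the mean can be illustrated with a minimal Python sketch. This is not the authors' implementation: the record does not reproduce the paper's formulae, so the sketch assumes the limits are taken as plus or minus 1.96 times the standard deviation of each observer's deviation from the per-subject mean, and the paper's own variance estimator may differ. The function name `limits_of_agreement_with_mean` and its parameters are hypothetical.

```python
import math


def limits_of_agreement_with_mean(measurements, z=1.96):
    """Sketch of limits of agreement with the mean (assumed form, see lead-in).

    measurements: list of rows, one row per subject, one value per observer.
    Returns (lower, upper, points), where points is a list of
    (subject_mean, deviation) pairs suitable for a scatter plot.
    """
    points = []
    for row in measurements:
        subject_mean = sum(row) / len(row)
        # Each observer's deviation from the mean of all observers
        for value in row:
            points.append((subject_mean, value - subject_mean))

    # Deviations average to zero within every subject by construction,
    # so their mean square is used here as the variance of a deviation.
    mean_square = sum(dev * dev for _, dev in points) / len(points)
    sd = math.sqrt(mean_square)

    # Assumption: normal-theory limits at +/- z standard deviations
    return -z * sd, z * sd, points
```

Plotting the returned `points` (deviation on the vertical axis, subject mean on the horizontal axis) with horizontal lines at the two limits gives a picture in the spirit of the plot described in the abstract; a spread of deviations that changes with the subject mean would suggest heterogeneous agreement.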
Pages: 1308-1313
Page count: 6