International variation in histologic grading is large, and persistent feedback does not improve reproducibility

被引:165
作者
Furness, PN [1 ]
Taub, N [1 ]
Assmann, KJM [1 ]
Banfi, G [1 ]
Cosyns, JP [1 ]
Dorman, AM [1 ]
Hill, CM [1 ]
Kapper, SK [1 ]
Waldherr, R [1 ]
Laurinavicius, A [1 ]
Marcussen, N [1 ]
Martins, AP [1 ]
Nogueira, M [1 ]
Regele, H [1 ]
Seron, D [1 ]
Carrera, M [1 ]
Sund, S [1 ]
Taskinen, EI [1 ]
Paavonen, T [1 ]
Tihomirova, T [1 ]
Rosenthal, R [1 ]
机构
[1] Leicester Gen Hosp, Clin Sci Labs, Leicester LE5 4PW, Leics, England
关键词
histology; grading; scoring; external quality assessment;
D O I
10.1097/00000478-200306000-00012
中图分类号
R36 [病理学];
学科分类号
100104 ;
摘要
Histologic grading systems are used to guide diagnosis, therapy, and audit on an international basis. The reproducibility of grading systems is usually tested within small groups of pathologists who have previously worked or trained together. This may underestimate the international variation of scoring systems. We therefore evaluated the reproducibility of an established system, the Banff classification of renal allograft pathology, throughout Europe. We also sought to improve reproducibility by providing individual feedback after each of 14 small groups of cases. Kappa values for all features studied were lower than any previously published, confirming that international variation is greater than interobserver variation as previously assessed. A prolonged attempt to improve reproducibility, using numeric or graphical feedback, failed to produce any detectable improvement. We then asked participants to grade selected photographs, to eliminate variation induced by pathologists viewing different areas of the slide. This produced improved kappa values only for some features. Improvement was influenced by the nature of the grade definitions. Definitions based on "area affected" by a process were not improved. The results indicate the danger of basing decisions on grading systems that may be applied very differently in different institutions.
引用
收藏
页码:805 / 810
页数:6
相关论文
共 18 条
  • [1] [Anonymous], STAT METHODS RATES P
  • [2] BROWN L, NATL GYNAECOLOGICAL
  • [3] Offline telepathology diagnosis of colorectal polyps:: a study of interobserver agreement and comparison with glass slide diagnoses
    Cross, SS
    Burton, JL
    Dubé, AK
    Feeley, KM
    Lumb, PD
    Stephenson, TJ
    Start, RD
    [J]. JOURNAL OF CLINICAL PATHOLOGY, 2002, 55 (04) : 305 - 308
  • [4] Observer accuracy in estimating proportions in images: implications for the semiquantitative assessment of staining reactions and a proposal for a new system
    Cross, SS
    [J]. JOURNAL OF CLINICAL PATHOLOGY, 2001, 54 (05) : 385 - 390
  • [5] EFFORTS TO IMPROVE INTEROBSERVER AGREEMENT IN HISTOPATHOLOGICAL GRADING
    DEVET, HCW
    KOUDSTAAL, J
    KWEE, WS
    WILLEBRAND, D
    ARENDS, JW
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 1995, 48 (07) : 869 - 873
  • [6] Consistency in the observation of features used to classify duct carcinoma in situ (DCIS) of the breast
    Douglas-Jones, AG
    Morgan, JM
    Appleton, MAC
    Attanoos, RL
    Caslin, A
    Champ, CS
    Cotter, M
    Dallimore, NS
    Dawson, A
    Fortt, RW
    Griffiths, AP
    Hughes, M
    Kitching, PA
    O'Brien, C
    Rashid, AM
    Stock, D
    Verghese, A
    Williams, DW
    Williams, NW
    Williams, S
    [J]. JOURNAL OF CLINICAL PATHOLOGY, 2000, 53 (08) : 596 - 602
  • [7] International variation in the interpretation of renal transplant biopsies: Report of the CERTPAP Project
    Furness, PN
    Taub, N
    [J]. KIDNEY INTERNATIONAL, 2001, 60 (05) : 1998 - 2012
  • [8] Reproducibility of the Banff schema in reporting protocol biopsies of stable renal allografts
    Gough, J
    Rush, D
    Jeffery, J
    Nickerson, P
    McKenna, R
    Solez, K
    Trpkov, K
    [J]. NEPHROLOGY DIALYSIS TRANSPLANTATION, 2002, 17 (06) : 1081 - 1084
  • [9] Husain OAN, 1997, ACTA CYTOL, V41, P1439
  • [10] Keenan SJ, 2000, J PATHOL, V192, P351, DOI 10.1002/1096-9896(2000)9999:9999<::AID-PATH708>3.0.CO