Observer reliability of arteriovenous malformations grading scales using current imaging modalities

被引:12
作者
Griessenauer, Christoph J. [1 ]
Miller, Joseph. H. [1 ]
Agee, Bonita S. [1 ]
Fisher, Winfield S., III [1 ]
Cure, Joel K. [2 ]
Chapman, Philip R. [2 ]
Foreman, Paul M. [1 ]
Fisher, Wilson A. M. [1 ]
Witcher, Adam C. [1 ]
Walters, Beverly C. [1 ]
机构
[1] Univ Alabama Birmingham, Dept Neurosurg, Birmingham, AL USA
[2] Univ Alabama Birmingham, Dept Radiol, Birmingham, AL USA
关键词
arteriovenous malformation; grading scale; reliability; interrater; intrarater; vascular disorders; AGREEMENT; CLASSIFICATION; COEFFICIENT; INTERRATER; SYSTEM; KAPPA;
D O I
10.3171/2014.2.JNS131262
中图分类号
R74 [神经病学与精神病学];
学科分类号
摘要
Object. The aim of this study was to examine observer reliability of frequently used arteriovenous malformation (AVM) grading scales, including the 5-tier Spetzler-Martin scale, the 3-tier Spetzler-Ponce scale, and the Pollock-Flickinger radiosurgery-based scale, using current imaging modalities in a setting closely resembling routine clinical practice. Methods. Five experienced raters, including 1 vascular neurosurgeon, 2 neuroradiologists, and 2 senior neurosurgical residents independently reviewed 15 MR1 studies, 15 CT angiograms, and 15 digital subtraction angiograms obtained at the time of initial diagnosis. Assessments of 5 scans of each imaging modality were repeated for measurement of intrarater reliability. Three months after the initial assessment, raters reassessed those scans where there was disagreement: In this second assessment, raters were asked to justify their rating with comments and illustrations. Generalized kappa (kappa) analysis for multiple raters, Kendall's coefficient of concordance (W), and interclass correlation coefficient(ICC) were applied to determine interrater reliability. For intrarater reliability analysis, Cohen's kappa (kappa), Kendall's correlation coefficient (tau-b), and ICC were used to assess repeat measurement agreement for each rater. Results. Interrater reliability for the overall 5-tier Spetzler-Martin scale was fair to good (ICC = 0.69) to extremely strong (Kendall's W = 0.73) on initial assessment and improved on reassessment. Assessment of CT angiograms resulted in the highest agreement, followed by MRI and digital subtraction angiography. Agreement for the overall 3-tier Spetzler-Ponce grade was fair to good (ICC = 0.68) to strong (Kendall's W = 0.70) on initial assessment, improved on reassessment, and was comparable to agreement for the 5-tier Spetzler-Martin scale. Agreement for the overall Pollock-Flickinger radiosurgery-based grade was excellent (ICC = 0.89) to extremely strong (Kendall's W = 0.81). Intrarater reliability for the overall 5-tier Spetzler-Martin grade was excellent (ICC > 0.75) in 3 of the 5 raters and fair to good (ICC > 0.40) in the other 2 raters. Conclusion. The 5-tier Spetzler-Martin scale, the 3-tier Spetzler-Ponce scale, and the Pollock-Flickinger radiosurgery-based scale all showed a high level of agreement. The improved reliability on reassessment was explained by a training effect from the initial assessment and the requirement to defend the rating, which outlines a potential downside for grades determined as part of routine clinical practice to be used for scientific purposes.
引用
收藏
页码:1179 / 1187
页数:9
相关论文
共 26 条
[1]   Observer agreement in the angiographic assessment of arteriovenous malformations of the brain [J].
Al-Shahi, R ;
Pal, N ;
Lewis, SC ;
Bhattacharya, JJ ;
Sellar, RJ ;
Warlow, CP .
STROKE, 2002, 33 (06) :1501-1508
[2]   Beyond kappa: A review of interrater agreement measures [J].
Banerjee, M .
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 1999, 27 (01) :3-23
[3]   Dependence of weighted kappa coefficients on the number of categories [J].
Brenner, H ;
Kliebsch, U .
EPIDEMIOLOGY, 1996, 7 (02) :199-202
[4]  
Chen B, P 30 ANN SAS US GROU
[5]   A COEFFICIENT OF AGREEMENT FOR NOMINAL SCALES [J].
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1960, 20 (01) :37-46
[6]   Interpreting Red Blood Cells in Lumbar Puncture: Distinguishing True Subarachnoid Hemorrhage From Traumatic Tap [J].
Czuczman, Amanda D. ;
Thomas, Lisa E. ;
Boulanger, Alyson B. ;
Peak, David A. ;
Senecal, Emily L. ;
Brown, David F. ;
Marill, Keith A. .
ACADEMIC EMERGENCY MEDICINE, 2013, 20 (03) :247-256
[7]   Interobserver variability in grading of brain arteriovenous malformations using the Spetzler-Martin system [J].
Du, R ;
Dowd, CF ;
Johnston, SC ;
Young, WL ;
Lawton, MT .
NEUROSURGERY, 2005, 57 (04) :668-674
[8]  
Fleiss J., 1986, Reliability of measurement: the design and analysis of clinical experiments
[9]  
FLEISS JL, 1971, PSYCHOL BULL, V76, P378, DOI 10.1037/h0031619
[10]  
Fleiss JL., 1981, STAT METHODS RATES P