Six of one, half a dozen of the other: A measure of multidisciplinary inter/intra-rater reliability of the society for fetal urology and urinary tract dilation grading systems for hydronephrosis

被引:22
作者
Rickard, Mandy [1 ,2 ]
Easterbrook, Bethany [2 ,3 ]
Kim, Soojin [4 ]
Farrokhyar, Forough [4 ]
Stein, Nina [5 ]
Arora, Steven [6 ]
Belostotsky, Vladamir [6 ]
DeMaria, Jorge [1 ,2 ,3 ,4 ]
Lorenzo, Armando J. [2 ,7 ]
Braga, Luis H. [1 ,2 ,3 ,4 ]
机构
[1] McMaster Childrens Hosp, Hamilton, ON, Canada
[2] McMaster Childrens Hosp, Clin Urol Res Enterprise CURE Program, Hamilton, ON, Canada
[3] McMaster Univ, McMaster Pediat Surg Res Collaborat, Dept Surg, Div Urol, Hamilton, ON, Canada
[4] McMaster Univ, Div Urol, Dept Surg, Hamilton, ON, Canada
[5] McMaster Univ, Dept Radiol, Hamilton, ON, Canada
[6] McMaster Univ, Dept Pediat, Hamilton, ON, Canada
[7] Hosp Sick Children, Div Urol, Toronto, ON, Canada
关键词
Hydronephrosis; Urinary tract dilation; Multidisciplinary; Inter-rater reliability; Intra-rater reliability; Ultrasound; Pediatrics; PRENATAL HYDRONEPHROSIS; POSTNATAL EVALUATION; MANAGEMENT; CLASSIFICATION;
D O I
10.1016/j.jpurol.2016.09.005
中图分类号
R72 [儿科学];
学科分类号
100202 ;
摘要
Introduction The urinary tract dilation (UTD) classification system was introduced to standardize terminology in the reporting of hydronephrosis (HN), and bridge a gap between pre- and postnatal classification such as the Society for Fetal Urology (SFU) grading system. Herein we compare the intra/inter-rater reliability of both grading systems. Materials and methods SFU (I-IV) and UTD (I-III) grades were independently assigned by 13 raters (9 pediatric urology staff, 2 nephrologists, 2 radiologists), twice, 3 weeks apart, to 50 sagittal postnatal ultrasonographic views of hydronephrotic kidneys. Data regarding ureteral measurements and bladder abnormalities were included to allow proper UTD categorization. Ten images were repeated to assess intra-rater reliability. Krippendorff's alpha coefficient was used to measure overall and by grade intra/inter-rater reliability. Reliability between specialties and training levels were also analyzed. Results Overall inter-rater reliability was slightly higher for SFU (alpha = 0.842, 95% CI 0.812-0.879, in session 1; and alpha = 0.808, 95% CI 0.775-0.839, in session 2) than for UTD (alpha = 0.774, 95% CI 0.715-0.827, in session 1; and alpha = 0.679, 95% CI 0.605-0.750, in session 2). Reliability for intermediate grades (SFU II/III and UTD 2) of HN was poor regardless of the system. Reliabilities for SFU and UTD classifications among Urology, Nephrology, and Radiology, as well as between training levels were not significantly different. Discussion Despite the introduction of HN grading systems to standardize the interpretation and reporting of renal ultrasound in infants with HN, none have been proven superior in allowing clinicians to distinguish between "moderate" grades. While this study demonstrated high reliability in distinguishing between "mild" (SFU I/II and UTD 1) and "severe" (SFU IV and UTD 3) grades of HN, the overall reliability between specialties was poor. This is in keeping with a previous report of modest inter-rater reliability of the SFU system. This drawback is likely explained by the subjective interpretation required to assign grades, which can be impacted by experience, image quality, and scanning technique. As shown in the figure, which demonstrates SFU II (a) and SFU III (b), as assigned by a radiologist, it is possible to make an argument that either of these images can be classified into both categories that were observed during the grading sessions of this study. Conclusion Although both systems have acceptable reliability, the SFU grading system showed higher overall intra/inter-rater reliability regardless of rater specialty than the UTD classification. Inter-rater reliability for SFU grades II/III and UTD 2 was low, highlighting the limitations of both classifications in regards to properly segregating moderate HN grades.
引用
收藏
页码:80.e1 / 80.e5
页数:5
相关论文
共 20 条
[1]  
Arora S, 2015, J PEDIAT UROL, P1
[2]   Postnatal evaluation of infants with an abnormal antenatal renal sonogram [J].
Becker, Amy M. .
CURRENT OPINION IN PEDIATRICS, 2009, 21 (02) :207-213
[3]   ULTRASOUND GRADING OF HYDRONEPHROSIS - INTRODUCTION TO THE SYSTEM USED BY THE SOCIETY-FOR-FETAL-UROLOGY [J].
FERNBACH, SK ;
MAIZELS, M ;
CONWAY, JJ .
PEDIATRIC RADIOLOGY, 1993, 23 (06) :478-480
[4]  
Giron AM, 2011, INT BRAZ J UROL, V37, P605, DOI [10.1590/S1677-55382011000500006, 10.1590/S1677-55382011000400032]
[5]   Evaluation of Urinary Tract Dilation Classification System for Grading Postnatal Hydronephrosis [J].
Hodhod, Amr ;
Capolicchio, John-Paul ;
Jednak, Roman ;
El-Sherif, Eid ;
El-Doray, Abd El-Alim ;
El-Sherbiny, Mohamed .
JOURNAL OF UROLOGY, 2016, 195 (03) :725-730
[6]  
Holsti OR, 1968, HDB SOC PSYCHOL, V1, P403
[7]   Reliability assessment of Society for Fetal Urology ultrasound grading system for hydronephrosis [J].
Keays, M. A. ;
Guerra, L. A. ;
Mihill, J. ;
Raju, G. ;
Al-Asheeri, N. ;
Geier, P. ;
Gaboury, I. ;
Matzinger, M. ;
Pike, J. ;
Leonard, M. P. .
JOURNAL OF UROLOGY, 2008, 180 (04) :1680-1682
[8]   Comparison of the reliability of two hydronephrosis grading systems: The Society for Foetal Urology grading system vs. the Onen grading system [J].
Kim, S. -Y. ;
Kim, M. -J. ;
Yoon, C. S. ;
Lee, M. S. ;
Han, K. H. ;
Lee, M. -J. .
CLINICAL RADIOLOGY, 2013, 68 (09) :E484-E490
[9]   Reliability in content analysis - Some common misconceptions and recommendations [J].
Krippendorff, K .
HUMAN COMMUNICATION RESEARCH, 2004, 30 (03) :411-433
[10]   Prediction of the outcome of antenatally diagnosed hydronephrosis: A multivariable analysis [J].
Longpre, Michelle ;
Nguan, Angela ;
MacNeily, Andrew E. ;
Afshar, Kourosh .
JOURNAL OF PEDIATRIC UROLOGY, 2012, 8 (02) :135-139