Multi-institutional Validation of Improved Vesicoureteral Reflux Assessment With Simple and Machine Learning Approaches

被引:14
作者
Khondker, Adree [1 ]
Kwong, Jethro C. C. [2 ]
Yadav, Priyank [1 ]
Chan, Justin Y. H. [2 ]
Singh, Anuradha [3 ]
Skreta, Marta [4 ,5 ]
Erdman, Lauren [4 ,5 ]
Keefe, Daniel T. [1 ,2 ,6 ]
Fischer, Katherine [7 ]
Tasian, Gregory [7 ]
Hannick, Jessica H. [8 ]
Papanikolaou, Frank [1 ,9 ]
Cooper, Benjamin J. [10 ]
Cooper, Christopher S. [10 ]
Rickard, Mandy [1 ]
Lorenzo, Armando J. [1 ,2 ]
机构
[1] Hosp Sick Children, Div Urol, Toronto, ON, Canada
[2] Univ Toronto, Dept Surg, Div Urol, Toronto, ON, Canada
[3] Hosp Sick Children, Dept Diagnost Imaging, Toronto, ON, Canada
[4] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
[5] Vector Inst, Toronto, ON, Canada
[6] IWK Hosp, Dept Surg, Halifax, NS, Canada
[7] Childrens Hosp Philadelphia, Div Urol, Philadelphia, PA USA
[8] Rainbow Babies & Childrens Hosp, Div Pediat Urol, Cleveland, OH USA
[9] Trillium Hlth Partners, Div Urol, Mississauga, ON, Canada
[10] Hosp Sick Children, Dept Urol, Iowa City, IA USA
关键词
vesico-ureteral reflux; machine learning; urography; reproducibility of results; RELIABILITY;
D O I
10.1097/JU.0000000000002987
中图分类号
R5 [内科学]; R69 [泌尿科学(泌尿生殖系疾病)];
学科分类号
1002 ; 100201 ;
摘要
Purpose:Vesicoureteral reflux grading from voiding cystourethrograms is highly subjective with low reliability. We aimed to demonstrate improved reliability for vesicoureteral reflux grading with simple and machine learning approaches using ureteral tortuosity and dilatation on voiding cystourethrograms.Materials and Methods:Voiding cystourethrograms were collected from our institution for training and 5 external data sets for validation. Each voiding cystourethrogram was graded by 5-7 raters to determine a consensus vesicoureteral reflux grade label and inter- and intra-rater reliability was assessed. Each voiding cystourethrogram was assessed for 4 features: ureteral tortuosity, proximal, distal, and maximum ureteral dilatation. The labels were then assigned to the combination of the 4 features. A machine learning-based model, qVUR, was trained to predict vesicoureteral reflux grade from these features and model performance was assessed by AUROC (area under the receiver-operator-characteristic).Results:A total of 1,492 kidneys and ureters were collected from voiding cystourethrograms resulting in a total of 8,230 independent gradings. The internal inter-rater reliability for vesicoureteral reflux grading was 0.44 with a median percent agreement of 0.71 and low intra-rater reliability. Higher values for each feature were associated with higher vesicoureteral reflux grade. qVUR performed with an accuracy of 0.62 (AUROC=0.84) with stable performance across all external data sets. The model improved vesicoureteral reflux grade reliability by 3.6-fold compared to traditional grading (P < .001).Conclusions:In a large pediatric population from multiple institutions, we show that machine learning-based assessment for vesicoureteral reflux improves reliability compared to current grading methods. qVUR is generalizable and robust with similar accuracy to clinicians but the added prognostic value of quantitative measures warrants further study.
引用
收藏
页码:1314 / 1322
页数:9
相关论文
共 26 条
[1]   Validation of the ureteral diameter ratio for predicting early spontaneous resolution of primary vesicoureteral reflux [J].
Arlen, Angela M. ;
Kirsch, Andrew J. ;
Leong, Traci ;
Cooper, Christopher S. .
JOURNAL OF PEDIATRIC UROLOGY, 2017, 13 (04) :383-388
[2]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[3]   Current status of artificial intelligence applications in urology and their potential to influence clinical practice [J].
Chen, Jian ;
Remulla, Daphne ;
Nguyen, Jessica H. ;
Aastha, D. ;
Liu, Yan ;
Dasgupta, Prokar ;
Hung, Andrew J. .
BJU INTERNATIONAL, 2019, 124 (04) :567-577
[4]   The natural history of neonatal vesicoureteral reflux associated with antenatal hydronephrosis [J].
Farhat, W ;
McLorie, G ;
Geary, D ;
Capolicchio, G ;
Bägli, D ;
Merguerian, P ;
Khoury, A .
JOURNAL OF UROLOGY, 2000, 164 (03) :1057-1060
[5]  
George D., 2020, IBM SPSS STAT 26 STE, DOI [10.4324/9780429056765, DOI 10.4324/9780429056765]
[6]  
Hoberman A, 2014, NEW ENGL J MED, V371, P1072, DOI [10.1056/NEJMoa1401811, 10.1056/NEJMc1408559]
[7]   Mild Fetal Renal Pelvis Dilatation-Much Ado About Nothing? [J].
Hothi, Daljit K. ;
Wade, Angie S. ;
Gilbert, Ruth ;
Winyard, Paul J. D. .
CLINICAL JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 2009, 4 (01) :168-177
[8]   A machine learning-based approach for quantitative grading of vesicoureteral reflux from voiding cystourethrograms: Methods and proof of concept [J].
Khondker, Adree ;
Kwong, Jethro C. C. ;
Rickard, Mandy ;
Skreta, Marta ;
Keefe, Daniel T. ;
Lorenzo, Armando J. ;
Erdman, Lauren .
JOURNAL OF PEDIATRIC UROLOGY, 2022, 18 (01) :78.e1-78.e7
[9]   Non-Animal Stabilized Hyaluronic Acid/Dextranomer Gel (NASHA/Dx, Deflux) for Endoscopic Treatment of Vesicoureteral Reflux: What Have We Learned Over the Last 20 Years? [J].
Kirsch, Andrew J. ;
Cooper, Christopher S. ;
Lackgren, Goran .
UROLOGY, 2021, 157 :15-28
[10]  
Koehrsen Will., 2018, Transfer Learning with Convolutional Neural Networks in PyTorch"