Machine learning analyses of automated performance metrics during granular sub-stitch phases predict surgeon experience

被引:33
作者
Chen, Andrew B. [1 ]
Liang, Siqi [2 ]
Nguyen, Jessica H. [1 ]
Liu, Yan [2 ]
Hung, Andrew J. [1 ]
机构
[1] Univ Southern Calif, Inst Urol, Ctr Robot Simulat & Educ, Catherine & Joseph Aresty Dept Urol, Los Angeles, CA 90089 USA
[2] Univ Southern Calif, Dept Comp Sci, Viterbi Sch Engn, Los Angeles, CA 90089 USA
基金
美国国家卫生研究院;
关键词
TECHNICAL PERFORMANCE; SKILLS; VALIDATION;
D O I
10.1016/j.surg.2020.09.020
中图分类号
R61 [外科手术学];
学科分类号
摘要
Automated performance metrics objectively measure surgeon performance during a robot-assisted radical prostatectomy. Machine learning has demonstrated that automated performance metrics, especially during the vesico-urethral anastomosis of the robot-assisted radical prostatectomy, are predictive of long-term outcomes such as continence recovery time. This study focuses on automated performance metrics during the vesico-urethral anastomosis, specifically on stitch versus sub-stitch levels, to distinguish surgeon experience. During the vesico-urethral anastomosis, automated performance metrics, recorded by a systems data recorder (Intuitive Surgical, Sunnyvale, CA, USA), were reported for each overall stitch (C(tota)l) and its individual components: needle handling/targeting (C-1), needle driving (C-2), and suture cinching (C-3) (Fig 1, A). These metrics were organized into three datasets (GlobalSet [whole stitch], RowSet [independent sub-stitches], and ColumnSet [associated sub-stitches] (Fig 1, B) and applied to three machine learning models (AdaBoost, gradient boosting, and random forest) to solve two classifications tasks: experts (>= 100 cases) versus novices (<100 cases) and ordinary experts (>= 100 and <2,000 cases) versus super experts (>= 2,000 cases). Classification accuracy was determined using analysis of variance. Input features were evaluated through a Jaccard index. From 68 vesico-urethral anastomoses, we analyzed 1,570 stitches broken down into 4,708 sub-stitches. For both classification tasks, ColumnSet best distinguished experts (n = 8) versus novices (n = 9) and ordinary experts (n = 5) versus super experts (n = 3) at an accuracy of 0.774 and 0.844, respectively. Feature ranking highlighted Endowrist articulation and needle handling/targeting as most important in classifica-tion. Surgeon performance measured by automated performance metrics on a granular sub-stitch level more accurately distinguishes expertise when compared with summary automated performance metrics over whole stitches. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:1245 / 1249
页数:5
相关论文
共 16 条
[1]   Scaling to very very large corpora for natural language disambiguation [J].
Banko, M ;
Brill, E .
39TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2001, :26-33
[2]   Surgical Skill and Complication Rates after Bariatric Surgery [J].
Birkmeyer, John D. ;
Finks, Jonathan F. ;
O'Reilly, Amanda ;
Oerline, Mary ;
Carlin, Arthur M. ;
Nunn, Andre R. ;
Dimick, Justin ;
Banerjee, Mousumi ;
Birkmeyer, Nancy J. O. .
NEW ENGLAND JOURNAL OF MEDICINE, 2013, 369 (15) :1434-1442
[3]   Objective Assessment of Robotic Surgical Technical Skill: A Systematic Review [J].
Chen, Jian ;
Cheng, Nathan ;
Cacciamani, Giovanni ;
Oh, Paul ;
Lin-Brande, Michael ;
Remulla, Daphne ;
Gill, Inderbir S. ;
Hung, Andrew J. .
JOURNAL OF UROLOGY, 2019, 201 (03) :461-469
[4]   Use of Automated Performance Metrics to Measure Surgeon Performance during Robotic Vesicourethral Anastomosis and Methodical Development of a Training Tutorial [J].
Chen, Jian ;
Oh, Paul J. ;
Cheng, Nathan ;
Shah, Ankeet ;
Montez, Jeremy ;
Jarc, Anthony ;
Guo, Liheng ;
Gill, Inderbir S. ;
Hung, Andrew J. .
JOURNAL OF UROLOGY, 2018, 200 (04) :895-902
[5]   Understanding the Impact of Label Granularity on CNN-based Image Classification [J].
Chen, Zhuo ;
Ding, Ruizhou ;
Chin, Ting-Wu ;
Marculescu, Diana .
2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, :895-904
[6]   Crowd-sourced assessment of technical skills: an opportunity for improvement in the assessment of laparoscopic surgical skills [J].
Deal, Shanley B. ;
Lendvay, Thomas S. ;
Haque, Mohamad I. ;
Brand, Timothy ;
Comstock, Bryan ;
Warren, Justin ;
Alseidi, Adnan .
AMERICAN JOURNAL OF SURGERY, 2016, 211 (02) :398-404
[7]   The Effect of Technical Performance on Patient Outcomes in Surgery A Systematic Review [J].
Fecso, Andras B. ;
Szasz, Peter ;
Kerezov, Georgi ;
Grantcharov, Teodor P. .
ANNALS OF SURGERY, 2017, 265 (03) :492-501
[8]   Global Evaluative Assessment of Robotic Skills: Validation of a Clinical Assessment Tool to Measure Robotic Surgical Skills [J].
Goh, Alvin C. ;
Goldfarb, David W. ;
Sander, James C. ;
Miles, Brian J. ;
Dunkin, Brian J. .
JOURNAL OF UROLOGY, 2012, 187 (01) :247-252
[9]   Surgeon Performance Predicts Early Continence After Robot-Assisted Radical Prostatectomy [J].
Goldenberg, Mitchell G. ;
Goldenberg, Larry ;
Grantcharov, Teodor P. .
JOURNAL OF ENDOUROLOGY, 2017, 31 (09) :858-863
[10]   The Unreasonable Effectiveness of Data [J].
Halevy, Alon ;
Norvig, Peter ;
Pereira, Fernando .
IEEE INTELLIGENT SYSTEMS, 2009, 24 (02) :8-12