Artificial Intelligence in Surgery: A Systematic Review of Use and Validation

Cited by: 3
Authors
Kenig, Nitzan [1]
Echeverria, Javier Monton [2]
Vives, Aina Muntaner [3]
Affiliations
[1] Quironsalud Palmaplanas Hosp, Dept Plast Surg, Palma De Mallorca 07010, Spain
[2] Albacete Univ Hosp, Dept Plast Surg, Albacete 02006, Spain
[3] Son Llatzer Univ Hosp, Dept Otolaryngol, Palma De Mallorca 07198, Spain
Keywords
artificial intelligence; machine learning; surgery; validation; human-machine interaction; LYMPH-NODE METASTASIS; MACHINE LEARNING-MODELS; NEURAL-NETWORK; HEPATOCELLULAR-CARCINOMA; EXTERNAL VALIDATION; COMPUTED-TOMOGRAPHY; AUTOMATED DETECTION; COLORECTAL-CANCER; DECISION-SUPPORT; CARDIAC-SURGERY;
DOI
10.3390/jcm13237108
Chinese Library Classification (CLC) number
R5 [Internal Medicine]
Discipline classification code
1002; 100201
Abstract
Background: Artificial Intelligence (AI) holds promise for transforming healthcare, with AI models gaining increasing clinical use in surgery. However, new AI models are developed without established standards for their validation and use. Before AI can be widely adopted, it is crucial to ensure these models are both accurate and safe for patients. Without proper validation, there is a risk of integrating AI models into practice without sufficient evidence of their safety and accuracy, potentially leading to suboptimal patient outcomes. In this work, we review the current use and validation methods of AI models in clinical surgical settings and propose a novel classification system. Methods: A systematic review was conducted in PubMed and Cochrane using the keywords "validation", "artificial intelligence", and "surgery", following PRISMA guidelines. Results: The search yielded a total of 7627 articles, of which 102 were included for data extraction, encompassing 2,837,211 patients. A validation classification system named Surgical Validation Score (SURVAS) was developed. The primary applications of models were risk assessment and decision-making in the preoperative setting. Validation methods were ranked as high evidence in only 45% of studies, and only 14% of the studies provided publicly available datasets. Conclusions: AI has significant applications in surgery, but validation quality remains suboptimal, and public data availability is limited. Current AI applications are mainly focused on preoperative risk assessment and are suggested to improve decision-making. Classification systems such as SURVAS can help clinicians confirm the degree of validity of AI models before their application in practice.
Pages: 27