Using Artificial Intelligence to Label Free-Text Operative and Ultrasound Reports for Grading Pediatric Appendicitis

被引:9
作者
Abu-Ashour, Waseem [1 ,2 ]
Emil, Sherif [1 ,2 ]
Poenaru, Dan [1 ,2 ]
机构
[1] McGill Univ, Montreal Childrens Hosp, Hlth Ctr, Harvey E Beardmore Div Pediat Surg, Montreal, PQ, Canada
[2] McGill Univ, Res Inst, Hlth Ctr, Montreal, PQ, Canada
关键词
Pediatric appendicitis; Artificial intelligence; Diagnosis; Appendicitis grade; Comparative study; DEFINITION;
D O I
10.1016/j.jpedsurg.2024.01.033
中图分类号
R72 [儿科学];
学科分类号
100202 ;
摘要
Purpose: Data science approaches personalizing pediatric appendicitis management are hampered by small datasets and unstructured electronic medical records (EMR). Artificial intelligence (AI) chatbots based on large language models can structure free-text EMR data. We compare data extraction quality between ChatGPT-4 and human data collectors. Methods: To train AI models to grade pediatric appendicitis preoperatively, several data collectors extracted detailed preoperative and operative data from 2100 children operated for acute appendicitis. Collectors were trained for the task based on satisfactory Kappa scores. ChatGPT-4 was prompted to structure free text from 103 random anonymized ultrasound and operative records in the dataset using the set variables and coding options, and to estimate appendicitis severity grade from the operative report. A pediatric surgeon then adjudicated all data, identifying errors in each method. Results: Within the 44 ultrasound (42.7%) and 32 operative reports (31.1%) discordant in at least one field, 98% of the errors were found in the manual data extraction. The appendicitis grade was erroneously assigned manually in 29 patients (28.2%), and by ChatGPT-4 in 3 (2.9%). Across datasets, the use of the AI chatbot was able to avoid misclassification in 59.2% of the records including both reports and extracted data approximately 40 times faster. Conclusion: AI chatbot significantly outperformed manual data extraction in accuracy for ultrasound and operative reports, and correctly assigned the appendicitis grade. While wider validation is required and data safety concerns must be addressed, these AI tools show significant promise in improving the accuracy and efficiency of research data collection. Levels of Evidence: Level III. (c) 2024 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:783 / 790
页数:8
相关论文
共 50 条
[1]   Chatbots: History, technology, and applications [J].
Adamopoulou, Eleni ;
Moussiades, Lefteris .
MACHINE LEARNING WITH APPLICATIONS, 2020, 2
[2]   Clinical prediction rules [J].
Adams, Simon T. ;
Leveson, Stephen H. .
BMJ-BRITISH MEDICAL JOURNAL, 2012, 344
[3]  
Adamson B, 2023, medRxiv, DOI [10.1101/2023.03.02.23286522, 10.1101/2023.03.02.23286522, DOI 10.1101/2023.03.02.23286522]
[4]   The Use of Machine Learning Approaches for the Diagnosis of Acute Appendicitis [J].
Akmese, Omer F. ;
Dogan, Gul ;
Kor, Hakan ;
Erbay, Hasan ;
Demir, Emre .
EMERGENCY MEDICINE INTERNATIONAL, 2020, 2020
[5]   Readership Awareness Series - Paper 4: Chatbots and ChatGPT - Ethical Considerations in Scientific Publications [J].
Ali, Mohammad Javed ;
Djalilian, Ali .
SEMINARS IN OPHTHALMOLOGY, 2023, 38 (05) :403-404
[6]   Using ChatGPT to write patient clinic letters [J].
Ali, Stephen R. ;
Dobbs, Thomas D. ;
Hutchings, Hayley A. ;
Whitaker, Iain S. .
LANCET, 2023, 5 (04) :E179-E181
[7]   I, Chatbot: Modeling the determinants of users' satisfaction and continuance intention of AI-powered service agents [J].
Ashfaq, Muhammad ;
Yun, Jiang ;
Yu, Shubin ;
Correia Loureiro, Sandra Maria .
TELEMATICS AND INFORMATICS, 2020, 54
[8]  
Birnbaum B, 2020, Arxiv, DOI [arXiv:2001.09765, 10.48550/arxiv.2001.09765, DOI 10.48550/ARXIV.2001.09765]
[9]  
Brown TB, 2020, ADV NEUR IN, V33
[10]   Deep learning in computer vision: A critical review of emerging techniques and application scenarios [J].
Chai, Junyi ;
Zeng, Hao ;
Li, Anming ;
Ngai, Eric W. T. .
MACHINE LEARNING WITH APPLICATIONS, 2021, 6