The performance of AI in medical examinations: an exploration of ChatGPT in ultrasound medical education

被引：1

作者：

Hong, Dao-Rong ^{[1
]}

Huang, Chun-Yan ^{[2
]}

机构：

[1] Fujian Med Univ, Affiliated Hosp 2, Dept Ultrasonog, Quanzhou, Fujian, Peoples R China

[2] Fujian Med Univ, Affiliated Hosp 2, Dept Gen Practice, Quanzhou, Fujian, Peoples R China

来源：

FRONTIERS IN MEDICINE | 2024年 / 11卷

关键词：

ChatGPT; ultrasound medicine; medical education; artificial intelligence (AI); examination;

D O I：

10.3389/fmed.2024.1472006

中图分类号：

R5 [内科学];

学科分类号：

1002 ; 100201 ;

摘要：

Objective This study aims to evaluate the accuracy of ChatGPT in the context of China's Intermediate Professional Technical Qualification Examination for Ultrasound Medicine, exploring its potential role in ultrasound medical education.Methods A total of 100 questions, comprising 70 single-choice and 30 multiple-choice questions, were selected from the examination's question bank. These questions were categorized into four groups: basic knowledge, relevant clinical knowledge, professional knowledge, and professional practice. ChatGPT versions 3.5 and 4.0 were tested, and accuracy was measured based on the proportion of correct answers for each version.Results ChatGPT 3.5 achieved an accuracy of 35.7% for single-choice and 30.0% for multiple-choice questions, while version 4.0 improved to 61.4 and 50.0%, respectively. Both versions performed better in basic knowledge questions but showed limitations in professional practice-related questions. Version 4.0 demonstrated significant improvements across all categories compared to version 3.5, but it still underperformed when compared to resident doctors in certain areas.Conclusion While ChatGPT did not meet the passing criteria for the Intermediate Professional Technical Qualification Examination in Ultrasound Medicine, its strong performance in basic medical knowledge suggests potential as a supplementary tool in medical education. However, its limitations in addressing professional practice tasks need to be addressed.

引用

页数：5

共 15 条

[1] ChatGPT in Clinical Toxicology [J].

Abdel-Messih, Mary Sabry ;

Boulos, Maged N. Kamel .

JMIR MEDICAL EDUCATION, 2023, 9

[2] Evaluating the Performance of ChatGPT in Ophthalmology [J].

Antaki, Fares ;

Touma, Samir ;

Milad, Daniel ;

El -Khoury, Jonathan ;

Duval, Renaud .

OPHTHALMOLOGY SCIENCE, 2023, 3 (04)

[3] Performance of ChatGPT on a Radiology Board-style Examination: Insights into Current Strengths and Limitations [J].

Bhayana, Rajesh ;

Krishna, Satheesh ;

Bleakney, Robert R. .

RADIOLOGY, 2023, 307 (05)

[4] ChatGPT and Generative Artificial Intelligence for Medical Education: Potential Impact and Opportunity [J].

Boscardin, Christy K. ;

Gin, Brian ;

Golde, Polo Black ;

Hauer, Karen E. .

ACADEMIC MEDICINE, 2024, 99 (01) :22-27

[5]

Castelvecchi Davide, 2022, Nature, DOI 10.1038/d41586-022-04383-z

[6] How Does ChatGPT Perform on the United States Medical Licensing Examination (USMLE)? The Implications of Large Language Models for Medical Education and Knowledge Assessment [J].

Gilson, Aidan ;

Safranek, Conrad W. ;

Huang, Thomas ;

Socrates, Vimig ;

Chi, Ling ;

Taylor, Richard Andrew ;

Chartash, David .

JMIR MEDICAL EDUCATION, 2023, 9

[7] ChatGPT makes medicine easy to swallow: an exploratory case study on simplified radiology reports [J].

Jeblick, Katharina ;

Schachtner, Balthasar ;

Dexl, Jakob ;

Mittermeier, Andreas ;

Stueber, Anna Theresa ;

Topalis, Johanna ;

Weber, Tobias ;

Wesp, Philipp ;

Sabel, Bastian Oliver ;

Ricke, Jens ;

Ingrisch, Michael .

EUROPEAN RADIOLOGY, 2024, 34 (05) :2817-2825

[8] The application of Chat Generative Pre-trained Transformer in nursing education [J].

Liu, Jialin ;

Liu, Fan ;

Fang, Jinbo ;

Liu, Siru .

NURSING OUTLOOK, 2023, 71 (06)

[9] Artificial Intelligence and Objective Structured Clinical Examinations: Using ChatGPT to Revolutionize Clinical Skills Assessment in Medical Education [J].

Misra, Sanghamitra M. ;

Suresh, Srinivasan .

JOURNAL OF MEDICAL EDUCATION AND CURRICULAR DEVELOPMENT, 2024, 11

[10] Artificial Intelligence in Ophthalmology: A Comparative Analysis of GPT-3.5, GPT-4, and Human Expertise in Answering StatPearls Questions [J].

Moshirfar, Majid ;

Altaf, Amal W. ;

Stoakes, Isabella M. ;

Tuttle, Jared J. ;

Hoopes, Phillip C. .

CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (06)

← 1 2 →