Evaluating GPT-4V's performance in the Japanese national dental examination: A challenge explored

被引:10
作者
Morishita, Masaki [1 ,2 ]
Fukuda, Hikaru [3 ]
Muraoka, Kosuke [1 ]
Nakamura, Taiji [4 ]
Hayashi, Masanari [5 ]
Yoshioka, Izumi [6 ]
Ono, Kentaro [7 ]
Awano, Shuji [1 ]
机构
[1] Kyushu Dent Univ, Dept Oral Funct, Div Clin Educ Dev & Res, 2-6-1 Manazuru, Kokurakita 8038580, Japan
[2] Kyushu Dent Univ Hosp, Hlth Informat Management Off, Kitakyushu, Japan
[3] Kyushu Dent Univ, Dept Phys Funct, Div Maxillofacial Surg, Kitakyushu, Japan
[4] Kyushu Dent Univ, Dept Oral Funct, Div Periodontol, Kitakyushu, Japan
[5] Kyushu Dent Univ Hosp, Adm Dept, Kitakyushu, Japan
[6] Dept Phys Funct, Div Oral Med, Kitakyushu, Japan
[7] Kyushu Dent Univ, Dept Hlth Promot, Div Physiol, Kitakyushu, Japan
关键词
ChatGPT-4V; Image recognition; Medical image analysis; National dental examination;
D O I
10.1016/j.jds.2023.12.007
中图分类号
R78 [口腔科学];
学科分类号
1003 ;
摘要
Background/purpose: Rapid advancements in AI technology have led to significant interest in its application across various fields, including medicine and dentistry. This study aimed to assess the capabilities of ChatGPT-4V with image recognition in answering imagebased questions from the Japanese National Dental Examination (JNDE) to explore its potential as an educational support tool for dental students. Materials and methods: The dataset used questions from the JNDE, which was conducted in January 2023, with a focus on image -related queries. ChatGPT-4V was utilized, and standardized prompts, question texts, and images were input. Data and statistical analyses were conducted using Qlik Sense (R) and GraphPad Prism. Results: The overall correct response rate of ChatGPT-4V for image -based JNDE questions was 35.0 %. The correct response rates were 57.1 % for compulsory questions, 43.6 % for general questions, and 28.6 % for clinical practical questions. In specialties like Dental Anesthesiology and Endodontics, ChatGPT-4V achieved correct response rates above 70 %, while response rates for Orthodontics and Oral Surgery were lower. A higher number of images in questions was correlated with lower accuracy, suggesting an impact of the number of images on correct and incorrect responses. Conclusion: While innovative, ChatGPT-4V's image recognition feature exhibited limitations, especially in handling image-intensive and complex clinical practical questions, and is not yet fully suitable as an educational support tool for dental students at its current stage. Further technological refinement and re-evaluation with a broader dataset are recommended. <feminine ordinal indicator> 2024 Association for Dental Sciences of the Republic of China. Publishing services by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/ licenses/by/4.0/).
引用
收藏
页码:1595 / 1600
页数:6
相关论文
共 13 条
[1]  
2023, Arxiv, DOI arXiv:2303.08774
[2]  
Azabu Dental Academy, 2023, Question booklet by times-116th Japanese national dental examination question booklet
[3]   How Does ChatGPT Perform on the United States Medical Licensing Examination (USMLE)? The Implications of Large Language Models for Medical Education and Knowledge Assessment [J].
Gilson, Aidan ;
Safranek, Conrad W. ;
Huang, Thomas ;
Socrates, Vimig ;
Chi, Ling ;
Taylor, Richard Andrew ;
Chartash, David .
JMIR MEDICAL EDUCATION, 2023, 9
[4]  
Kumari S, 2023, PREPRINT
[5]  
OpenAI, GPT 4V ISION SYSTEM
[6]   Performance of the Large Language Model ChatGPT on the National Nurse Examinations in Japan: Evaluation Study [J].
Taira, Kazuya ;
Itaya, Takahiro ;
Hanada, Ayame .
JMIR NURSING, 2023, 6
[7]   Performance of GPT-3.5 and GPT-4 on the Japanese Medical Licensing Examination: Comparison Study [J].
Takagi, Soshi ;
Watari, Takashi ;
Erabi, Ayano ;
Sakaguchi, Kota .
JMIR MEDICAL EDUCATION, 2023, 9
[8]  
The Ministry of Health Labour and Welfare of Japan, The guidelines for the Japanese national dental examination
[9]  
The Ministry of Health Labour and Welfare of Japan, Questions and correct answers for the 116th national dental examination
[10]  
The Ministry of Health Labour and Welfare of Japan, Exclusion of questions from the 116th national dental examination