Can ChatGPT Generate Acceptable Case-Based Multiple-Choice Questions for Medical School Anatomy Exams? A Pilot Study on Item Difficulty and Discrimination

被引:0
作者
Kiyak, Yavuz Selim [1 ]
Soylu, Ayse [2 ]
Coskun, Ozlem [1 ]
Budakoglu, Isil Irem [1 ]
Peker, Tuncay Veysel [2 ]
机构
[1] Gazi Univ, Fac Med, Dept Med Educ & Informat, Ankara, Turkiye
[2] Gazi Univ, Fac Med, Dept Anat, Ankara, Turkiye
关键词
anatomy; artificial intelligence; ChatGPT; medical education; multiple-choice questions; psychometrics; written assessment;
D O I
10.1002/ca.24271
中图分类号
R602 [外科病理学、解剖学]; R32 [人体形态学];
学科分类号
100101 ;
摘要
Developing high-quality multiple-choice questions (MCQs) for medical school exams is effortful and time-consuming. In this study, we investigated the ability of ChatGPT to generate case-based anatomy MCQs with acceptable levels of item difficulty and discrimination for medical school exams. We used ChatGPT to generate case-based anatomy MCQs for an endocrine and urogenital system exam based on a framework for artificial intelligence (AI)-assisted item generation. The questions were evaluated by experts, approved by the department, and administered to 502 second-year medical students (372 Turkish-language, 130 English-language). The items were analyzed to determine the discrimination and difficulty indices. The item discrimination indices ranged from 0.29 to 0.54, indicating acceptable differentiation between high- and low-performing students. All items in Turkish (six out of six) and five out of six in English met the higher discrimination threshold (>= 0.30) required for large-scale standardized tests. The item difficulty indices ranged from 0.41 to 0.89, most items falling within the moderate difficulty range (0.20-0.80). Therefore, it was concluded that ChatGPT can generate case-based anatomy MCQs with acceptable psychometric properties, offering a promising tool for medical educators. However, human expertise remains crucial for reviewing and refining AI-generated assessment items. Future research should explore AI-generated MCQs across various anatomy topics and investigate different AI models for question generation.
引用
收藏
页码:505 / 510
页数:6
相关论文
共 34 条
[1]   Comparative assessment of three AI platforms in answering USMLE Step 1 anatomy questions or identifying anatomical structures on radiographs [J].
Al-Khater, Khulood Mohammed Khalid .
CLINICAL ANATOMY, 2025, 38 (02) :186-199
[2]   Using artificial intelligence to create diverse and inclusive medical case vignettes for education [J].
Bakkum, Michiel J. ;
Hartjes, Marielle G. ;
Piet, Joost D. ;
Donker, Erik M. ;
Likic, Robert ;
Sanz, Emilio ;
de Ponti, Fabrizio ;
Verdonk, Petra ;
Richir, Milan C. ;
van Agtmael, Michiel A. ;
Tichelaar, Jelle .
BRITISH JOURNAL OF CLINICAL PHARMACOLOGY, 2024, 90 (03) :640-648
[3]   Creating Virtual Patients using Robots and Large Language Models: A Preliminary Study with Medical Students [J].
Borg, Alexander ;
Parodis, Ioannis ;
Skantze, Gabriel .
COMPANION OF THE 2024 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI 2024 COMPANION, 2024, :273-277
[4]   The integrated curriculum in medical education: AMEE Guide No. 96 [J].
Brauer, David G. ;
Ferguson, Kristi J. .
MEDICAL TEACHER, 2015, 37 (04) :312-322
[5]   Introducing AnatomyGPT: A customized artificial intelligence application for anatomical sciences education [J].
Collins, Bradley R. ;
Black, Erik W. ;
Rarey, Kyle E. .
CLINICAL ANATOMY, 2024, 37 (06) :661-669
[6]   Creating virtual patients using large language models: scalable, global, and low cost [J].
Cook, David A. .
MEDICAL TEACHER, 2025, 47 (01) :40-42
[7]   ChatGPT to generate clinical vignettes for teaching and multiple-choice questions for assessment: A randomized controlled experiment [J].
Coskun, Oezlem ;
Kiyak, Yavuz Selim ;
Budakoglu, Isil Irem .
MEDICAL TEACHER, 2025, 47 (02) :268-274
[8]  
Downing S.M., 2009, Assessment in health professions education
[9]   AI-based avatars are changing the way we learn and teach: benefits and challenges [J].
Fink, Maximilian C. ;
Robinson, Seth A. ;
Ertl, Bernhard .
FRONTIERS IN EDUCATION, 2024, 9
[10]  
Gierl M. J., 2021, ADV METHODS AUTOMATI, P1, DOI [10.4324/9781003025634, DOI 10.4324/9781003025634]