Evaluating GPT-4's Cognitive Functions Through the Bloom Taxonomy: Insights and Clarifications

被引:1
作者
Herrmann-Werner, Anne [1 ,2 ]
Festl-Wietek, Teresa [1 ]
Holderried, Friederike [1 ,3 ]
Herschbach, Lea [1 ]
Griewatz, Jan [1 ]
Masters, Ken [4 ]
Zipfel, Stephan [2 ]
Mahling, Moritz [1 ,5 ]
机构
[1] Univ Tubingen, Tubingen Inst Med Educ, Fac Med, Elfriede Aulhorn Str 10, D-72076 Tubingen, Germany
[2] Univ Hosp Tubingen, Dept Psychosomat Med & Psychotherapy, Tubingen, Germany
[3] Univ Tubingen, Tubingen Univ Hosp, Dept Anesthesiol & Intens Care Med, Tubingen, Germany
[4] Sultan Qaboos Univ, Coll Med & Hlth Sci, Med Educ & Informat Dept, Muscat, Oman
[5] Univ Hosp Tubingen, Dept Diabetol Endocrinol Nephrol, Sect Nephrol & Hypertens, Tubingen, Germany
关键词
answer; artificial intelligence; assessment; Bloom's taxonomy; ChatGPT; classification; error; exam; examination; generative; GPT-4; Generative Pre-trained Transformer 4; language model; learning outcome; LLM; MCQ; medical education; medical exam; multiple-choice question; natural language processing; NLP; psychosomatic; question; response; taxonomy;
D O I
10.2196/57778
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
引用
收藏
页数:2
相关论文
共 50 条
[41]   Evaluating GPT-4-based ChatGPT's clinical potential on the NEJM quiz [J].
Daiju Ueda ;
Shannon L. Walston ;
Toshimasa Matsumoto ;
Ryo Deguchi ;
Hiroyuki Tatekawa ;
Yukio Miki .
BMC Digital Health, 2 (1)
[42]   Evaluating Large Language Model-Assisted Emergency Triage: A Comparison of Acuity Assessments by GPT-4 and Medical Experts [J].
Haim, Gal Ben ;
Saban, Mor ;
Barash, Yiftach ;
Cirulnik, David ;
Shaham, Amit ;
Eisenman, Ben Zion ;
Burshtein, Livnat ;
Mymon, Orly ;
Klang, Eyal .
JOURNAL OF CLINICAL NURSING, 2024,
[43]   Response accuracy of GPT-4 across languages: insights from an expert-level diagnostic radiology examination in Japan [J].
Harigai, Ayaka ;
Toyama, Yoshitaka ;
Nagano, Mitsutoshi ;
Abe, Mirei ;
Kawabata, Masahiro ;
Li, Li ;
Yamamura, Jin ;
Takase, Kei .
JAPANESE JOURNAL OF RADIOLOGY, 2025, 43 (02) :319-329
[44]   Cognitive Evaluation of Examinees by Dynamic Question Set Generation based on Bloom's Taxonomy [J].
Dutta, Anjan ;
Chatterjee, Punyasha ;
Dey, Nilanjan ;
Moreno-Ger, Pablo ;
Sen, Soumya .
IETE JOURNAL OF RESEARCH, 2024, 70 (03) :2570-2582
[45]   An Evaluation of the Algerian EFL Baccalaureate Exam under the Cognitive Domains of Bloom's Taxonomy [J].
Belarbi, Fatine Merieme ;
Bensafa, Abdelkader .
ARAB WORLD ENGLISH JOURNAL, 2020, 11 (04) :534-546
[46]   A Rule-based Approach in Bloom's Taxonomy Question Classification through Natural Language Processing [J].
Haris, Syahidah Sufi ;
Omar, Nazlia .
2012 7TH INTERNATIONAL CONFERENCE ON COMPUTING AND CONVERGENCE TECHNOLOGY (ICCCT2012), 2012, :410-414
[47]   Performance of GPT-4 on the American College of Radiology In-training Examination: Evaluating Accuracy, Model Drift, and Fine-tuning [J].
Payne, David L. ;
Purohit, Kush ;
Borrero, Walter Morales ;
Chung, Katherine ;
Hao, Max ;
Mpoy, Mutshipay ;
Jin, Michael ;
Prasanna, Prateek ;
Hill, Virginia .
ACADEMIC RADIOLOGY, 2024, 31 (07) :3046-3054
[48]   Evaluating Students' Open-ended Written Responses with LLMs: Using the RAG Framework for GPT-3.5, GPT-4, Claude-3, and Mistral-Large [J].
Jauhiainen, Jussi S. ;
Guerra, Agustin Garagorry .
ADVANCES IN ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, 2024, 4 (04) :3097-3113
[49]   AI as a Threat to Education: Contrasting GPT-3 and Google in Answering Questions Along Bloom's Taxonomy of Educational Objectives [J].
Li, Nina .
INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, INTELLISYS 2023, 2024, 822 :469-476
[50]   Evaluation of GPT-4's Chest X-Ray Impression Generation: A Reader Study on Performance and Perception [J].
Ziegelmayer, Sebastian ;
Marka, Alexander W. ;
Lenhart, Nicolas ;
Nehls, Nadja ;
Reischl, Stefan ;
Harder, Felix ;
Sauter, Andreas ;
Makowski, Marcus ;
Graf, Markus ;
Gawlitza, Joshua .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25