Evaluating GPT-4's Cognitive Functions Through the Bloom Taxonomy: Insights and Clarifications

Cited by: 1

Authors:

Herrmann-Werner, Anne [1,2]

Festl-Wietek, Teresa [1]

Holderried, Friederike [1,3]

Herschbach, Lea [1]

Griewatz, Jan [1]

Masters, Ken [4]

Zipfel, Stephan [2]

Mahling, Moritz [1,5]

Affiliations:

[1] Univ Tubingen, Tubingen Inst Med Educ, Fac Med, Elfriede Aulhorn Str 10, D-72076 Tubingen, Germany

[2] Univ Hosp Tubingen, Dept Psychosomat Med & Psychotherapy, Tubingen, Germany

[3] Univ Tubingen, Tubingen Univ Hosp, Dept Anesthesiol & Intens Care Med, Tubingen, Germany

[4] Sultan Qaboos Univ, Coll Med & Hlth Sci, Med Educ & Informat Dept, Muscat, Oman

[5] Univ Hosp Tubingen, Dept Diabetol Endocrinol Nephrol, Sect Nephrol & Hypertens, Tubingen, Germany

Source:

JOURNAL OF MEDICAL INTERNET RESEARCH | 2024 / Vol. 26

Keywords:

answer; artificial intelligence; assessment; Bloom's taxonomy; ChatGPT; classification; error; exam; examination; generative; GPT-4; Generative Pre-trained Transformer 4; language model; learning outcome; LLM; MCQ; medical education; medical exam; multiple-choice question; natural language processing; NLP; psychosomatic; question; response; taxonomy

DOI:

10.2196/57778

Chinese Library Classification (CLC) number:

R19 [Health care organization and services (health services administration)]

Subject classification code:

Abstract:

Pages: 2

Total: 50 items

[1] Evaluating GPT-4's Cognitive Functions Through the Bloom Taxonomy: Insights and Clarifications
Huang, Kuan-Ju
JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26
[2] Is GPT-4 a reliable rater? Evaluating consistency in GPT-4's text ratings
Hackl, Veronika
Mueller, Alexandra Elena
Granitzer, Michael
Sailer, Maximilian
FRONTIERS IN EDUCATION, 2023, 8
[3] Evaluating Large Language Models for the National Premedical Exam in India: Comparative Analysis of GPT-3.5, GPT-4, and Bard
Farhat, Faiza
Chaudhry, Beenish Moalla
Nadeem, Mohammad
Sohail, Shahab Saquib
Madsen, Dag Oivind
JMIR MEDICAL EDUCATION, 2024, 10
[4] Re-evaluating GPT-4's bar exam performance
Martinez, Eric
ARTIFICIAL INTELLIGENCE AND LAW, 2024
[5] Evaluating GPT-4's proficiency in addressing cryptography examinations
Mikhalev, Vasily
Kopal, Nils
Esslinger, Bernhard
CRYPTOLOGIA, 2025, 49 (02) : 170 - 185
[6] Assessing GPT-4's Performance in Delivering Medical Advice: Comparative Analysis With Human Experts
Jo, Eunbeen
Song, Sanghoun
Kim, Jong-Ho
Lim, Subin
Kim, Ju Hyeon
Cha, Jung-Joon
Kim, Young-Min
Joo, Hyung Joon
JMIR MEDICAL EDUCATION, 2024, 10
[7] Mind meets machine: Unravelling GPT-4's cognitive psychology
Dhingra S.
Singh M.
S.B. V.
Malviya N.
Gill S.S.
BenchCouncil Transactions on Benchmarks, Standards and Evaluations, 2023, 3 (03)
[8] Evaluating the performance of GPT-3.5, GPT-4, and GPT-4o in the Chinese National Medical Licensing Examination
Luo, Dingyuan
Liu, Mengke
Yu, Runyuan
Liu, Yulian
Jiang, Wenjun
Fan, Qi
Kuang, Naifeng
Gao, Qiang
Yin, Tao
Zheng, Zuncheng
SCIENTIFIC REPORTS, 2025, 15 (01)
[9] Evaluating Bard Gemini Pro and GPT-4 Vision Against Student Performance in Medical Visual Question Answering: Comparative Case Study
Roos, Jonas
Martin, Ron
Kaczmarczyk, Robert
JMIR FORMATIVE RESEARCH, 2024, 8
[10] Evaluating the Utility of OpenAI's GPT-4 as a Diagnostic and Management Aid in Medicine
Ge, Alan
Pandya, Vidish
Ferrick, Kevin J.
Krumerman, Andrew
CIRCULATION, 2023, 148