Evaluating GPT-4's Cognitive Functions Through the Bloom Taxonomy: Insights and Clarifications

被引：1

作者：

Herrmann-Werner, Anne ^{[1
,2
]}

Festl-Wietek, Teresa ^{[1
]}

Holderried, Friederike ^{[1
,3
]}

Herschbach, Lea ^{[1
]}

Griewatz, Jan ^{[1
]}

Masters, Ken ^{[4
]}

Zipfel, Stephan ^{[2
]}

Mahling, Moritz ^{[1
,5
]}

机构：

[1] Univ Tubingen, Tubingen Inst Med Educ, Fac Med, Elfriede Aulhorn Str 10, D-72076 Tubingen, Germany

[2] Univ Hosp Tubingen, Dept Psychosomat Med & Psychotherapy, Tubingen, Germany

[3] Univ Tubingen, Tubingen Univ Hosp, Dept Anesthesiol & Intens Care Med, Tubingen, Germany

[4] Sultan Qaboos Univ, Coll Med & Hlth Sci, Med Educ & Informat Dept, Muscat, Oman

[5] Univ Hosp Tubingen, Dept Diabetol Endocrinol Nephrol, Sect Nephrol & Hypertens, Tubingen, Germany

来源：

JOURNAL OF MEDICAL INTERNET RESEARCH | 2024年 / 26卷

关键词：

answer; artificial intelligence; assessment; Bloom's taxonomy; ChatGPT; classification; error; exam; examination; generative; GPT-4; Generative Pre-trained Transformer 4; language model; learning outcome; LLM; MCQ; medical education; medical exam; multiple-choice question; natural language processing; NLP; psychosomatic; question; response; taxonomy;

D O I：

10.2196/57778

中图分类号：

R19 [保健组织与事业（卫生事业管理）];

学科分类号：

摘要：

引用

收藏

页数：2

相关论文

共 50 条

[41] Evaluating GPT-4-based ChatGPT's clinical potential on the NEJM quiz [J].

Daiju Ueda ;

Shannon L. Walston ;

Toshimasa Matsumoto ;

Ryo Deguchi ;

Hiroyuki Tatekawa ;

Yukio Miki .

BMC Digital Health, 2 (1)

[42] Evaluating Large Language Model-Assisted Emergency Triage: A Comparison of Acuity Assessments by GPT-4 and Medical Experts [J].

Haim, Gal Ben ;

Saban, Mor ;

Barash, Yiftach ;

Cirulnik, David ;

Shaham, Amit ;

Eisenman, Ben Zion ;

Burshtein, Livnat ;

Mymon, Orly ;

Klang, Eyal .

JOURNAL OF CLINICAL NURSING, 2024,

[43] Response accuracy of GPT-4 across languages: insights from an expert-level diagnostic radiology examination in Japan [J].

Harigai, Ayaka ;

Toyama, Yoshitaka ;

Nagano, Mitsutoshi ;

Abe, Mirei ;

Kawabata, Masahiro ;

Li, Li ;

Yamamura, Jin ;

Takase, Kei .

JAPANESE JOURNAL OF RADIOLOGY, 2025, 43 (02) :319-329

[44] Cognitive Evaluation of Examinees by Dynamic Question Set Generation based on Bloom's Taxonomy [J].

Dutta, Anjan ;

Chatterjee, Punyasha ;

Dey, Nilanjan ;

Moreno-Ger, Pablo ;

Sen, Soumya .

IETE JOURNAL OF RESEARCH, 2024, 70 (03) :2570-2582

[45] An Evaluation of the Algerian EFL Baccalaureate Exam under the Cognitive Domains of Bloom's Taxonomy [J].

Belarbi, Fatine Merieme ;

Bensafa, Abdelkader .

ARAB WORLD ENGLISH JOURNAL, 2020, 11 (04) :534-546

[46] A Rule-based Approach in Bloom's Taxonomy Question Classification through Natural Language Processing [J].

Haris, Syahidah Sufi ;

Omar, Nazlia .

2012 7TH INTERNATIONAL CONFERENCE ON COMPUTING AND CONVERGENCE TECHNOLOGY (ICCCT2012), 2012, :410-414

[47] Performance of GPT-4 on the American College of Radiology In-training Examination: Evaluating Accuracy, Model Drift, and Fine-tuning [J].

Payne, David L. ;

Purohit, Kush ;

Borrero, Walter Morales ;

Chung, Katherine ;

Hao, Max ;

Mpoy, Mutshipay ;

Jin, Michael ;

Prasanna, Prateek ;

Hill, Virginia .

ACADEMIC RADIOLOGY, 2024, 31 (07) :3046-3054

[48] Evaluating Students' Open-ended Written Responses with LLMs: Using the RAG Framework for GPT-3.5, GPT-4, Claude-3, and Mistral-Large [J].

Jauhiainen, Jussi S. ;

Guerra, Agustin Garagorry .

ADVANCES IN ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING, 2024, 4 (04) :3097-3113

[49] AI as a Threat to Education: Contrasting GPT-3 and Google in Answering Questions Along Bloom's Taxonomy of Educational Objectives [J].

Li, Nina .

INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, INTELLISYS 2023, 2024, 822 :469-476

[50] Evaluation of GPT-4's Chest X-Ray Impression Generation: A Reader Study on Performance and Perception [J].

Ziegelmayer, Sebastian ;

Marka, Alexander W. ;

Lenhart, Nicolas ;

Nehls, Nadja ;

Reischl, Stefan ;

Harder, Felix ;

Sauter, Andreas ;

Makowski, Marcus ;

Graf, Markus ;

Gawlitza, Joshua .

JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25

← 1 2 3 4 5 →