Evaluating GPT-4's Cognitive Functions Through the Bloom Taxonomy: Insights and Clarifications

Cited by: 1
Authors
Herrmann-Werner, Anne [1, 2]
Festl-Wietek, Teresa [1]
Holderried, Friederike [1, 3]
Herschbach, Lea [1]
Griewatz, Jan [1]
Masters, Ken [4]
Zipfel, Stephan [2]
Mahling, Moritz [1, 5]
Affiliations
[1] Univ Tubingen, Tubingen Inst Med Educ, Fac Med, Elfriede Aulhorn Str 10, D-72076 Tubingen, Germany
[2] Univ Hosp Tubingen, Dept Psychosomat Med & Psychotherapy, Tubingen, Germany
[3] Univ Tubingen, Tubingen Univ Hosp, Dept Anesthesiol & Intens Care Med, Tubingen, Germany
[4] Sultan Qaboos Univ, Coll Med & Hlth Sci, Med Educ & Informat Dept, Muscat, Oman
[5] Univ Hosp Tubingen, Dept Diabetol Endocrinol Nephrol, Sect Nephrol & Hypertens, Tubingen, Germany
Keywords
answer; artificial intelligence; assessment; Bloom's taxonomy; ChatGPT; classification; error; exam; examination; generative; GPT-4; Generative Pre-trained Transformer 4; language model; learning outcome; LLM; MCQ; medical education; medical exam; multiple-choice question; natural language processing; NLP; psychosomatic; question; response; taxonomy
DOI
10.2196/57778
Chinese Library Classification (CLC) number
R19 [Health care organizations and services (health services administration)]
Pages: 2
Related papers
50 in total
  • [1] Evaluating GPT-4's Cognitive Functions Through the Bloom Taxonomy: Insights and Clarifications
    Huang, Kuan-Ju
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26
  • [2] Is GPT-4 a reliable rater? Evaluating consistency in GPT-4's text ratings
    Hackl, Veronika
    Mueller, Alexandra Elena
    Granitzer, Michael
    Sailer, Maximilian
    FRONTIERS IN EDUCATION, 2023, 8
  • [3] Evaluating Large Language Models for the National Premedical Exam in India: Comparative Analysis of GPT-3.5, GPT-4, and Bard
    Farhat, Faiza
    Chaudhry, Beenish Moalla
    Nadeem, Mohammad
    Sohail, Shahab Saquib
    Madsen, Dag Oivind
    JMIR MEDICAL EDUCATION, 2024, 10
  • [4] Re-evaluating GPT-4's bar exam performance
    Martinez, Eric
    ARTIFICIAL INTELLIGENCE AND LAW, 2024
  • [5] Evaluating GPT-4's proficiency in addressing cryptography examinations
    Mikhalev, Vasily
    Kopal, Nils
    Esslinger, Bernhard
    CRYPTOLOGIA, 2025, 49 (02) : 170 - 185
  • [6] Assessing GPT-4's Performance in Delivering Medical Advice: Comparative Analysis With Human Experts
    Jo, Eunbeen
    Song, Sanghoun
    Kim, Jong-Ho
    Lim, Subin
    Kim, Ju Hyeon
    Cha, Jung-Joon
    Kim, Young-Min
    Joo, Hyung Joon
    JMIR MEDICAL EDUCATION, 2024, 10
  • [7] Mind meets machine: Unravelling GPT-4’s cognitive psychology
    Dhingra S.
    Singh M.
    S.B. V.
    Malviya N.
    Gill S.S.
    BenchCouncil Transactions on Benchmarks, Standards and Evaluations, 2023, 3 (03)
  • [8] Evaluating the performance of GPT-3.5, GPT-4, and GPT-4o in the Chinese National Medical Licensing Examination
    Luo, Dingyuan
    Liu, Mengke
    Yu, Runyuan
    Liu, Yulian
    Jiang, Wenjun
    Fan, Qi
    Kuang, Naifeng
    Gao, Qiang
    Yin, Tao
    Zheng, Zuncheng
    SCIENTIFIC REPORTS, 2025, 15 (01)
  • [9] Evaluating Bard Gemini Pro and GPT-4 Vision Against Student Performance in Medical Visual Question Answering: Comparative Case Study
    Roos, Jonas
    Martin, Ron
    Kaczmarczyk, Robert
    JMIR FORMATIVE RESEARCH, 2024, 8
  • [10] Evaluating the Utility of OpenAI's GPT-4 as a Diagnostic and Management Aid in Medicine
    Ge, Alan
    Pandya, Vidish
    Ferrick, Kevin J.
    Krumerman, Andrew
    CIRCULATION, 2023, 148