EVALUATING THE PERFORMANCE OF A LARGE LANGUAGE MODEL (LLM) COMPARED TO HUMANS IN A COMPLEX CATEGORIZATION TASK

被引:0
|
作者
Edema, C. [1 ]
Martin, A. [1 ]
Martin, C. [1 ]
Bertuzzi, A. [1 ]
King, E. [1 ]
Wesson, F. [1 ]
Witkowski, M. [1 ]
机构
[1] Crystallise, Stanford Hope, Essex, England
关键词
D O I
暂无
中图分类号
F [经济];
学科分类号
02 ;
摘要
MSR193
引用
收藏
页数:2
相关论文
共 50 条
  • [21] Synchronous Bilateral Breast Cancer: A Case Report Piloting and Evaluating the Implementation of the AI-Powered Large Language Model (LLM) ChatGPT
    Naik, Himani R.
    Prather, Andrew D.
    Gurda, Grzegorz T.
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (04)
  • [22] Evaluating Large Language Models for Enhanced Fuzzing: An Analysis Framework for LLM-Driven Seed Generation
    Black, Gavin
    Vaidyan, Varghese Mathew
    Comert, Gurcan
    IEEE ACCESS, 2024, 12 : 156065 - 156081
  • [24] Large language model (LLM)-driven chatbots for neuro-ophthalmic medical education
    Ethan Waisberg
    Joshua Ong
    Mouayad Masalkhi
    Andrew G. Lee
    Eye, 2024, 38 : 639 - 641
  • [25] LLM-CDM: A Large Language Model Enhanced Cognitive Diagnosis for Intelligent Education
    Chen, Xin
    Zhang, Jin
    Zhou, Tong
    Zhang, Feng
    IEEE ACCESS, 2025, 13 : 47165 - 47180
  • [26] Large language model (LLM)-driven chatbots for neuro-ophthalmic medical education
    Waisberg, Ethan
    Ong, Joshua
    Masalkhi, Mouayad
    Lee, Andrew G.
    EYE, 2024, 38 (04) : 639 - 641
  • [27] Evaluation of a novel large language model (LLM)-powered chatbot for oral boards scenarios
    Caitlin Silvestri
    Joshua Roshal
    Meghal Shah
    Warren D. Widmann
    Courtney Townsend
    Riley Brian
    Joseph C. L’Huillier
    Sergio M. Navarro
    Sarah Lund
    Tejas S. Sathe
    Global Surgical Education - Journal of the Association for Surgical Education, 3 (1):
  • [28] Software Engineering Education Must Adapt and Evolve for an LLM (Large Language Model) Environment
    Kirova, Vassilka D.
    Ku, Cyril S.
    Laracy, Joseph R.
    Marlowe, Thomas J.
    PROCEEDINGS OF THE 55TH ACM TECHNICAL SYMPOSIUM ON COMPUTER SCIENCE EDUCATION, SIGCSE 2024, VOL. 1, 2024, : 666 - 672
  • [29] Artificial intelligence and qualitative research: The promise and perils of large language model (LLM) 'assistance'
    Roberts, John
    Baker, Max
    Andrew, Jane
    CRITICAL PERSPECTIVES ON ACCOUNTING, 2024, 99
  • [30] Standardize clinical trials monitoring with Large Language Model (LLM)-enhanced FAQ management
    Lai, Jason K.
    Delporte, Nicolas
    Tung, Brian
    Zhang, Youshi
    Madu, Chisom
    Douletbekov, Daniyar
    Ruiz, Carlos Quezada
    Dai, Jian
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (07)