EVALUATING THE PERFORMANCE OF A LARGE LANGUAGE MODEL (LLM) COMPARED TO HUMANS IN A COMPLEX CATEGORIZATION TASK

Cited by: 0
Authors
Edema, C. [1]
Martin, A. [1]
Martin, C. [1]
Bertuzzi, A. [1]
King, E. [1]
Wesson, F. [1]
Witkowski, M. [1]
Affiliation
[1] Crystallise, Stanford Hope, Essex, England
Keywords
DOI
Not available
Chinese Library Classification number
F [Economics];
Discipline classification code
02;
Abstract
MSR193
Pages: 2