EVALUATING THE PERFORMANCE OF A LARGE LANGUAGE MODEL (LLM) COMPARED TO HUMANS IN A COMPLEX CATEGORIZATION TASK

Authors
Edema, C. [1]
Martin, A. [1]
Martin, C. [1]
Bertuzzi, A. [1]
King, E. [1]
Wesson, F. [1]
Witkowski, M. [1]
Affiliations
[1] Crystallise, Stanford Hope, Essex, England
DOI
Not available
Chinese Library Classification (CLC) number
F [Economics]
Discipline classification code
02
Abstract
MSR193
Pages: 2