Evaluation of large language models for the classification of medical device software

被引:1
|
作者
Han, Yu [1 ]
Ceross, Aaron [1 ]
Bourgeois, Florence [2 ,3 ]
Savaget, Paulo [1 ]
Bergmann, Jeroen H. M. [1 ,2 ,4 ]
机构
[1] Univ Oxford, Dept Engn Sci, Old Rd Campus, Oxford OX3 7DQ, England
[2] Harvard Med Sch, Harvard MIT Ctr Regulatory Sci, Boston, MA 02115 USA
[3] Boston Childrens Hosp, Computat Hlth Informat Program CHIP, Boston, MA USA
[4] Univ Southern Denmark, Dept Technol & Innovat, Odense, Denmark
关键词
D O I
10.1007/s42242-024-00307-0
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
引用
收藏
页码:819 / 822
页数:4
相关论文
共 50 条
  • [1] Evaluation of large language models for the classification of medical device software
    Yu Han
    Aaron Ceross
    Florence Bourgeois
    Paulo Savaget
    Jeroen HMBergmann
    Bio-Design and Manufacturing, 2024, 7 (05) : 819 - 822
  • [2] Scientific Software Citation Intent Classification Using Large Language Models
    Istrate, Ana-Maria
    Fisher, Joshua
    Yang, Xinyu
    Moraw, Kara
    Li, Kai
    Li, Donghui
    Klein, Martin
    NATURAL SCIENTIFIC LANGUAGE PROCESSING AND RESEARCH KNOWLEDGE GRAPHS, NSLP 2024, 2024, 14770 : 80 - 99
  • [3] Autonomous medical evaluation for guideline adherence of large language models
    Fast, Dennis
    Adams, Lisa C.
    Busch, Felix
    Fallon, Conor
    Huppertz, Marc
    Siepmann, Robert
    Prucker, Philipp
    Bayerl, Nadine
    Truhn, Daniel
    Makowski, Marcus
    Löser, Alexander
    Bressem, Keno K.
    npj Digital Medicine, 2024, 7 (01)
  • [4] Unregulated large language models produce medical device-like output
    Weissman, Gary E.
    Mankowitz, Toni
    Kanter, Genevieve P.
    NPJ DIGITAL MEDICINE, 2025, 8 (01):
  • [5] Evaluation of large language models as a diagnostic aid for complex medical cases
    Rios-Hoyo, Alejandro
    Shan, Naing Lin
    Li, Anran
    Pearson, Alexander T.
    Pusztai, Lajos
    Howard, Frederick M.
    FRONTIERS IN MEDICINE, 2024, 11
  • [6] An Exploratory Evaluation of Large Language Models Using Empirical Software Engineering Tasks
    Liang, Wenjun
    Xiao, Guanping
    PROCEEDINGS OF THE 15TH ASIA-PACIFIC SYMPOSIUM ON INTERNETWARE, INTERNETWARE 2024, 2024, : 31 - 40
  • [7] Benchmarking medical large language models
    Bakhshandeh, Sadra
    NATURE REVIEWS BIOENGINEERING, 2023, 1 (08): : 543 - 543
  • [8] Evaluating large language models for software testing
    Li, Yihao
    Liu, Pan
    Wang, Haiyang
    Chu, Jie
    Wong, W. Eric
    COMPUTER STANDARDS & INTERFACES, 2025, 93
  • [9] Increased Software Security with Large Language Models
    Sagodi, Zoltan
    Hegedus, Peter
    Ferenc, Rudolf
    ERCIM NEWS, 2024, (139):
  • [10] Software Modeling Assistance with Large Language Models
    Ben Chaaben, Meriem
    ACM/IEEE 27TH INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS: COMPANION PROCEEDINGS, MODELS 2024, 2024, : 188 - 191