A Survey of Metrics to Enhance Training Dependability in Large Language Models

被引:0
|
作者
Fang, Wenyi [1 ]
Zhang, Hao [1 ]
Gong, Ziyu [1 ]
Zeng, Longbin [1 ]
Lu, Xuhui [1 ,2 ]
Liu, Biao [1 ]
Wu, Xiaoyu [1 ]
Zheng, Yang [1 ]
Hu, Zheng [1 ]
Zhang, Xun [1 ]
机构
[1] Huawei Technol Co Ltd, Shenzhen, Peoples R China
[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing, Peoples R China
来源
2023 IEEE 34TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS, ISSREW | 2023年
关键词
Large Language Model; Dependability; Monitoring Metric;
D O I
10.1109/ISSREW60843.2023.00071
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rapidly advancing field of artificial intelligence requires meticulous attention to the training and monitoring of large language models (LLMs). This paper offers a systematic analysis of existing metrics and introduces new ones, focusing on their theoretical underpinnings and practical implementations. We present empirical results and insights into the performance of selected metrics, elucidating the complex interplay of variables in the training process. Our comprehensive approach provides significant insights into LLM training, and promises to improve the dependability and efficiency of future models.
引用
收藏
页码:180 / 185
页数:6
相关论文
共 50 条
  • [31] On the Effectiveness of Large Language Models for GitHub Workflows
    Zhang, Xinyu
    Muralee, Siddharth
    Cherupattamoolayil, Sourag
    Machiry, Aravind
    19TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY, AND SECURITY, ARES 2024, 2024,
  • [32] Harnessing Large Language Models for Chart Review
    Xu, Dongchu
    Cunningham, Jonathan W.
    JOURNAL OF THE AMERICAN HEART ASSOCIATION, 2025, 14 (07):
  • [33] On the Capacity of Citation Generation by Large Language Models
    Qian, Haosheng
    Fan, Yixing
    Zhang, Ruqing
    Guo, Jiafeng
    INFORMATION RETRIEVAL, CCIR 2024, 2025, 15418 : 109 - 123
  • [34] Robustness of large language models in moral judgements
    Oh, Soyoung
    Demberg, Vera
    ROYAL SOCIETY OPEN SCIENCE, 2025, 12 (04):
  • [35] Environmental impact of large language models in medicine
    Kleinig, Oliver
    Sinhal, Shreyans
    Khurram, Rushan
    Gao, Christina
    Spajic, Luke
    Zannettino, Andrew
    Schnitzler, Margaret
    Guo, Christina
    Zaman, Sarah
    Smallbone, Harry
    Ittimani, Mana
    Chan, Weng Onn
    Stretton, Brandon
    Godber, Harry
    Chan, Justin
    Turner, Richard C.
    Warren, Leigh R.
    Clarke, Jonathan
    Sivagangabalan, Gopal
    Marshall-Webb, Matthew
    Moseley, Genevieve
    Driscoll, Simon
    Kovoor, Pramesh
    Chow, Clara K.
    Luo, Yuchen
    Thiagalingam, Aravinda
    Zaka, Ammar
    Gould, Paul
    Ramponi, Fabio
    Gupta, Aashray
    Kovoor, Joshua G.
    Bacchi, Stephen
    INTERNAL MEDICINE JOURNAL, 2024, 54 (12) : 2083 - 2086
  • [36] Applying Large Language Models to Issue Classification
    Aracena, Gabriel
    Luster, Kyle
    Santos, Fabio
    Steinmacher, Igor
    Gerosa, Marco Aurelio
    PROCEEDINGS 2024 ACM/IEEE INTERNATIONAL WORKSHOP ON NL-BASED SOFTWARE ENGINEERING, NLBSE 2024, 2024, : 57 - 60
  • [37] Safety of Large Language Models in Addressing Depression
    Heston, Thomas F.
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (12)
  • [38] Benchmarking DNA large language models on quadruplexes
    Cherednichenko, Oleksandr
    Herbert, Alan
    Poptsova, Maria
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2025, 27 : 992 - 1000
  • [39] Ontologies in the era of large language models - a perspective
    Neuhaus, Fabian
    APPLIED ONTOLOGY, 2023, 18 (04) : 399 - 407
  • [40] Large Language Models: A Comprehensive Guide for Radiologists
    Kim, Sunkyu
    Lee, Choong-kun
    Kim, Seung-seob
    JOURNAL OF THE KOREAN SOCIETY OF RADIOLOGY, 2024, 85 (05): : 861 - 882