A Survey of Metrics to Enhance Training Dependability in Large Language Models

被引：0

作者：

Fang, Wenyi ^{[1
]}

Zhang, Hao ^{[1
]}

Gong, Ziyu ^{[1
]}

Zeng, Longbin ^{[1
]}

Lu, Xuhui ^{[1
,2
]}

Liu, Biao ^{[1
]}

Wu, Xiaoyu ^{[1
]}

Zheng, Yang ^{[1
]}

Hu, Zheng ^{[1
]}

Zhang, Xun ^{[1
]}

机构：

[1] Huawei Technol Co Ltd, Shenzhen, Peoples R China

[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing, Peoples R China

来源：

2023 IEEE 34TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS, ISSREW | 2023年

关键词：

Large Language Model; Dependability; Monitoring Metric;

D O I：

10.1109/ISSREW60843.2023.00071

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The rapidly advancing field of artificial intelligence requires meticulous attention to the training and monitoring of large language models (LLMs). This paper offers a systematic analysis of existing metrics and introduces new ones, focusing on their theoretical underpinnings and practical implementations. We present empirical results and insights into the performance of selected metrics, elucidating the complex interplay of variables in the training process. Our comprehensive approach provides significant insights into LLM training, and promises to improve the dependability and efficiency of future models.

引用

页码：180 / 185

页数：6

共 50 条

[31] On the Effectiveness of Large Language Models for GitHub Workflows
Zhang, Xinyu
Muralee, Siddharth
Cherupattamoolayil, Sourag
Machiry, Aravind
19TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY, AND SECURITY, ARES 2024, 2024,
[32] Harnessing Large Language Models for Chart Review
Xu, Dongchu
Cunningham, Jonathan W.
JOURNAL OF THE AMERICAN HEART ASSOCIATION, 2025, 14 (07):
[33] On the Capacity of Citation Generation by Large Language Models
Qian, Haosheng
Fan, Yixing
Zhang, Ruqing
Guo, Jiafeng
INFORMATION RETRIEVAL, CCIR 2024, 2025, 15418 : 109 - 123
[34] Robustness of large language models in moral judgements
Oh, Soyoung
Demberg, Vera
ROYAL SOCIETY OPEN SCIENCE, 2025, 12 (04):
[35] Environmental impact of large language models in medicine
Kleinig, Oliver
Sinhal, Shreyans
Khurram, Rushan
Gao, Christina
Spajic, Luke
Zannettino, Andrew
Schnitzler, Margaret
Guo, Christina
Zaman, Sarah
Smallbone, Harry
Ittimani, Mana
Chan, Weng Onn
Stretton, Brandon
Godber, Harry
Chan, Justin
Turner, Richard C.
Warren, Leigh R.
Clarke, Jonathan
Sivagangabalan, Gopal
Marshall-Webb, Matthew
Moseley, Genevieve
Driscoll, Simon
Kovoor, Pramesh
Chow, Clara K.
Luo, Yuchen
Thiagalingam, Aravinda
Zaka, Ammar
Gould, Paul
Ramponi, Fabio
Gupta, Aashray
Kovoor, Joshua G.
Bacchi, Stephen
INTERNAL MEDICINE JOURNAL, 2024, 54 (12) : 2083 - 2086
[36] Applying Large Language Models to Issue Classification
Aracena, Gabriel
Luster, Kyle
Santos, Fabio
Steinmacher, Igor
Gerosa, Marco Aurelio
PROCEEDINGS 2024 ACM/IEEE INTERNATIONAL WORKSHOP ON NL-BASED SOFTWARE ENGINEERING, NLBSE 2024, 2024, : 57 - 60
[37] Safety of Large Language Models in Addressing Depression
Heston, Thomas F.
CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (12)
[38] Benchmarking DNA large language models on quadruplexes
Cherednichenko, Oleksandr
Herbert, Alan
Poptsova, Maria
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2025, 27 : 992 - 1000
[39] Ontologies in the era of large language models - a perspective
Neuhaus, Fabian
APPLIED ONTOLOGY, 2023, 18 (04) : 399 - 407
[40] Large Language Models: A Comprehensive Guide for Radiologists
Kim, Sunkyu
Lee, Choong-kun
Kim, Seung-seob
JOURNAL OF THE KOREAN SOCIETY OF RADIOLOGY, 2024, 85 (05): : 861 - 882

← 1 2 3 4 5 →