A Survey of Metrics to Enhance Training Dependability in Large Language Models

被引：0

作者：

Fang, Wenyi ^{[1
]}

Zhang, Hao ^{[1
]}

Gong, Ziyu ^{[1
]}

Zeng, Longbin ^{[1
]}

Lu, Xuhui ^{[1
,2
]}

Liu, Biao ^{[1
]}

Wu, Xiaoyu ^{[1
]}

Zheng, Yang ^{[1
]}

Hu, Zheng ^{[1
]}

Zhang, Xun ^{[1
]}

机构：

[1] Huawei Technol Co Ltd, Shenzhen, Peoples R China

[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing, Peoples R China

来源：

2023 IEEE 34TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS, ISSREW | 2023年

关键词：

Large Language Model; Dependability; Monitoring Metric;

D O I：

10.1109/ISSREW60843.2023.00071

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The rapidly advancing field of artificial intelligence requires meticulous attention to the training and monitoring of large language models (LLMs). This paper offers a systematic analysis of existing metrics and introduces new ones, focusing on their theoretical underpinnings and practical implementations. We present empirical results and insights into the performance of selected metrics, elucidating the complex interplay of variables in the training process. Our comprehensive approach provides significant insights into LLM training, and promises to improve the dependability and efficiency of future models.

引用

页码：180 / 185

页数：6

共 50 条

[11] A survey of large language models for healthcare: from data, technology, and applications to accountability and ethics
He, Kai
Mao, Rui
Lin, Qika
Ruan, Yucheng
Lan, Xiang
Feng, Mengling
Cambria, Erik
INFORMATION FUSION, 2025, 118
[12] Editing Personality For Large Language Models
Mao, Shengyu
Wang, Xiaohan
Wang, Mengru
Jiang, Yong
Xie, Pengjun
Huang, Fei
Zhang, Ningyu
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT II, NLPCC 2024, 2025, 15360 : 241 - 254
[13] Applications of Large Language Models in Pathology
Cheng, Jerome
BIOENGINEERING-BASEL, 2024, 11 (04):
[14] Consumer segmentation with large language models
Li, Yinan
Liu, Ying
Yu, Muran
JOURNAL OF RETAILING AND CONSUMER SERVICES, 2025, 82
[15] Large Language Models in Cosmetic Dermatology
Landau, Marina
Kroumpouzos, George
Goldust, Mohamad
JOURNAL OF COSMETIC DERMATOLOGY, 2025, 24 (02)
[16] Large Language Models: A Guide for Radiologists
Kim, Sunkyu
Lee, Choong-kun
Kim, Seung-seob
KOREAN JOURNAL OF RADIOLOGY, 2024, 25 (02) : 126 - 133
[17] Attention heads of large language models
Zheng, Zifan
Wang, Yezhaohui
Huang, Yuxin
Song, Shichao
Yang, Mingchuan
Tang, Bo
Xiong, Feiyu
Li, Zhiyu
PATTERNS, 2025, 6 (02):
[18] Quo Vadis ChatGPT? From large language models to Large Knowledge Models
Venkatasubramanian, Venkat
Chakraborty, Arijit
COMPUTERS & CHEMICAL ENGINEERING, 2025, 192
[19] Qualitative metrics from the biomedical literature for evaluating large language models in clinical decision-making: a narrative review
Ho, Cindy N.
Tian, Tiffany
Ayers, Alessandra T.
Aaron, Rachel E.
Phillips, Vidith
Wolf, Risa M.
Mathioudakis, Nestoras
Dai, Tinglong
Klonoff, David C.
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
[20] Large language model for table processing: a survey
Lu, Weizheng
Zhang, Jing
Fan, Ju
Fu, Zihao
Chen, Yueguo
Du, Xiaoyong
FRONTIERS OF COMPUTER SCIENCE, 2025, 19 (02)

← 1 2 3 4 5 →