A Survey of Metrics to Enhance Training Dependability in Large Language Models

被引:0
|
作者
Fang, Wenyi [1 ]
Zhang, Hao [1 ]
Gong, Ziyu [1 ]
Zeng, Longbin [1 ]
Lu, Xuhui [1 ,2 ]
Liu, Biao [1 ]
Wu, Xiaoyu [1 ]
Zheng, Yang [1 ]
Hu, Zheng [1 ]
Zhang, Xun [1 ]
机构
[1] Huawei Technol Co Ltd, Shenzhen, Peoples R China
[2] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing, Peoples R China
来源
2023 IEEE 34TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS, ISSREW | 2023年
关键词
Large Language Model; Dependability; Monitoring Metric;
D O I
10.1109/ISSREW60843.2023.00071
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rapidly advancing field of artificial intelligence requires meticulous attention to the training and monitoring of large language models (LLMs). This paper offers a systematic analysis of existing metrics and introduces new ones, focusing on their theoretical underpinnings and practical implementations. We present empirical results and insights into the performance of selected metrics, elucidating the complex interplay of variables in the training process. Our comprehensive approach provides significant insights into LLM training, and promises to improve the dependability and efficiency of future models.
引用
收藏
页码:180 / 185
页数:6
相关论文
共 50 条
  • [11] A survey of large language models for healthcare: from data, technology, and applications to accountability and ethics
    He, Kai
    Mao, Rui
    Lin, Qika
    Ruan, Yucheng
    Lan, Xiang
    Feng, Mengling
    Cambria, Erik
    INFORMATION FUSION, 2025, 118
  • [12] Editing Personality For Large Language Models
    Mao, Shengyu
    Wang, Xiaohan
    Wang, Mengru
    Jiang, Yong
    Xie, Pengjun
    Huang, Fei
    Zhang, Ningyu
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT II, NLPCC 2024, 2025, 15360 : 241 - 254
  • [13] Applications of Large Language Models in Pathology
    Cheng, Jerome
    BIOENGINEERING-BASEL, 2024, 11 (04):
  • [14] Consumer segmentation with large language models
    Li, Yinan
    Liu, Ying
    Yu, Muran
    JOURNAL OF RETAILING AND CONSUMER SERVICES, 2025, 82
  • [15] Large Language Models in Cosmetic Dermatology
    Landau, Marina
    Kroumpouzos, George
    Goldust, Mohamad
    JOURNAL OF COSMETIC DERMATOLOGY, 2025, 24 (02)
  • [16] Large Language Models: A Guide for Radiologists
    Kim, Sunkyu
    Lee, Choong-kun
    Kim, Seung-seob
    KOREAN JOURNAL OF RADIOLOGY, 2024, 25 (02) : 126 - 133
  • [17] Attention heads of large language models
    Zheng, Zifan
    Wang, Yezhaohui
    Huang, Yuxin
    Song, Shichao
    Yang, Mingchuan
    Tang, Bo
    Xiong, Feiyu
    Li, Zhiyu
    PATTERNS, 2025, 6 (02):
  • [18] Quo Vadis ChatGPT? From large language models to Large Knowledge Models
    Venkatasubramanian, Venkat
    Chakraborty, Arijit
    COMPUTERS & CHEMICAL ENGINEERING, 2025, 192
  • [19] Qualitative metrics from the biomedical literature for evaluating large language models in clinical decision-making: a narrative review
    Ho, Cindy N.
    Tian, Tiffany
    Ayers, Alessandra T.
    Aaron, Rachel E.
    Phillips, Vidith
    Wolf, Risa M.
    Mathioudakis, Nestoras
    Dai, Tinglong
    Klonoff, David C.
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [20] Large language model for table processing: a survey
    Lu, Weizheng
    Zhang, Jing
    Fan, Ju
    Fu, Zihao
    Chen, Yueguo
    Du, Xiaoyong
    FRONTIERS OF COMPUTER SCIENCE, 2025, 19 (02)