A Survey of Metrics to Enhance Training Dependability in Large Language Models

Cited by: 0
Authors
Fang, Wenyi [1]
Zhang, Hao [1]
Gong, Ziyu [1]
Zeng, Longbin [1]
Lu, Xuhui [1, 2]
Liu, Biao [1]
Wu, Xiaoyu [1]
Zheng, Yang [1]
Hu, Zheng [1]
Zhang, Xun [1]
Affiliations
[1] Huawei Technologies Co., Ltd., Shenzhen, People's Republic of China
[2] Beihang University, School of Automation Science and Electrical Engineering, Beijing, People's Republic of China
Source
2023 IEEE 34TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING WORKSHOPS, ISSREW | 2023
Keywords
Large Language Model; Dependability; Monitoring Metric
DOI
10.1109/ISSREW60843.2023.00071
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The rapidly advancing field of artificial intelligence requires meticulous attention to the training and monitoring of large language models (LLMs). This paper offers a systematic analysis of existing metrics and introduces new ones, focusing on their theoretical underpinnings and practical implementations. We present empirical results and insights into the performance of selected metrics, elucidating the complex interplay of variables in the training process. Our comprehensive approach provides significant insights into LLM training, and promises to improve the dependability and efficiency of future models.
Pages: 180-185
Number of pages: 6