Had Enough of Experts? Quantitative Knowledge Retrieval From Large Language Models

被引:0
作者
Selby, David [1 ]
Iwashita, Yuichiro [1 ,2 ]
Spriestersbach, Kai [1 ]
Saad, Mohammad [1 ]
Bappert, Dennis [3 ]
Warrier, Archana [1 ]
Mukherjee, Sumantrak [1 ]
Kise, Koichi [1 ,2 ]
Vollmer, Sebastian [1 ,4 ]
机构
[1] DFKI GmbH, Kaiserslautern, Germany
[2] Osaka Metropolitan Univ, Osaka, Japan
[3] Amazon Web Serv, Seattle, WA USA
[4] Univ Kaiserslautern Landau, Kaiserslautern, Germany
关键词
Bayesian models; expert systems; large language models; missing data imputation; prior elicitation;
D O I
10.1002/sta4.70054
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Large language models (LLMs) have been extensively studied for their ability to generate convincing natural language sequences; however, their utility for quantitative information retrieval is less well understood. Here, we explore the feasibility of LLMs as a mechanism for quantitative knowledge retrieval to aid two data analysis tasks: elicitation of prior distributions for Bayesian models and imputation of missing data. We introduce a framework that leverages LLMs to enhance Bayesian workflows by eliciting expert-like prior knowledge and imputing missing data. Tested on diverse datasets, this approach can improve predictive accuracy and reduce data requirements, offering significant potential in healthcare, environmental science and engineering applications. We discuss the implications and challenges of treating LLMs as 'experts'.
引用
收藏
页数:17
相关论文
共 70 条
[1]  
Anonymous, 2023, Understanding the Effects of RLHF on LLM Generalisation and Diversity
[2]  
Barberio A., 2023, Large Language Models in Data Preparation: Opportunities and Challenges
[3]  
Bischl B, 2021, Arxiv, DOI arXiv:1708.03731
[4]  
Brilleman S.L., 2018, STANCON 2018
[5]  
Capstick A, 2025, Arxiv, DOI arXiv:2411.17284
[6]  
Chen YK, 2023, Arxiv, DOI arXiv:2311.14332
[7]  
Cheng Vincent, 2023, P 35 C COMPUTATIONAL, P188
[8]  
Cooke R.M., 1991, Experts in uncertainty
[9]   Review: A gentle introduction to imputation of missing values [J].
Donders, A. Rogier T. ;
van der Heijden, Geert J. M. G. ;
Stijnen, Theo ;
Moons, Karel G. M. .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2006, 59 (10) :1087-1091
[10]   Guidance on Expert Knowledge Elicitation in Food and Feed Safety Risk Assessment [J].
European Food Safety Authority .
EFSA JOURNAL, 2014, 12 (06)