Evaluating the capabilities of large language models using machine learning tasks at inference-time

Cited by: 0

Authors:

Grm, Klemen [1]

Affiliations:

[1] Univ Ljubljani, Fak Elektrotehniko, Trzaska Cesta 25, Ljubljana 1000, Slovenia

Source:

ELEKTROTEHNISKI VESTNIK | 2023 / Vol. 90 / Issue 05

Keywords:

language models; machine learning; evaluation methodology

DOI:

Not available

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification codes:

0808; 0809

Abstract:

Machine learning is the domain of algorithms capable of learning from data to improve their performance on a task or set of tasks. Common machine learning tasks include classification, regression, and generative modelling. The most common modern example of machine learners in practical use is deep neural networks coupled with an extrinsic optimizer such as stochastic gradient descent. Recently, scaled-up large language models have shown increasing capabilities of in-context meta-learning, which has been used to improve their performance on language tasks through few-shot learning. In this paper, we show that pre-trained large language models can act as machine learners with regard to in-context data, without using extrinsic optimization tools or weight updates. By evaluating the language models' inference-time machine learning abilities on synthetic or appropriately transformed datasets, we conclusively show that they are able to model complex relationships between data in the input context. This implies that inference-time machine learning tasks represent a meaningful capability evaluation task for large language models.
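The abstract does not give the datasets, prompt format, or models used, so the following is only a minimal sketch of what an inference-time regression evaluation could look like, not the paper's protocol. All names (`make_linear_dataset`, `build_prompt`, `evaluate`, `oracle_complete`) and the `x = ..., y = ...` serialization are hypothetical. The language model is represented by any callable that maps a prompt string to a completion string; a rule-based stand-in is used here so the example runs without a model.

```python
import random


def make_linear_dataset(n, slope=3.0, intercept=2.0, seed=0):
    """Synthetic regression data: y = slope * x + intercept, integer x in [0, 20]."""
    rng = random.Random(seed)
    xs = [rng.randint(0, 20) for _ in range(n)]
    return [(x, slope * x + intercept) for x in xs]


def build_prompt(train, query_x):
    """Serialize the in-context examples plus one unanswered query as plain text."""
    lines = [f"x = {x}, y = {y:g}" for x, y in train]
    lines.append(f"x = {query_x}, y =")
    return "\n".join(lines)


def mean_absolute_error(predictions, targets):
    return sum(abs(p - t) for p, t in zip(predictions, targets)) / len(targets)


def evaluate(complete, train, test):
    """Score a text-completion callable on held-out points.

    The training pairs appear only in the prompt; no weights are updated.
    """
    preds = [float(complete(build_prompt(train, x)).strip()) for x, _ in test]
    return mean_absolute_error(preds, [y for _, y in test])


def oracle_complete(prompt):
    """Stand-in for a language model (assumption): applies the true rule
    y = 3x + 2 to the query on the last prompt line."""
    query_x = float(prompt.splitlines()[-1].split(",")[0].split("=")[1])
    return f" {3.0 * query_x + 2.0:g}"


data = make_linear_dataset(12)
train, test = data[:8], data[8:]
error = evaluate(oracle_complete, train, test)
```

Replacing `oracle_complete` with a wrapper around a real model's text-completion call would turn the held-out error into a capability score in the sense the abstract describes; a real wrapper would also need to handle completions that do not parse as numbers.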


Pages: 247-253

Page count: 7
