Evaluating the capabilities of large language models using machine learning tasks at inference-time

被引:0
|
作者
Grm, Klemen [1 ]
机构
[1] Univ Ljubljani, Fak Elektrotehniko, Trzaska Cesta 25, Ljubljana 1000, Slovenia
来源
ELEKTROTEHNISKI VESTNIK | 2023年 / 90卷 / 05期
关键词
language models; machine learning; evaluation methodology;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Machine learning is the domain of algorithms capable of learning from data to improve their performance on a task or set of tasks. Common machine learning tasks include classification, regression, and generative modelling. The most common modern example of machine learners in practical use is deep neural networks coupled with an extrinsic optimizer such as stochastic gradient descent. Recently, scaled-up large language models have shown increasing capabilities of in-context meta-learning, which has been used to improve their performance on language tasks through few-shot learning. In this paper, we show that pre-trained large language models can act as machine learners with regard to in-context data, without using extrinsic optimization tools or weight updates. By evaluating the language models' inference time machine learning abilities on synthetic or appropriately transformed datasets, we conclusively show that they're able to model complex relationships between data in the input context. This implies that inference-time machine learning tasks represent a meaningful capability evaluation task for large language models.
引用
收藏
页码:247 / 253
页数:7
相关论文
共 50 条
  • [31] EVALUATING MACHINE LEARNING MODELS FOR HEALTHCARE SERVICES EFFICIENCY
    Sukiasyan, Ani
    CADERNOS EDUCACAO TECNOLOGIA E SOCIEDADE, 2023, 16 (04): : 1280 - 1289
  • [32] Efficient Bayesian inference using adversarial machine learning and low-complexity surrogate models
    Na, Jonggeol
    Bak, Ji Hyun
    Sahinidis, Nikolaos V.
    COMPUTERS & CHEMICAL ENGINEERING, 2021, 151
  • [33] A new algorithm for time series prediction using machine learning models
    Yeturu Jahnavi
    Poongothai Elango
    S. P. Raja
    Javier Parra Fuente
    Elena Verdú
    Evolutionary Intelligence, 2023, 16 : 1449 - 1460
  • [34] Integrating Knowledge Graph Data with Large Language Models for Explainable Inference
    Efrain Quintero-Narvaez, Carlos
    Monroy, Raul
    PROCEEDINGS OF THE 17TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, WSDM 2024, 2024, : 1198 - 1199
  • [35] Using Machine Learning Techniques for Evaluating the Similarity of Enterprise Architecture Models Technical Paper
    Borozanov, Vasil
    Hacks, Simon
    Silva, Nuno
    ADVANCED INFORMATION SYSTEMS ENGINEERING (CAISE 2019), 2019, 11483 : 563 - 578
  • [36] From statistics to deep learning: Using large language models in psychiatric research
    Hua, Yining
    Beam, Andrew
    Chibnik, Lori B.
    Torous, John
    INTERNATIONAL JOURNAL OF METHODS IN PSYCHIATRIC RESEARCH, 2025, 34 (01)
  • [37] Machine Learning Models Applied in Sign Language Recognition
    Novillo Quinde, Esteban Gustavo
    Saldana Torres, Juan Pablo
    Alvarez Valdez, Michael Andres
    Llivicota Leon, John Santiago
    Hurtado Ortiz, Remigio Ismael
    PATTERN RECOGNITION, MCPR 2023, 2023, 13902 : 263 - 272
  • [38] A new algorithm for time series prediction using machine learning models
    Jahnavi, Yeturu
    Elango, Poongothai
    Raja, S. P.
    Parra Fuente, Javier
    Verdu, Elena
    EVOLUTIONARY INTELLIGENCE, 2023, 16 (05) : 1449 - 1460
  • [39] Decoding cortical folding patterns in marmosets using machine learning and large language model
    Wu, Yue
    Gao, Xuesong
    Liu, Zhengliang
    Wang, Pengcheng
    Wu, Zihao
    Li, Yiwei
    Zhang, Tuo
    Liu, Tianming
    Liu, Tao
    Li, Xiao
    NEUROIMAGE, 2025, 308
  • [40] Language learning using Machine Learning: a systematic review
    Cruzado, Javier Gamboa
    Huamani-Jeri, Jhon
    Najarro-Buitron, Abel
    Sanchez, Augusto Hidalgo
    Chaca, Marisol Daga
    Zegarra, Indalecio Horna
    APUNTES UNIVERSITARIOS, 2022, 12 (04) : 321 - 345