Evaluating the capabilities of large language models using machine learning tasks at inference-time

被引:0
|
作者
Grm, Klemen [1 ]
机构
[1] Univ Ljubljani, Fak Elektrotehniko, Trzaska Cesta 25, Ljubljana 1000, Slovenia
来源
ELEKTROTEHNISKI VESTNIK | 2023年 / 90卷 / 05期
关键词
language models; machine learning; evaluation methodology;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Machine learning is the domain of algorithms capable of learning from data to improve their performance on a task or set of tasks. Common machine learning tasks include classification, regression, and generative modelling. The most common modern example of machine learners in practical use is deep neural networks coupled with an extrinsic optimizer such as stochastic gradient descent. Recently, scaled-up large language models have shown increasing capabilities of in-context meta-learning, which has been used to improve their performance on language tasks through few-shot learning. In this paper, we show that pre-trained large language models can act as machine learners with regard to in-context data, without using extrinsic optimization tools or weight updates. By evaluating the language models' inference time machine learning abilities on synthetic or appropriately transformed datasets, we conclusively show that they're able to model complex relationships between data in the input context. This implies that inference-time machine learning tasks represent a meaningful capability evaluation task for large language models.
引用
收藏
页码:247 / 253
页数:7
相关论文
共 50 条
  • [41] Large language models as tax attorneys: a case study in legal capabilities emergence
    Nay, John J.
    Karamardian, David
    Lawsky, Sarah B.
    Tao, Wenting
    Bhat, Meghana
    Jain, Raghav
    Lee, Aaron Travis
    Choi, Jonathan H.
    Kasai, Jungo
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2024, 382 (2270):
  • [42] Enhancing Traffic Incident Management with Large Language Models: A Hybrid Machine Learning Approach for Severity Classification
    Grigorev, Artur
    Saleh, Khaled
    Ou, Yuming
    Mihaita, Adriana-Simona
    INTERNATIONAL JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS RESEARCH, 2024, : 259 - 280
  • [43] PCA-based membership inference attack for machine learning models
    Peng C.
    Gao T.
    Liu H.
    Ding H.
    Tongxin Xuebao/Journal on Communications, 2022, 43 (01): : 149 - 160
  • [44] Towards Securing Machine Learning Models Against Membership Inference Attacks
    Ben Hamida, Sana
    Mrabet, Hichem
    Belguith, Sana
    Alhomoud, Adeeb
    Jemai, Abderrazak
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (03): : 4897 - 4919
  • [45] Performance analysis of various machine learning models for membership inference attack
    Karthikeyan, K.
    Padmanaban, K.
    Kavitha, Datchanamoorthy
    Sekhar, Jampani Chandra
    INTERNATIONAL JOURNAL OF SENSOR NETWORKS, 2023, 43 (04) : 232 - 245
  • [46] Real-Time Anomalous Branch Behavior Inference with a GPU-inspired Engine for Machine Learning Models
    Oh, Hyunyoung
    Yi, Hayoon
    Choe, Hyeokjun
    Cho, Yeongpil
    Yoon, Sungroh
    Paek, Yunheung
    2019 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2019, : 908 - 913
  • [47] Toward interpretable machine learning: evaluating models of heterogeneous predictions
    Zhang, Ruixun
    ANNALS OF OPERATIONS RESEARCH, 2024, : 867 - 887
  • [48] Bioclimatic inference based on mammal community using machine learning regression models: perspectives for paleoecological studies
    Linchamps, Pierre
    Stoetzel, Emmanuelle
    Robinet, Francois
    Hanon, Raphael
    Latouche, Pierre
    Cornette, Raphael
    FRONTIERS IN ECOLOGY AND EVOLUTION, 2023, 11
  • [49] Machine learning models for evaluating the benefits of business intelligence systems
    Tripathi M.A.
    Madhavi K.
    Kandi V.S.P.
    Nassa V.K.
    Mallik B.
    Chakravarthi M.K.
    Journal of High Technology Management Research, 2023, 34 (02)
  • [50] Evaluating the efficacy of bioelectrical impedance analysis using machine learning models for the classification of parasitized goats
    Terrill, Thomas H.
    Siddique, Aftab
    Erukulla, Tharun Tej
    Batchu, Phaneendra
    Chelkapally, Sai
    Brown, Davia
    Stegall, Kensley
    Kannan, Govind
    Mahapatra, Ajit
    Panda, Sudhanshu
    Morgan, Eric
    van Wyk, Jan
    JOURNAL OF ANIMAL SCIENCE, 2024, 102 : 458 - 459