Evaluating the capabilities of large language models using machine learning tasks at inference-time

被引：0

作者：

Grm, Klemen ^{[1
]}

机构：

[1] Univ Ljubljani, Fak Elektrotehniko, Trzaska Cesta 25, Ljubljana 1000, Slovenia

来源：

ELEKTROTEHNISKI VESTNIK | 2023年 / 90卷 / 05期

关键词：

language models; machine learning; evaluation methodology;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Machine learning is the domain of algorithms capable of learning from data to improve their performance on a task or set of tasks. Common machine learning tasks include classification, regression, and generative modelling. The most common modern example of machine learners in practical use is deep neural networks coupled with an extrinsic optimizer such as stochastic gradient descent. Recently, scaled-up large language models have shown increasing capabilities of in-context meta-learning, which has been used to improve their performance on language tasks through few-shot learning. In this paper, we show that pre-trained large language models can act as machine learners with regard to in-context data, without using extrinsic optimization tools or weight updates. By evaluating the language models' inference time machine learning abilities on synthetic or appropriately transformed datasets, we conclusively show that they're able to model complex relationships between data in the input context. This implies that inference-time machine learning tasks represent a meaningful capability evaluation task for large language models.

引用

页码：247 / 253

页数：7

共 50 条

[41] Large language models as tax attorneys: a case study in legal capabilities emergence
Nay, John J.
Karamardian, David
Lawsky, Sarah B.
Tao, Wenting
Bhat, Meghana
Jain, Raghav
Lee, Aaron Travis
Choi, Jonathan H.
Kasai, Jungo
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2024, 382 (2270):
[42] Enhancing Traffic Incident Management with Large Language Models: A Hybrid Machine Learning Approach for Severity Classification
Grigorev, Artur
Saleh, Khaled
Ou, Yuming
Mihaita, Adriana-Simona
INTERNATIONAL JOURNAL OF INTELLIGENT TRANSPORTATION SYSTEMS RESEARCH, 2024, : 259 - 280
[43] PCA-based membership inference attack for machine learning models
Peng C.
Gao T.
Liu H.
Ding H.
Tongxin Xuebao/Journal on Communications, 2022, 43 (01): : 149 - 160
[44] Towards Securing Machine Learning Models Against Membership Inference Attacks
Ben Hamida, Sana
Mrabet, Hichem
Belguith, Sana
Alhomoud, Adeeb
Jemai, Abderrazak
CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (03): : 4897 - 4919
[45] Performance analysis of various machine learning models for membership inference attack
Karthikeyan, K.
Padmanaban, K.
Kavitha, Datchanamoorthy
Sekhar, Jampani Chandra
INTERNATIONAL JOURNAL OF SENSOR NETWORKS, 2023, 43 (04) : 232 - 245
[46] Real-Time Anomalous Branch Behavior Inference with a GPU-inspired Engine for Machine Learning Models
Oh, Hyunyoung
Yi, Hayoon
Choe, Hyeokjun
Cho, Yeongpil
Yoon, Sungroh
Paek, Yunheung
2019 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE), 2019, : 908 - 913
[47] Toward interpretable machine learning: evaluating models of heterogeneous predictions
Zhang, Ruixun
ANNALS OF OPERATIONS RESEARCH, 2024, : 867 - 887
[48] Bioclimatic inference based on mammal community using machine learning regression models: perspectives for paleoecological studies
Linchamps, Pierre
Stoetzel, Emmanuelle
Robinet, Francois
Hanon, Raphael
Latouche, Pierre
Cornette, Raphael
FRONTIERS IN ECOLOGY AND EVOLUTION, 2023, 11
[49] Machine learning models for evaluating the benefits of business intelligence systems
Tripathi M.A.
Madhavi K.
Kandi V.S.P.
Nassa V.K.
Mallik B.
Chakravarthi M.K.
Journal of High Technology Management Research, 2023, 34 (02)
[50] Evaluating the efficacy of bioelectrical impedance analysis using machine learning models for the classification of parasitized goats
Terrill, Thomas H.
Siddique, Aftab
Erukulla, Tharun Tej
Batchu, Phaneendra
Chelkapally, Sai
Brown, Davia
Stegall, Kensley
Kannan, Govind
Mahapatra, Ajit
Panda, Sudhanshu
Morgan, Eric
van Wyk, Jan
JOURNAL OF ANIMAL SCIENCE, 2024, 102 : 458 - 459

← 1 2 3 4 5 →