Utilizing Large Language Models for Ablation Studies in Machine Learning and Deep Learning

被引:0
作者
Sheikholeslami, Sina [1 ]
Ghasemirahni, Hamid [1 ]
Payberah, Amir H. [1 ]
Wang, Tianze [1 ]
Dowling, Jim [2 ]
Vlassov, Vladimir [1 ]
机构
[1] KTH Royal Inst Technol, Stockholm, Sweden
[2] Hopsworks AB, Stockholm, Sweden
来源
PROCEEDINGS OF THE 2025 THE 5TH WORKSHOP ON MACHINE LEARNING AND SYSTEMS, EUROMLSYS 2025 | 2025年
关键词
Ablation Studies; Machine Learning; Deep Learning; Deep Neural Networks; Feature Ablation; Model Ablation; Large Language Models;
D O I
10.1145/3721146.3721957
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In Machine Learning (ML) and Deep Learning (DL) research, ablation studies are typically performed to provide insights into the individual contribution of different building blocks and components of an ML/DL system (e.g., a deep neural network), as well as to justify that certain additions or modifications to an existing ML/DL system can result in the proposed improved performance. Although dedicated frameworks for performing ablation studies have been introduced in recent years, conducting such experiments is still associated with requiring tedious, redundant work, typically involving maintaining redundant and nearly identical versions of code that correspond to different ablation trials. Inspired by the recent promising performance of Large Language Models (LLMs) in the generation and analysis of ML/DL code, in this paper we discuss the potential of LLMs as facilitators of ablation study experiments for scientific research projects that involve or deal with ML and DL models. We first discuss the different ways in which LLMs can be utilized for ablation studies and then present the prototype of a tool called ABLATIONMAGE, that leverages LLMs to semi-automate the overall process of conducting ablation study experiments. We showcase the usability of ABLATIONMAGE as a tool through three experiments, including one in which we reproduce the ablation studies from a recently published applied DL paper.
引用
收藏
页码:230 / 237
页数:8
相关论文
共 34 条
[1]   The Higgs Machine Learning Challenge [J].
Adam-Bourdarios, C. ;
Cowan, G. ;
Germain-Renaud, C. ;
Guyon, I. ;
Kegl, B. ;
Rousseau, D. .
21ST INTERNATIONAL CONFERENCE ON COMPUTING IN HIGH ENERGY AND NUCLEAR PHYSICS (CHEP2015), PARTS 1-9, 2015, 664
[2]  
Chen A, 2023, ADV NEUR IN
[3]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[4]  
Chowdhery A, 2023, J MACH LEARN RES, V24
[5]   Leveraging graph neural networks for supporting automatic triage of patients [J].
Defilippo, Annamaria ;
Veltri, Pierangelo ;
Lio, Pietro ;
Guzzi, Pietro Hiram .
SCIENTIFIC REPORTS, 2024, 14 (01)
[6]  
Dehghani Mostafa, P MACHINE LEARNING R
[7]  
Dubey Abhimanyu., The Llama 3 herd of models
[8]  
Fostiropoulos Iordanis, 2023, PR MACH LEARN RES, V224
[9]   Deploying Stateful Network Functions Efficiently using Large Language Models [J].
Ghasemirahni, Hamid ;
Farshin, Alireza ;
Scazzariello, Mariano ;
Chiesa, Marco ;
Kostic, Dejan .
PROCEEDINGS OF THE 2024 4TH WORKSHOP ON MACHINE LEARNING AND SYSTEMS, EUROMLSYS 2024, 2024, :28-38
[10]   Enhancing Source Code Representations for Deep Learning with Static Analysis [J].
Guan, Xueting ;
Treude, Christoph .
PROCEEDINGS 2024 32ND IEEE/ACM INTERNATIONAL CONFERENCE ON PROGRAM COMPREHENSION, ICPC 2024, 2024, :64-68