LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models

Cited by: 0
Authors
Hu, Zhiqiang [1 ]
Wang, Lei [2 ]
Lan, Yihuai
Xu, Wanyu [4 ]
Lim, Ee-Peng [2 ]
Bing, Lidong [3 ]
Xu, Xing [5 ]
Poria, Soujanya [1 ]
Lee, Roy Ka-Wei [1 ]
Affiliations
[1] Singapore Univ Technol & Design, Singapore, Singapore
[2] Singapore Management Univ, Singapore, Singapore
[3] Alibaba Grp, DAMO Acad, Singapore, Singapore
[4] Southwest Jiaotong Univ, Chengdu, Peoples R China
[5] Univ Elect Sci & Technol China, Chengdu, Peoples R China
Source
2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023 | 2023
Keywords
DOI
Not available
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory];
Subject Classification Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The success of large language models (LLMs), like GPT-4 and ChatGPT, has led to the development of numerous cost-effective and accessible alternatives that are created by fine-tuning open-access LLMs with task-specific data (e.g., ChatDoctor) or instruction data (e.g., Alpaca). Among the various fine-tuning methods, adapter-based parameter-efficient fine-tuning (PEFT) is undoubtedly one of the most attractive topics, as it requires fine-tuning only a few external parameters instead of the entire LLM while achieving comparable or even better performance. To enable further research on PEFT methods for LLMs, this paper presents LLM-Adapters, an easy-to-use framework that integrates various adapters into LLMs and can execute these adapter-based PEFT methods for different tasks. The framework includes state-of-the-art open-access LLMs such as LLaMA, BLOOM, and GPT-J, as well as widely used adapters such as Series adapters, Parallel adapters, Prompt-based learning, and Reparametrization-based methods. Moreover, we conduct extensive empirical studies on the impact of adapter types, placement locations, and hyper-parameters to identify the best design for each adapter-based method. We evaluate the effectiveness of the adapters on fourteen datasets from two different reasoning tasks, Arithmetic Reasoning and Commonsense Reasoning. The results demonstrate that using adapter-based PEFT in smaller-scale LLMs (7B) with few extra trainable parameters yields performance comparable, and in some cases superior, to powerful LLMs (175B) in zero-shot inference on both reasoning tasks. The code and datasets can be found at https://github.com/AGI-Edgerunners/LLM-Adapters.
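To make the adapter idea in the abstract concrete, below is a minimal sketch of the forward pass of a bottleneck (Series) adapter: a low-rank down-projection, a nonlinearity, an up-projection, and a residual connection around a frozen hidden state. This is an illustrative NumPy sketch, not the paper's implementation; the function name, dimensions, and initialization are assumptions for the example. Zero-initializing the up-projection (a common adapter trick) makes the module start as the identity, so inserting it does not perturb the pretrained model.

```python
import numpy as np

def series_adapter(h, W_down, W_up):
    """Bottleneck (Series) adapter forward pass.

    h:      (d,)  hidden state from the frozen transformer layer
    W_down: (r, d) trainable down-projection, with bottleneck r << d
    W_up:   (d, r) trainable up-projection
    Returns h plus the low-rank adapter update.
    """
    z = np.maximum(W_down @ h, 0.0)  # down-project, then ReLU
    return h + W_up @ z              # up-project and add residual

# Toy dimensions: hidden size d=8, bottleneck r=2 (illustrative only).
d, r = 8, 2
rng = np.random.default_rng(0)
h = rng.normal(size=d)
W_down = rng.normal(size=(r, d)) * 0.1
W_up = np.zeros((d, r))  # zero init: adapter starts as the identity map

out = series_adapter(h, W_down, W_up)
```

Only `W_down` and `W_up` (2·d·r parameters per adapter) would be trained, which is the source of the parameter efficiency: the d×d weights of the base model stay frozen.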
Pages: 5254 - 5276
Page count: 23
Related Papers
50 records in total
  • [1] Parameter-efficient fine-tuning in large language models: a survey of methodologies
    Luping Wang
    Sheng Chen
    Linnan Jiang
    Shu Pan
    Runze Cai
    Sen Yang
    Fei Yang
    Artificial Intelligence Review, 58 (8)
  • [2] Parameter-efficient fine-tuning of large language models using semantic knowledge tuning
    Prottasha, Nusrat Jahan
    Mahmud, Asif
    Sobuj, Md. Shohanur Islam
    Bhat, Prakash
    Kowsher, Md
    Yousefi, Niloofar
    Garibay, Ozlem Ozmen
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [3] Characterizing Communication in Distributed Parameter-Efficient Fine-Tuning for Large Language Models
    Alnaasan, Nawras
    Huang, Horng-Ruey
    Shafi, Aamir
    Subramoni, Hari
    Panda, Dhabaleswar K.
    2024 IEEE SYMPOSIUM ON HIGH-PERFORMANCE INTERCONNECTS, HOTI 2024, 2024, : 11 - 19
  • [4] Democratizing protein language models with parameter-efficient fine-tuning
    Sledzieski, Samuel
    Kshirsagar, Meghana
    Baek, Minkyung
    Dodhia, Rahul
    Ferres, Juan Lavista
    Berger, Bonnie
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2024, 121 (26)
  • [5] Parameter-efficient fine-tuning of large-scale pre-trained language models
    Ning Ding
    Yujia Qin
    Guang Yang
    Fuchao Wei
    Zonghan Yang
    Yusheng Su
    Shengding Hu
    Yulin Chen
    Chi-Min Chan
    Weize Chen
    Jing Yi
    Weilin Zhao
    Xiaozhi Wang
    Zhiyuan Liu
    Hai-Tao Zheng
    Jianfei Chen
    Yang Liu
    Jie Tang
    Juanzi Li
    Maosong Sun
    Nature Machine Intelligence, 2023, 5 : 220 - 235
  • [6] Parameter-efficient fine-tuning of large-scale pre-trained language models
    Ding, Ning
    Qin, Yujia
    Yang, Guang
    Wei, Fuchao
    Yang, Zonghan
    Su, Yusheng
    Hu, Shengding
    Chen, Yulin
    Chan, Chi-Min
    Chen, Weize
    Yi, Jing
    Zhao, Weilin
    Wang, Xiaozhi
    Liu, Zhiyuan
    Zheng, Hai-Tao
    Chen, Jianfei
    Liu, Yang
    Tang, Jie
    Li, Juanzi
    Sun, Maosong
    NATURE MACHINE INTELLIGENCE, 2023, 5 (03) : 220 - 235
  • [7] On the Effectiveness of Parameter-Efficient Fine-Tuning
    Fu, Zihao
    Yang, Haoran
    So, Anthony Man-Cho
    Lam, Wai
    Bing, Lidong
    Collier, Nigel
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 11, 2023, : 12799 - 12807
  • [8] Parameter-Efficient Fine-Tuning of Large Pretrained Models for Instance Segmentation Tasks
    Baker, Nermeen Abou
    Rohrschneider, David
    Handmann, Uwe
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2024, 6 (04): : 2783 - 2807
  • [9] Parameter-Efficient Fine-Tuning of Pre-trained Large Language Models for Financial Text Analysis
    Langa, Kelly
    Wang, Hairong
    Okuboyejo, Olaperi
    ARTIFICIAL INTELLIGENCE RESEARCH, SACAIR 2024, 2025, 2326 : 3 - 20
  • [10] Neural Architecture Search for Parameter-Efficient Fine-tuning of Large Pre-trained Language Models
    Lawton, Neal
    Kumar, Anoop
    Thattai, Govind
    Galstyan, Aram
    Ver Steeg, Greg
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 8506 - 8515