PELA: Learning Parameter-Efficient Models with Low-Rank Approximation

Cited by: 0
Authors
Guo, Yangyang [1 ]
Wang, Guangzhi [1 ]
Kankanhalli, Mohan [1 ]
Affiliations
[1] Natl Univ Singapore, Singapore, Singapore
Source
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2024
Funding
National Research Foundation, Singapore;
Keywords
DOI
10.1109/CVPR52733.2024.01486
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Applying a pre-trained large model to downstream tasks is prohibitive under resource-constrained conditions. Recent dominant approaches for addressing efficiency issues involve adding a few learnable parameters to the fixed backbone model. This strategy, however, leads to more challenges in loading large models for downstream fine-tuning with limited resources. In this paper, we propose a novel method for increasing the parameter efficiency of pre-trained models by introducing an intermediate pre-training stage. To this end, we first employ low-rank approximation to compress the original large model and then devise a feature distillation module and a weight perturbation regularization module. These modules are specifically designed to enhance the low-rank model. In particular, we update only the low-rank model while freezing the backbone parameters during pre-training. This allows for direct and efficient utilization of the low-rank model for downstream fine-tuning tasks. The proposed method achieves efficiency in both required parameters and computation time while maintaining comparable results with minimal modifications to the backbone architecture. Specifically, when applied to three vision-only and one vision-language Transformer models, our approach often demonstrates a mere ~0.6 point decrease in performance while reducing the original parameter size by 1/3 to 2/3. We release our code at link.
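The compression step named in the abstract, low-rank approximation of a large model's weights, can be illustrated with a minimal sketch. The abstract does not say which factorization is used, so truncated SVD is an assumption here, as are the helper names `low_rank_factorize` and `param_ratio`, the 768 x 768 layer size, and the rank of 256. The feature distillation and weight perturbation regularization modules are not sketched.

```python
import numpy as np


def low_rank_factorize(weight, rank):
    """Return factors (a, b) such that a @ b approximates weight.

    Assumption: truncated SVD is used as the low-rank approximation.
    """
    u, s, vt = np.linalg.svd(weight, full_matrices=False)
    a = u[:, :rank] * s[:rank]  # shape (out_dim, rank), singular values folded in
    b = vt[:rank, :]            # shape (rank, in_dim)
    return a, b


def param_ratio(out_dim, in_dim, rank):
    """Fraction of the dense layer's parameters kept by the two factors."""
    return rank * (out_dim + in_dim) / (out_dim * in_dim)


# Hypothetical dense weight standing in for one linear layer of a Transformer.
rng = np.random.default_rng(0)
weight = rng.standard_normal((768, 768))

a, b = low_rank_factorize(weight, rank=256)
ratio = param_ratio(768, 768, 256)
rel_err = np.linalg.norm(weight - a @ b) / np.linalg.norm(weight)
print(f"kept {ratio:.2f} of the parameters, relative error {rel_err:.3f}")
```

In this sketch a rank of 256 keeps 2/3 of the layer's parameters and a rank of 128 keeps 1/3, which corresponds to the 1/3 to 2/3 reduction range quoted in the abstract. A random matrix has no low-rank structure, so the reconstruction error printed here says nothing about the accuracy reported for real pre-trained weights.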
Pages: 15699 - 15709
Number of pages: 11
Related papers
50 in total
  • [1] LORS: Low-rank Residual Structure for Parameter-Efficient Network Stacking
    Li, Jialin
    Nie, Qiang
    Fu, Weifu
    Lin, Yuhuan
    Tao, Guangpin
    Liu, Yong
    Wang, Chengjie
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 15866 - 15876
  • [2] Structure-Aware Low-Rank Adaptation for Parameter-Efficient Fine-Tuning
    Hu, Yahao
    Xie, Yifei
    Wang, Tianfeng
    Chen, Man
    Pan, Zhisong
    MATHEMATICS, 2023, 11 (20)
  • [3] LoRAPrune: Structured Pruning Meets Low-Rank Parameter-Efficient Fine-Tuning
    Zhang, Mingyang
    Chen, Hao
    Shen, Chunhua
    Yang, Zhen
    Ou, Linlin
    Yu, Xinyi
    Zhuang, Bohan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 3013 - 3026
  • [4] Introducing Routing Functions to Vision-Language Parameter-Efficient Fine-Tuning with Low-Rank Bottlenecks
    Qiu, Tingyu
    Tuytelaars, Tinne
    Moens, Marie-Francine
    COMPUTER VISION - ECCV 2024, PT LXXXVIII, 2025, 15146 : 291 - 308
  • [5] Learning Mixtures of Low-Rank Models
    Chen, Yanxi
    Ma, Cong
    Poor, H. Vincent
    Chen, Yuxin
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2021, 67 (07) : 4613 - 4636
  • [6] A Fast and Efficient Algorithm for Low-rank Approximation of a Matrix
    Nguyen, Nam H.
    Do, Thong T.
    Tran, Trac D.
    STOC'09: PROCEEDINGS OF THE 2009 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2009, : 215 - 224
  • [7] Modeling the Parameter Interactions in Ranking SVM with Low-Rank Approximation
    Xu, Jun
    Zeng, Wei
    Lan, Yanyan
    Guo, Jiafeng
    Cheng, Xueqi
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (06) : 1181 - 1193
  • [8] Randomized low-rank approximation of parameter-dependent matrices
    Kressner, Daniel
    Lam, Hei Yin
    NUMERICAL LINEAR ALGEBRA WITH APPLICATIONS, 2024, 31 (06)
  • [9] EFFICIENT LEARNING OF DICTIONARIES WITH LOW-RANK ATOMS
    Ravishankar, Saiprasad
    Moore, Brian E.
    Nadakuditi, Raj Rao
    Fessler, Jeffrey A.
    2016 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2016, : 222 - 226
  • [10] Learning Low-Rank Models From Compressive Measurements for Efficient Projection Design
    Coutts, Fraser K.
    Thompson, John
    Mulgrew, Bernard
    2022 SENSOR SIGNAL PROCESSING FOR DEFENCE CONFERENCE, SSPD, 2022, : 96 - 100