2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)
|
2024年
基金:
新加坡国家研究基金会;
关键词:
D O I:
10.1109/CVPR52733.2024.01486
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
Applying a pre-trained large model to downstream tasks is prohibitive under resource-constrained conditions. Re-cent dominant approaches for addressing efficiency issues involve adding a few learnable parameters to the fixed backbone model. This strategy, however, leads to more challenges in loading large models for downstream fine-tuning with limited resources. In this paper, we propose a novel method for increasing the parameter efficiency of pre-trained models by introducing an intermediate pre-training stage. To this end, we first employ low-rank approximation to compress the original large model and then devise a feature distillation module and a weight perturbation regularization module. These modules are specifically designed to enhance the low-rank model. In particular, we update only the low-rank model while freezing the backbone parameters during pre-training. This allows for direct and efficient utilization of the low-rank model for downstream fine-tuning tasks. The proposed method achieves both efficiencies in terms of required parameters and computation time while maintaining comparable results with minimal modifications to the backbone architecture. Specifically, when applied to three vision-only and one vision-language Transformer models, our approach often demonstrates a merely similar to 0.6 point decrease in performance while reducing the original parameter size by 1/3 to 2/3. We release our code at link.
机构:
Natl Univ Def Technol, Coll Liberal Arts & Sci, Changsha 410073, Peoples R ChinaNatl Univ Def Technol, Coll Liberal Arts & Sci, Changsha 410073, Peoples R China
Tao, Hong
Hou, Chenping
论文数: 0引用数: 0
h-index: 0
机构:
Natl Univ Def Technol, Coll Liberal Arts & Sci, Changsha 410073, Peoples R ChinaNatl Univ Def Technol, Coll Liberal Arts & Sci, Changsha 410073, Peoples R China
Hou, Chenping
Yi, Dongyun
论文数: 0引用数: 0
h-index: 0
机构:
Natl Univ Def Technol, Coll Liberal Arts & Sci, Changsha 410073, Peoples R ChinaNatl Univ Def Technol, Coll Liberal Arts & Sci, Changsha 410073, Peoples R China
Yi, Dongyun
Zhu, Jubo
论文数: 0引用数: 0
h-index: 0
机构:
Natl Univ Def Technol, Coll Liberal Arts & Sci, Changsha 410073, Peoples R ChinaNatl Univ Def Technol, Coll Liberal Arts & Sci, Changsha 410073, Peoples R China
Zhu, Jubo
Hu, Dewen
论文数: 0引用数: 0
h-index: 0
机构:
Natl Univ Def Technol, Coll Mechatron & Automat, Changsha 410073, Peoples R ChinaNatl Univ Def Technol, Coll Liberal Arts & Sci, Changsha 410073, Peoples R China