E2VPT: An Effective and Efficient Approach for Visual Prompt Tuning

被引:9
|
作者
Han, Cheng [1 ]
Wang, Qifan [2 ]
Cui, Yiming [3 ]
Cao, Zhiwen [1 ,4 ]
Wang, Wenguan [5 ]
Qi, Siyuan [6 ]
Liu, Dongfang [1 ]
机构
[1] Rochester Inst Technol, Rochester, NY 14623 USA
[2] Meta AI, New York, NY USA
[3] Univ Florida, Gainesville, FL USA
[4] Purdue Univ, W Lafayette, IN USA
[5] Zhejiang Univ, Hangzhou, Peoples R China
[6] BIGAI, Haveli, Maharashtra, India
来源
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年
基金
美国国家科学基金会;
关键词
D O I
10.1109/ICCV51070.2023.01604
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the size of transformer-based, models continues to grow, fine-tuning these large-scale pretrained vision models for new tasks has become increasingly parameter-intensive. Parameter-efficient learning has been developed to reduce the number of tunable parameters during fine-tuning. Although these methods show promising results, there is still a significant performance gap compared to full fine-tuning. To address this challenge, we propose an Effective and Efficient Visual Prompt Tuning ((EVPT)-V-2) approach for large-scale transformer-based model adaptation. Specifically, we introduce a set of learnable key-value prompts and visual prompts into self-attention and input layers, respectively, to improve the effectiveness of model fine-tuning. Moreover, we design a prompt pruning procedure to systematically prune low importance prompts while preserving model performance, which largely enhances the models efficiency. Empirical results demonstrate that our approach outperforms several state-of-the-art baselines on two benchmarks, with considerably low parameter usage (e.g., 0.32% of model parameters on VTAB-1k). Our code is available at https://github.com/ChengHan111/E2VPT.
引用
收藏
页码:17445 / 17456
页数:12
相关论文
共 40 条
  • [1] Dynamic Visual Prompt Tuning for Parameter Efficient Transfer Learning
    Ruan, Chunqing
    Wang, Hongjian
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VIII, 2024, 14432 : 293 - 303
  • [2] P-Distill: Efficient and Effective Prompt Tuning Using Knowledge Distillation
    Won, Hyun-Sik
    Choi, Joon-Young
    Zaman, Namrah
    Aliyeva, Dinara
    Kim, Kang-Min
    APPLIED SCIENCES-BASEL, 2025, 15 (05):
  • [3] SMoP: Towards Efficient and Effective Prompt Tuning with Sparse Mixture-of-Prompts
    Choi, Joon-Young
    Kim, Junho
    Park, Jun-Hyung
    Mok, Wing-Lam
    Lee, SangKeun
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 14306 - 14316
  • [4] A2Pt : Anti-Associative Prompt Tuning for Open Set Visual Recognition
    Ren, Hairui
    Tang, Fan
    Pan, Xingjia
    Cao, Juan
    Dong, Weiming
    Lin, Zhiwen
    Yan, Ke
    Xu, Changsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8419 - 8431
  • [5] A Quantum Annealing Instance Selection Approach for Efficient and Effective Transformer Fine-Tuning
    Pasin, Andrea
    Cunha, Washington
    Goncalves, Marcos Andre
    Ferro, Nicola
    PROCEEDINGS OF THE 2024 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2024, 2024, : 205 - 214
  • [6] An Efficient and Effective Approach to Developing Engineering E-Training Courses
    Tseng, Judy C. R.
    Tsai, Wen-Ling
    Hwang, Gwo-Jen
    Wu, Po-Han
    INTERNATIONAL JOURNAL OF DISTANCE EDUCATION TECHNOLOGIES, 2007, 5 (01) : 37 - 53
  • [7] Visual Query Tuning: Towards Effective Usage of Intermediate Representations for Parameter and Memory Efficient Transfer Learning
    Tu, Cheng-Hao
    Mai, Zheda
    Chao, Wei-Lun
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 7725 - 7735
  • [8] Gage R2&E2: an effective tool to improve the visual control of products
    Maire, Jean-Luc
    Pillet, Maurice
    Baudet, Nathalie
    INTERNATIONAL JOURNAL OF QUALITY & RELIABILITY MANAGEMENT, 2013, 30 (02) : 161 - +
  • [9] ESRDO: An Efficient E2E SFC Resource Dynamic Orchestration Framework and Approach
    Fan, Weixuan
    Li, Jin
    Zhang, Min
    2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
  • [10] E2LNet: An Efficient and Effective Lightweight Network for Panoramic Depth Estimation
    Xu, Jiayue
    Zhao, Jianping
    Li, Hua
    Han, Cheng
    Xu, Chao
    SENSORS, 2023, 23 (22)