Black-Box Prompt Tuning With Subspace Learning

Authors
Zheng, Yuanhang [1]
Tan, Zhixing [2]
Li, Peng [3, 4]
Liu, Yang [1, 3, 4, 5]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
[2] Zhongguancun Lab, Beijing 100086, Peoples R China
[3] Tsinghua Univ, Inst AI Ind Res AIR, Beijing 100084, Peoples R China
[4] Shanghai Artificial Intelligence Lab, Shanghai 200030, Peoples R China
[5] Jiangsu Collaborat Innovat Ctr Language Competence, Xuzhou 221116, Jiangsu, Peoples R China
Funding
National Key Research and Development Program of China;
Keywords
Task analysis; Tuning; Closed box; Speech processing; Metalearning; Sun; Optimization; Black-box; large language models (LLMs); meta-learning; prompt tuning; subspace learning; ADAPTATION;
DOI
10.1109/TASLP.2024.3407519
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Black-box prompt tuning employs derivative-free optimization algorithms to learn prompts within low-dimensional subspaces rather than back-propagating through the network of Large Language Models (LLMs). Recent studies reveal that black-box prompt tuning lacks versatility across tasks and LLMs, which we believe is related to the suboptimal choice of subspaces. In this paper, we introduce Black-box prompt tuning with Subspace Learning (BSL) to enhance the versatility of black-box prompt tuning. Based on the assumption that nearly optimal prompts for similar tasks reside in a common subspace, we propose identifying such subspaces through meta-learning on a collection of similar source tasks. Consequently, for a target task that shares similarities with the source tasks, we expect that optimizing within the identified subspace can yield a prompt that performs well on the target task. Experimental results confirm that our BSL framework consistently achieves competitive performance across various downstream tasks and LLMs.
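The general idea of derivative-free optimization inside a low-dimensional subspace can be illustrated with a minimal sketch. This is not the BSL method from the paper: the projection matrix `A` here is random rather than meta-learned, the optimizer is a plain accept-if-better random search, and `toy_loss` is a hypothetical stand-in for scoring a prompt by querying an LLM. All function names and parameter values are assumptions made for illustration.

```python
import random

def project(A, z):
    # Map a low-dimensional vector z (length d) to a full prompt vector (length D).
    return [sum(a * z_j for a, z_j in zip(row, z)) for row in A]

def toy_loss(prompt, target):
    # Hypothetical black-box score: returns only a scalar, never a gradient.
    return sum((p - t) ** 2 for p, t in zip(prompt, target))

def tune_in_subspace(loss_fn, A, steps=200, sigma=0.5, seed=0):
    # Accept-if-better random search over z; the model is only ever queried
    # through loss_fn, so no back-propagation is required.
    rng = random.Random(seed)
    d = len(A[0])
    z = [0.0] * d
    best = loss_fn(project(A, z))
    history = [best]
    for _ in range(steps):
        candidate = [z_j + sigma * rng.gauss(0.0, 1.0) for z_j in z]
        value = loss_fn(project(A, candidate))
        if value < best:
            z, best = candidate, value
        history.append(best)
    return z, history

rng = random.Random(42)
D, d = 100, 5  # assumed sizes: full prompt dimension and subspace dimension
# Random projection (an assumption; BSL instead learns the subspace from source tasks).
A = [[rng.gauss(0.0, 1.0) / d ** 0.5 for _ in range(d)] for _ in range(D)]
# Toy target prompt chosen to lie inside the subspace spanned by A.
target = project(A, [rng.gauss(0.0, 1.0) for _ in range(d)])

z_best, history = tune_in_subspace(lambda p: toy_loss(p, target), A)
print(f"loss before: {history[0]:.3f}, loss after: {history[-1]:.3f}")
```

In this toy setup the target lies in the chosen subspace, so a search over only `d` coordinates can reach it. That mirrors the abstract's assumption that nearly optimal prompts for similar tasks share a common subspace; when the subspace is poorly chosen, no search over `z` can reach a good prompt.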
Pages: 3002-3013
Number of pages: 12