Black-Box Prompt Tuning With Subspace Learning

被引：0

作者：

Zheng, Yuanhang ^{[1
]}

Tan, Zhixing ^{[2
]}

Li, Peng ^{[3
,4
]}

Liu, Yang ^{[1
,3
,4
,5
]}

机构：

[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China

[2] Zhongguancun Lab, Beijing 100086, Peoples R China

[3] Tsinghua Univ, Inst AI Ind Res AIR, Beijing 100084, Peoples R China

[4] Shanghai Artificial Intelligence Lab, Shanghai 200030, Peoples R China

[5] Jiangsu Collaborat Innovat Ctr Language Competence, Xuzhou 221116, Jiangsu, Peoples R China

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2024年 / 32卷

基金：

国家重点研发计划;

关键词：

Task analysis; Tuning; Closed box; Speech processing; Metalearning; Sun; Optimization; Black-box; large language models (LLMs); meta-learning; prompt tuning; subspace learning; ADAPTATION;

D O I：

10.1109/TASLP.2024.3407519

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Black-box prompt tuning employs derivative-free optimization algorithms to learn prompts within low-dimensional subspaces rather than back-propagating through the network of Large Language Models (LLMs). Recent studies reveal that black-box prompt tuning lacks versatility across tasks and LLMs, which we believe is related to the suboptimal choice of subspaces. In this paper, we introduce Black-box prompt tuning with Subspace Learning (BSL) to enhance the versatility of black-box prompt tuning. Based on the assumption that nearly optimal prompts for similar tasks reside in a common subspace, we propose identifying such subspaces through meta-learning on a collection of similar source tasks. Consequently, for a target task that shares similarities with the source tasks, we expect that optimizing within the identified subspace can yield a prompt that performs well on the target task. Experimental results confirm that our BSL framework consistently achieves competitive performance across various downstream tasks and LLMs.

引用

页码：3002 / 3013

页数：12

共 50 条

[31] Revizor: Testing Black-Box CPUs Against Speculation Contracts
Oleksenko, Oleksii
Fetzer, Christof
Kopf, Boris
Silberstein, Mark
IEEE MICRO, 2023, 43 (04) : 37 - 44
[32] Effective Sampling, Modeling and Optimization of Constrained Black-box Problems
Bajaj, Ishan
Hasan, M. M. Faruque
26TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING (ESCAPE), PT A, 2016, 38A : 553 - 558
[33] A kriging method for the solution of nonlinear programs with black-box functions
Davis, Eddie
Ierapetritou, Marianthi
AICHE JOURNAL, 2007, 53 (08) : 2001 - 2012
[34] DISTRIBUTED BLACK-BOX OPTIMIZATION OF NONCONVEX FUNCTIONS
Valcarcel Macua, Sergio
Zazo, Santiago
Zazo, Javier
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 3591 - 3595
[35] ARTHROSCOPY TRAINING USING A BLACK-BOX TECHNIQUE
MEYER, RD
TAMARAPALLI, JR
LEMONS, JE
ARTHROSCOPY, 1993, 9 (03): : 338 - 340
[36] Black-box model adaptation for semantic segmentation
Zhou, Zhiheng
Yue, Wanlin
Cao, Yinglie
Shen, Shifu
IMAGE AND VISION COMPUTING, 2024, 150
[37] AutoTinyML for microcontrollers: Dealing with black-box deployability
Perego, Riccardo
Candelieri, Antonio
Archetti, Francesco
Pau, Danilo
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 207
[38] Toward Visual Distortion in Black-Box Attacks
Li, Nannan
Chen, Zhenzhong
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 6156 - 6167
[39] A Black-Box Approach to Latency and Throughput Analysis
Brahneborg, Daniel
Afzal, Wasif
Causevic, Adnan
2017 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY COMPANION (QRS-C), 2017, : 603 - 604
[40] Black-box Adaptation of ASR for Accented Speech
Khandelwal, Kartik
Jyothi, Preethi
Awasthi, Abhijeet
Sarawagi, Sunita
INTERSPEECH 2020, 2020, : 1281 - 1285

← 1 2 3 4 5 →