Open-World Dynamic Prompt and Continual Visual Representation Learning

被引：0

作者：

Kim, Youngeun ^{[1
]}

Fang, Jun ^{[2
]}

Zhang, Qin ^{[2
]}

Cai, Zhaowei ^{[3
]}

Shen, Yantao ^{[2
]}

Duggal, Rahul ^{[2
]}

Raychaudhuri, Dripta S. ^{[2
]}

Tut, Zhuowen ^{[2
]}

Xing, Yifan ^{[2
]}

Dabeer, Onkar ^{[2
]}

机构：

[1] Yale Univ, New Haven, CT USA

[2] AWS AI Labs, Seattle, WA 98109 USA

[3] Amazon AGI, Seattle, WA USA

来源：

COMPUTER VISION - ECCV 2024, PT XLIX | 2025年 / 15107卷

关键词：

Dynamic Prompt Generation; Continual Learning; Open-World Visual Representation Learning;

D O I：

10.1007/978-3-031-72967-6_20

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The open world is inherently dynamic, characterized by ever-evolving concepts and distributions. Continual learning (CL) in this dynamic open-world environment presents a significant challenge in effectively generalizing to unseen test-time classes. To address this challenge, we introduce a new practical CL setting tailored for open-world visual representation learning. In this setting, subsequent data streams systematically introduce novel classes that are disjoint from those seen in previous training phases, while also remaining distinct from the unseen test classes. In response, we present Dynamic Prompt and Representation Learner (DPaRL), a simple yet effective Prompt-based CL (PCL) method. Our DPaRL learns to generate dynamic prompts for inference, as opposed to relying on a static prompt pool in previous PCL methods. In addition, DPaRL jointly learns dynamic prompt generation and discriminative representation at each training stage whereas prior PCL methods only refine the prompt learning throughout the process. Our experimental results demonstrate the superiority of our approach, surpassing state-of-the-art methods on well-established open-world image retrieval benchmarks by an average of 4.7% improvement in Recall@1 performance.

引用

页码：357 / 374

页数：18

共 45 条

[1] Rusu AA, 2016, Arxiv, DOI arXiv:1606.04671
[2] Alexey D, 2020, arXiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
[3] Memory Aware Synapses: Learning What (not) to Forget
Aljundi, Rahaf
Babiloni, Francesca
Elhoseiny, Mohamed
Rohrbach, Marcus
Tuytelaars, Tinne
[J]. COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 144 - 161
[4] An X., 2023, ICLR, DOI arXiv:2304.05884
[5] Ba J.L., 2016, arXiv
[6] Buzzega P., 2020, Adv Neural Inf Process Syst, V33, P15920
[7] PSS: Progressive Sample Selection for Open-World Visual Representation Learning
Cao, Tianyue
Wang, Yongxin
Xing, Yifan
Xiao, Tianjun
He, Tong
Zhang, Zheng
Zhou, Hao
Tighe, Joseph
[J]. COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 : 278 - 294
[8] Chaudhry A., 2019, arXiv
[9] Chaudhry A, 2019, Arxiv, DOI arXiv:1812.00420
[10] Chen SF, 2022, Arxiv, DOI arXiv:2205.13535

← 1 2 3 4 5 →