Open-World Dynamic Prompt and Continual Visual Representation Learning

被引:0
作者
Kim, Youngeun [1 ]
Fang, Jun [2 ]
Zhang, Qin [2 ]
Cai, Zhaowei [3 ]
Shen, Yantao [2 ]
Duggal, Rahul [2 ]
Raychaudhuri, Dripta S. [2 ]
Tut, Zhuowen [2 ]
Xing, Yifan [2 ]
Dabeer, Onkar [2 ]
机构
[1] Yale Univ, New Haven, CT USA
[2] AWS AI Labs, Seattle, WA 98109 USA
[3] Amazon AGI, Seattle, WA USA
来源
COMPUTER VISION - ECCV 2024, PT XLIX | 2025年 / 15107卷
关键词
Dynamic Prompt Generation; Continual Learning; Open-World Visual Representation Learning;
D O I
10.1007/978-3-031-72967-6_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The open world is inherently dynamic, characterized by ever-evolving concepts and distributions. Continual learning (CL) in this dynamic open-world environment presents a significant challenge in effectively generalizing to unseen test-time classes. To address this challenge, we introduce a new practical CL setting tailored for open-world visual representation learning. In this setting, subsequent data streams systematically introduce novel classes that are disjoint from those seen in previous training phases, while also remaining distinct from the unseen test classes. In response, we present Dynamic Prompt and Representation Learner (DPaRL), a simple yet effective Prompt-based CL (PCL) method. Our DPaRL learns to generate dynamic prompts for inference, as opposed to relying on a static prompt pool in previous PCL methods. In addition, DPaRL jointly learns dynamic prompt generation and discriminative representation at each training stage whereas prior PCL methods only refine the prompt learning throughout the process. Our experimental results demonstrate the superiority of our approach, surpassing state-of-the-art methods on well-established open-world image retrieval benchmarks by an average of 4.7% improvement in Recall@1 performance.
引用
收藏
页码:357 / 374
页数:18
相关论文
共 45 条
  • [1] Rusu AA, 2016, Arxiv, DOI arXiv:1606.04671
  • [2] Alexey D, 2020, arXiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
  • [3] Memory Aware Synapses: Learning What (not) to Forget
    Aljundi, Rahaf
    Babiloni, Francesca
    Elhoseiny, Mohamed
    Rohrbach, Marcus
    Tuytelaars, Tinne
    [J]. COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 144 - 161
  • [4] An X., 2023, ICLR, DOI arXiv:2304.05884
  • [5] Ba J.L., 2016, arXiv
  • [6] Buzzega P., 2020, Adv Neural Inf Process Syst, V33, P15920
  • [7] PSS: Progressive Sample Selection for Open-World Visual Representation Learning
    Cao, Tianyue
    Wang, Yongxin
    Xing, Yifan
    Xiao, Tianjun
    He, Tong
    Zhang, Zheng
    Zhou, Hao
    Tighe, Joseph
    [J]. COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 : 278 - 294
  • [8] Chaudhry A., 2019, arXiv
  • [9] Chaudhry A, 2019, Arxiv, DOI arXiv:1812.00420
  • [10] Chen SF, 2022, Arxiv, DOI arXiv:2205.13535