Learning Task-Aware Language-Image Representation for Class-Incremental Object Detection

被引:0
|
作者
Zhang, Hongquan [1 ,2 ,3 ]
Gao, Bin-Bin [2 ]
Zeng, Yi [2 ]
Tian, Xudong [1 ,3 ]
Tan, Xin [1 ,3 ]
Zhang, Zhizhong [1 ,3 ]
Qu, Yanyun [4 ]
Liu, Jun [2 ]
Xie, Yuan [1 ,3 ]
机构
[1] East China Normal Univ, Shanghai, Peoples R China
[2] Tencent YouTu Lab, Shenzhen, Guangdong, Peoples R China
[3] East China Normal Univ, Chongqing Inst, Shanghai, Peoples R China
[4] Xiamen Univ, Xiamen, Peoples R China
来源
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7 | 2024年
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Class-incremental object detection (CIOD) is a real-world desired capability, requiring an object detector to continuously adapt to new tasks without forgetting learned ones, with the main challenge being catastrophic forgetting. Many methods based on distillation and replay have been proposed to alleviate this problem. However, they typically learn on a pure visual backbone, neglecting the powerful representation capabilities of textual cues, which to some extent limits their performance. In this paper, we propose task-aware language-image representation to mitigate catastrophic forgetting, introducing a new paradigm for language-image-based CIOD. First of all, we demonstrate the significant advantage of language-image detectors in mitigating catastrophic forgetting. Secondly, we propose a learning task-aware language-image representation method that overcomes the existing drawback of directly utilizing the language-image detector for CIOD. More specifically, we learn the language-image representation of different tasks through an insulating approach in the training stage, while using the alignment scores produced by task-specific language-image representation in the inference stage. Through our proposed method, language-image detectors can be more practical for CIOD. We conduct extensive experiments on COCO 2017 and Pascal VOC 2007 and demonstrate that the proposed method achieves state-of-the-art results under the various CIOD settings.
引用
收藏
页码:7096 / 7104
页数:9
相关论文
共 50 条
  • [21] Task-Aware Monocular Depth Estimation for 3D Object Detection
    Wang, Xinlong
    Yin, Wei
    Kong, Tao
    Jiang, Yuning
    Li, Lei
    Shen, Chunhua
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12257 - 12264
  • [22] Overcomplete-to-sparse representation learning for few-shot class-incremental learning
    Fu, Mengying
    Liu, Binghao
    Ma, Tianren
    Ye, Qixiang
    MULTIMEDIA SYSTEMS, 2024, 30 (02)
  • [23] Context-aware feature reconstruction for class-incremental anomaly detection and localization
    Pang, Jingxuan
    Li, Chunguang
    NEURAL NETWORKS, 2025, 181
  • [24] Overcomplete-to-sparse representation learning for few-shot class-incremental learning
    Fu Mengying
    Liu Binghao
    Ma Tianren
    Ye Qixiang
    Multimedia Systems, 2024, 30
  • [25] GENERALIZABLE TWO-BRANCH FRAMEWORK FOR IMAGE CLASS-INCREMENTAL LEARNING
    Wu, Chao
    Chang, Xiaobin
    Wang, Ruixuan
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 4265 - 4269
  • [26] Knowledge Representation by Generic Models for Few-Shot Class-Incremental Learning
    Chen, Xiaodong
    Jiang, Weijie
    Huang, Zhiyong
    Su, Jiangwen
    Yu, Yuanlong
    ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 1237 - 1247
  • [27] CLOSER: Towards Better Representation Learning for Few-Shot Class-Incremental Learning
    Oh, Junghun
    Baik, Sungyong
    Lee, Kyoung Mu
    COMPUTER VISION - ECCV 2024, PT XLIX, 2025, 15107 : 18 - 35
  • [28] Hyperspectral Image Classification Based on Class-Incremental Learning with Knowledge Distillation
    Xu, Meng
    Zhao, Yuanyuan
    Liang, Yajun
    Ma, Xiaorui
    REMOTE SENSING, 2022, 14 (11)
  • [29] Few-Shot Class-Incremental Learning via Class-Aware Bilateral Distillation
    Zhao, Linglan
    Lu, Jing
    Xu, Yunlu
    Cheng, Zhanzhan
    Guo, Dashan
    Niu, Yi
    Fang, Xiangzhong
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11838 - 11847
  • [30] COOLer: Class-Incremental Learning for Appearance-Based Multiple Object Tracking
    Liu, Zhizheng
    Segu, Mattia
    Yu, Fisher
    PATTERN RECOGNITION, DAGM GCPR 2023, 2024, 14264 : 443 - 458