Learning Task-Aware Language-Image Representation for Class-Incremental Object Detection

被引：0

作者：

Zhang, Hongquan ^{[1
,2
,3
]}

Gao, Bin-Bin ^{[2
]}

Zeng, Yi ^{[2
]}

Tian, Xudong ^{[1
,3
]}

Tan, Xin ^{[1
,3
]}

Zhang, Zhizhong ^{[1
,3
]}

Qu, Yanyun ^{[4
]}

Liu, Jun ^{[2
]}

Xie, Yuan ^{[1
,3
]}

机构：

[1] East China Normal Univ, Shanghai, Peoples R China

[2] Tencent YouTu Lab, Shenzhen, Guangdong, Peoples R China

[3] East China Normal Univ, Chongqing Inst, Shanghai, Peoples R China

[4] Xiamen Univ, Xiamen, Peoples R China

来源：

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7 | 2024年

基金：

上海市自然科学基金; 中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Class-incremental object detection (CIOD) is a real-world desired capability, requiring an object detector to continuously adapt to new tasks without forgetting learned ones, with the main challenge being catastrophic forgetting. Many methods based on distillation and replay have been proposed to alleviate this problem. However, they typically learn on a pure visual backbone, neglecting the powerful representation capabilities of textual cues, which to some extent limits their performance. In this paper, we propose task-aware language-image representation to mitigate catastrophic forgetting, introducing a new paradigm for language-image-based CIOD. First of all, we demonstrate the significant advantage of language-image detectors in mitigating catastrophic forgetting. Secondly, we propose a learning task-aware language-image representation method that overcomes the existing drawback of directly utilizing the language-image detector for CIOD. More specifically, we learn the language-image representation of different tasks through an insulating approach in the training stage, while using the alignment scores produced by task-specific language-image representation in the inference stage. Through our proposed method, language-image detectors can be more practical for CIOD. We conduct extensive experiments on COCO 2017 and Pascal VOC 2007 and demonstrate that the proposed method achieves state-of-the-art results under the various CIOD settings.

引用

页码：7096 / 7104

页数：9

共 50 条

[21] Task-Aware Monocular Depth Estimation for 3D Object Detection
Wang, Xinlong
Yin, Wei
Kong, Tao
Jiang, Yuning
Li, Lei
Shen, Chunhua
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12257 - 12264
[22] Overcomplete-to-sparse representation learning for few-shot class-incremental learning
Fu, Mengying
Liu, Binghao
Ma, Tianren
Ye, Qixiang
MULTIMEDIA SYSTEMS, 2024, 30 (02)
[23] Context-aware feature reconstruction for class-incremental anomaly detection and localization
Pang, Jingxuan
Li, Chunguang
NEURAL NETWORKS, 2025, 181
[24] Overcomplete-to-sparse representation learning for few-shot class-incremental learning
Fu Mengying
Liu Binghao
Ma Tianren
Ye Qixiang
Multimedia Systems, 2024, 30
[25] GENERALIZABLE TWO-BRANCH FRAMEWORK FOR IMAGE CLASS-INCREMENTAL LEARNING
Wu, Chao
Chang, Xiaobin
Wang, Ruixuan
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 4265 - 4269
[26] Knowledge Representation by Generic Models for Few-Shot Class-Incremental Learning
Chen, Xiaodong
Jiang, Weijie
Huang, Zhiyong
Su, Jiangwen
Yu, Yuanlong
ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 1237 - 1247
[27] CLOSER: Towards Better Representation Learning for Few-Shot Class-Incremental Learning
Oh, Junghun
Baik, Sungyong
Lee, Kyoung Mu
COMPUTER VISION - ECCV 2024, PT XLIX, 2025, 15107 : 18 - 35
[28] Hyperspectral Image Classification Based on Class-Incremental Learning with Knowledge Distillation
Xu, Meng
Zhao, Yuanyuan
Liang, Yajun
Ma, Xiaorui
REMOTE SENSING, 2022, 14 (11)
[29] Few-Shot Class-Incremental Learning via Class-Aware Bilateral Distillation
Zhao, Linglan
Lu, Jing
Xu, Yunlu
Cheng, Zhanzhan
Guo, Dashan
Niu, Yi
Fang, Xiangzhong
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11838 - 11847
[30] COOLer: Class-Incremental Learning for Appearance-Based Multiple Object Tracking
Liu, Zhizheng
Segu, Mattia
Yu, Fisher
PATTERN RECOGNITION, DAGM GCPR 2023, 2024, 14264 : 443 - 458

← 1 2 3 4 5 →