Task-aware Disentanglement for Object Detection

Cited by: 0

Authors:

Yin, Jun [1]

Wang, Keyang [2]

Wu, Fei [1]

Shao, Ming [2]

Affiliations:

[1] Zhejiang University, Hangzhou, People's Republic of China

[2] Zhejiang Dahua Technology Co., Ltd., Hangzhou, People's Republic of China

Source:

2024 International Joint Conference on Neural Networks (IJCNN 2024) | 2024

Keywords:

feature disentanglement; object detection; task-aware sampling; task-aware activation

DOI:

10.1109/IJCNN60899.2024.10650168

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

The sibling-head structure is widely used in most object detectors to alleviate the feature conflict between the classification and regression tasks. However, because the two branches of the sibling head are trained with exactly the same positive samples and lack explicit feature disentanglement in forward propagation, classification-sensitive and localization-sensitive features remain partly coupled. As a result, the feature conflict between the two tasks persists, which seriously hurts the performance of the classifier and regressor at test time. In this paper, we propose a Task-Aware Disentangled object Detector (TDD) that explicitly disentangles classification and regression in terms of both feature disentanglement and sampling strategy. For feature disentanglement, we design a task-aware activation head driven by a reconstruction-activation mechanism that explicitly activates the features sensitive to classification and to localization during forward propagation. Furthermore, we explore a novel task-aware sampling strategy that explicitly assigns task-adaptive samples to the classification and regression tasks according to their quality distributions. Extensive experiments on MS COCO show that TDD consistently surpasses the baseline by approximately 2.0 AP with different backbones. Moreover, our best model achieves 55.1 AP, outperforming most state-of-the-art detectors.


Pages: 8

37 references in total

