A Refinement Method for Single-Stage Object Detection Based on Progressive Decoupled Task Alignment

Cited by: 3
Authors
Tang, Xianlun [1]
Yang, Qiao [2]
Zhang, Xi [3]
Deng, Wuquan [4]
Wang, Huiming [1]
Gao, Xinbo [1]
Affiliations
[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Complex Syst & Bion Control, Chongqing 400065, Peoples R China
[2] China Elect Technol Grp Corp, Res Inst 10, Chengdu 610036, Peoples R China
[3] Chongqing Coll Mobile Commun, Chongqing Key Lab Publ Big Data Secur Technol, Chongqing 401520, Peoples R China
[4] Chongqing Univ, Chongqing Emergency Med Ctr, Dept Endocrinol & Metab, Cent Hosp, Chongqing 400014, Peoples R China
Keywords
Single-stage object detection; task alignment; feature conflicts; probabilistic mapping method; information interaction
DOI
10.1109/TCSVT.2023.3323879
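Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract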
Parallel branches that independently optimize classification and localization are widely used in single-stage object detection. However, feature conflicts, limited information interaction between branches, and empirically designed sample allocation schemes lead to weak spatial consistency between the outputs of the different branches. In this work, we propose Progressive Decoupled Task Alignment (PDTA), which enhances information interaction between tasks while reducing the degree of feature coupling, and adopts a strategy based on sample screening and learning to achieve task alignment. First, we design the Discrepant Feature Decoupling Module (DFDM), embedded with the novel Oriented Decoupling Convolution (ODC), to handle the coupled features of the shared input; the features extracted by ODC are disentangled through a differentiated feed-in scheme. Second, the Probabilistic Mapping Interaction Head (PMI-Head) uses a probabilistic mapping method to enhance task-specific semantics through information interaction. Finally, the metric in the proposed Relevance-Guided Adaptive Task Alignment (RATA) strengthens the network's joint attention to the content and position of the target, and samples are retained in an exponentially decaying manner so that those more effective for both tasks are preserved for training. During training, task-aligned learning is performed with the Relevance-Guided Loss. Experiments on the MS COCO and DIOR datasets demonstrate the effectiveness of our method: PDTA achieves better object detection performance.
Pages: 3383-3394
Number of pages: 12