Decoupling and Interaction: task coordination in single-stage object detection

被引:0
|
作者
Ma J.-W. [1 ]
Tian S. [1 ]
Man H. [2 ]
Chen S.-L. [1 ]
Qin J. [1 ]
Yin X.-C. [1 ]
机构
[1] Department of Computer Science and Technology, University of Science and Technology Beijing, No. 30 Xueyuan Road, Haidian District, Beijing
[2] School of Foreign Studies, University of Science and Technology Beijing, No. 30 Xueyuan Road, Haidian District, Beijing
关键词
Feature decoupling; Information interaction; Object detection; Task coordination;
D O I
10.1007/s11042-024-19257-x
中图分类号
学科分类号
摘要
In the field of computer vision, general single-stage object detection methods employ two individual subnets within detection head, serving classification and localization purposes respectively. However, the lack of explicit modeling for distinctions and associations poses challenges for aligning the spatial feature perception of these two tasks, consequently leading to sub-optimal detection performance. Although some methods utilize classification to evaluate localization, it is a compromise rather than multi-task optimization. In this paper, we propose a Task-coordinated Single-stage Object Detector (TSOD) to enhance the coordination of multiple tasks. Firstly, we introduce a Task-decoupled Feature Alignment Mechanism (TFAM), which adaptively provides compatible features for different tasks by decoupling spatial information. For classification and localization, the network adaptively samples from category-sensitive regions and boundary-separable regions, respectively. Secondly, we propose a Task-interactive Enhancement Mechanism (TEM), which explicitly combines different task-sensitive features for joint classification score prediction and selects samples with high task consistency for training. Through this interaction mechanism, consistency between tasks is bolstered. We conduct extensive experiments on the COCO, Cityscapes, CrowdHuman and WiderFace datasets to evaluate the performance of TSOD. The results demonstrate that our model outperforms several state-of-the-art detectors, achieving a 2.0 AP improvement over the baseline on COCO minival and a remarkable 50.4 AP at single-model single-scale testing on COCO test-dev. Additionally, our model, equipped with ResNet-50, performs significantly better than other representative detectors on the Cityscapes, CrowdHuman, and WiderFace datasets, showcasing its robustness and generalizability. Our study contributes a new perspective to the design of single-stage object detectors by emphasizing the importance of decoupling and interaction, which is crucial for task coordination. The experimental results validate the effectiveness of our proposed TSOD and its potential as a leading approach in the field. Codes are available at https://github.com/Majiawei/tsod-complete. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
引用
收藏
页码:8149 / 8178
页数:29
相关论文
共 50 条
  • [1] A Refinement Method for Single-Stage Object Detection Based on Progressive Decoupled Task Alignment
    Tang, Xianlun
    Yang, Qiao
    Zhang, Xi
    Deng, Wuquan
    Wang, Huiming
    Gao, Xinbo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3383 - 3394
  • [2] Single-stage Object Detection from Thermal Images
    Mostayed, Ahmed
    Zhou, Xuefu
    4TH INTERDISCIPLINARY CONFERENCE ON ELECTRICS AND COMPUTER, INTCEC 2024, 2024,
  • [3] HTD: Heterogeneous Task Decoupling for Two-Stage Object Detection
    Li, Wuyang
    Chen, Zhen
    Li, Baopu
    Zhang, Dingwen
    Yuan, Yixuan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 9456 - 9469
  • [4] Single-Stage Related Object Detection for Intelligent Industrial Surveillance
    Zhang, Yang
    Bai, Hao
    Xu, Yuan
    He, Yanlin
    Zhu, Qunxiong
    Sheng, Hao
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (04) : 5539 - 5549
  • [5] Improving Single-Stage Object Detectors for Nighttime Pedestrian Detection
    Devi, S.
    Thopalli, Kowshik
    Malarvezhi, P.
    Thiagarajan, Jayaraman J.
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (09)
  • [6] LOSS RESCALING BY UNCERTAINTY INFERENCE FOR SINGLE-STAGE OBJECT DETECTION
    Li, Yan
    Chen, Xiaoyi
    Quan, Li
    Zhang, Ni
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 698 - 702
  • [7] TGADHead: An efficient and accurate task-guided attention-decoupled head for single-stage object detection
    Zuo, Fengyuan
    Liu, Jinhai
    Chen, Zhaolin
    Fu, Mingrui
    Wang, Lei
    KNOWLEDGE-BASED SYSTEMS, 2024, 302
  • [8] Single-stage rotated object detection network based on anchor transformation
    Qu Y.
    Li W.-H.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2022, 52 (01): : 162 - 173
  • [9] Single-Stage Object Detection with Fusion of Point Cloud and Image Feature
    Cai, Zhengyi
    Zhao, Jieyu
    Zhu, Feng
    Computer Engineering and Applications, 2023, 59 (09) : 140 - 149
  • [10] Single-stage object detection algorithm based on optimizing position prediction
    Zhang, Na
    Qi, Xu-Lei
    Bao, Xiao-An
    Wu, Biao
    Tu, Xiao-Mei
    Jin, Yu-Ting
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2022, 56 (04): : 783 - 794