Decoupling and Interaction: task coordination in single-stage object detection

被引:0
|
作者
Ma J.-W. [1 ]
Tian S. [1 ]
Man H. [2 ]
Chen S.-L. [1 ]
Qin J. [1 ]
Yin X.-C. [1 ]
机构
[1] Department of Computer Science and Technology, University of Science and Technology Beijing, No. 30 Xueyuan Road, Haidian District, Beijing
[2] School of Foreign Studies, University of Science and Technology Beijing, No. 30 Xueyuan Road, Haidian District, Beijing
关键词
Feature decoupling; Information interaction; Object detection; Task coordination;
D O I
10.1007/s11042-024-19257-x
中图分类号
学科分类号
摘要
In the field of computer vision, general single-stage object detection methods employ two individual subnets within detection head, serving classification and localization purposes respectively. However, the lack of explicit modeling for distinctions and associations poses challenges for aligning the spatial feature perception of these two tasks, consequently leading to sub-optimal detection performance. Although some methods utilize classification to evaluate localization, it is a compromise rather than multi-task optimization. In this paper, we propose a Task-coordinated Single-stage Object Detector (TSOD) to enhance the coordination of multiple tasks. Firstly, we introduce a Task-decoupled Feature Alignment Mechanism (TFAM), which adaptively provides compatible features for different tasks by decoupling spatial information. For classification and localization, the network adaptively samples from category-sensitive regions and boundary-separable regions, respectively. Secondly, we propose a Task-interactive Enhancement Mechanism (TEM), which explicitly combines different task-sensitive features for joint classification score prediction and selects samples with high task consistency for training. Through this interaction mechanism, consistency between tasks is bolstered. We conduct extensive experiments on the COCO, Cityscapes, CrowdHuman and WiderFace datasets to evaluate the performance of TSOD. The results demonstrate that our model outperforms several state-of-the-art detectors, achieving a 2.0 AP improvement over the baseline on COCO minival and a remarkable 50.4 AP at single-model single-scale testing on COCO test-dev. Additionally, our model, equipped with ResNet-50, performs significantly better than other representative detectors on the Cityscapes, CrowdHuman, and WiderFace datasets, showcasing its robustness and generalizability. Our study contributes a new perspective to the design of single-stage object detectors by emphasizing the importance of decoupling and interaction, which is crucial for task coordination. The experimental results validate the effectiveness of our proposed TSOD and its potential as a leading approach in the field. Codes are available at https://github.com/Majiawei/tsod-complete. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
引用
收藏
页码:8149 / 8178
页数:29
相关论文
共 50 条
  • [31] A Rich Feature Fusion Single-Stage Object Detector
    Zhang, Kai
    Musha, Yasenjiang
    Si, Binglong
    IEEE ACCESS, 2020, 8 : 204352 - 204359
  • [32] Refined single-stage object detection deep-learning technique for chilli leaf disease detection
    Naik, Bhookya Nageswararao
    Ramanathan, Malmathanraj
    Ponnusamy, Palanisamy
    JOURNAL OF ELECTRONIC IMAGING, 2023, 32 (03)
  • [33] Mutton Multipartite Real-time Classification and Detection Based on Single-stage Object Detection Algorithm
    Zhao S.
    Wang S.
    Hao G.
    Zhang Y.
    Yang H.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2022, 53 (03): : 400 - 411
  • [34] A Single-Stage Multimode Extended Interaction Circuit
    Chang, Zhiwei
    Shang, Wenli
    Cao, Zhong
    Shu, Guoxiang
    Tian, Yanyan
    He, Wenlong
    IEEE TRANSACTIONS ON PLASMA SCIENCE, 2024, 52 (07) : 2686 - 2691
  • [35] FDNet: Feature decoupling for single-stage pose estimation in complex scenes
    Wang, Qianqian
    Liu, Qiong
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 98
  • [36] Identification and location of infrared image for substation equipment based on single-stage object detection algorithm
    Zhu H.
    Niu Z.
    Huang K.
    Tang W.
    Dianli Zidonghua Shebei/Electric Power Automation Equipment, 2021, 41 (08): : 217 - 224
  • [37] A Single-Stage 3D Object Detection Method Based on Sparse Attention Mechanism
    Jia, Songche
    Zhang, Zhenyu
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT III, 2024, 14427 : 414 - 425
  • [38] Rethinking IoU-based Optimization for Single-stage 3D Object Detection
    Sheng, Hualian
    Cai, Sijia
    Zhao, Na
    Deng, Bing
    Huang, Jianqiang
    Hua, Xian-Sheng
    Zhao, Min-Jian
    Lee, Gim Hee
    COMPUTER VISION, ECCV 2022, PT IX, 2022, 13669 : 544 - 561
  • [39] Semi-supervised object detection based on single-stage detector for thighbone fracture localization
    Wei, Jinman
    Yao, Jinkun
    Zhang, Guoshan
    Guan, Bin
    Zhang, Yueming
    Wang, Shaoquan
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (07): : 3447 - 3461
  • [40] Semi-supervised object detection based on single-stage detector for thighbone fracture localization
    Jinman Wei
    Jinkun Yao
    Guoshan Zhang
    Bin Guan
    Yueming Zhang
    Shaoquan Wang
    Neural Computing and Applications, 2024, 36 : 3447 - 3461