A dynamic label assignment strategy for one-stage detectors

Cited by: 2

Authors:

Zhang, Yi [1]

Luo, Chen [1]

Affiliations:

[1] Sichuan Univ, Dept Comp Sci, Chengdu, Peoples R China

Source:

NEUROCOMPUTING | 2024 / Volume 577

Keywords:

Object detection; Label assignment; Classification; Localization; NETWORK

DOI:

10.1016/j.neucom.2024.127383

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

In object detection, label assignment (LA) is an important step in determining detection accuracy: it assigns positive and negative labels to the training samples so that the prediction loss can be calculated. How to realize a more reasonable LA has therefore always been a major concern for computer vision experts. Considering the difficulty current LA strategies have in adapting to different scenarios, and the lack of interaction between the classification and localization tasks, we propose a novel dynamic LA scheme for one-stage object detectors. First, the quality of each anchor box is computed from the outputs of both the classification and localization branches; these qualities are used to assign positive and negative samples and are adjusted during training. Second, the positive and negative samples for the classification and localization tasks are decoupled, and an independent LA strategy is developed for each task. Finally, the interaction between the two network heads is enhanced through multiple shared convolution blocks so that the two tasks are completed in a more collaborative manner. Extensive experiments on MS COCO, PASCAL VOC, and CrowdHuman support our design and analysis. With the newly introduced LA strategy, we improve the detection accuracy of existing one-stage detectors.
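The abstract does not give the quality formula or the assignment rule, so the following is only a minimal sketch of the general idea of scoring each anchor from both its classification score and its localization overlap, then picking positives per task. The product form `score**alpha * iou**beta` is borrowed from task-aligned assignment as used in TOOD-style detectors, not taken from this paper. The function names, the exponents, and the per-task `top_k` values are hypothetical.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def anchor_quality(cls_score: float, pred_box: Box, gt_box: Box,
                   alpha: float = 1.0, beta: float = 2.0) -> float:
    """Assumed quality metric: combines both head outputs (not the paper's formula)."""
    return (cls_score ** alpha) * (iou(pred_box, gt_box) ** beta)


def assign_labels(cls_scores: List[float], pred_boxes: List[Box],
                  gt_box: Box, top_k: int) -> List[int]:
    """Mark the top_k highest-quality anchors as positive (1), the rest negative (0)."""
    qualities = [anchor_quality(s, b, gt_box)
                 for s, b in zip(cls_scores, pred_boxes)]
    ranked = sorted(range(len(qualities)),
                    key=lambda i: qualities[i], reverse=True)
    positives = {i for i in ranked[:top_k] if qualities[i] > 0}
    return [1 if i in positives else 0 for i in range(len(qualities))]


# Toy example: one ground-truth box and four predicted boxes.
gt = (0.0, 0.0, 10.0, 10.0)
pred_boxes = [(0.0, 0.0, 10.0, 10.0), (1.0, 1.0, 11.0, 11.0),
              (20.0, 20.0, 30.0, 30.0), (0.0, 0.0, 5.0, 10.0)]
cls_scores = [0.9, 0.6, 0.8, 0.3]

# Decoupled assignment: each task gets its own (hypothetical) top_k.
cls_labels = assign_labels(cls_scores, pred_boxes, gt, top_k=1)
loc_labels = assign_labels(cls_scores, pred_boxes, gt, top_k=2)
print(cls_labels, loc_labels)
```

Because the qualities depend on the current predictions, recomputing them at every training step makes the assignment dynamic. Using a different `top_k` per task is just one simple way to illustrate decoupled positive sets.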


Pages: 11
