EOOD: End-to-end oriented object detection

被引:0
作者
Zhang, Caiguang [1 ,2 ]
Chen, Zilong [1 ]
Xiong, Boli [1 ]
Ji, Kefeng [1 ]
Kuang, Gangyao
机构
[1] Natl Univ Def Technol, State Key Lab Complex Electromagnet Environm Effec, Changsha 410073, Peoples R China
[2] Shanghai Radio Equipment Res Inst, Shanghai 201100, Peoples R China
基金
中国国家自然科学基金;
关键词
Oriented object detection; Label assignment; End-to-end; Non-maximum suppression;
D O I
10.1016/j.neucom.2024.129251
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, significant advancements have been made in oriented object detectors based on convolutional networks. However, these models often rely on hand-designed post-processing techniques such as non- maximum suppression (NMS) to suppress redundant predictions, which impedes establishing an end-to-end detection system. In this paper, we explore how to build an end-to-end oriented detector? Firstly, our research demonstrates that prediction-oriented one-to-one label assignment (POLA) can significantly reduce the performance gap between using and not using NMS and is an essential component of end-to-end detection. Additionally, the proposed Negative Sample Reweighted Focal Loss ( N'FL ) can widen the classification confidence gap between positive and negative samples, separate positive samples from noise, and guarantee high classification scores for the single positive sample. Finally, in order to address the lack of supervision caused by one-to-one label assignment, a joint training pipeline is designed that unites multiple auxiliary heads and takes advantage of one-to-many label assignment on supervision to improve feature representation and increase performance. During inference, Only the main head trained with one-to-one label assignment is involved in prediction. Extensive experiments on publicly available datasets DOTA and DIOR-R demonstrate that the proposed EOOD exhibits significant performance improvements over baseline models and has the potential to overcome the limitations of NMS. Code is available at https://github.com/zhangiguang/EOOD.
引用
收藏
页数:15
相关论文
共 70 条
[1]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[2]  
Chen Q, 2023, Arxiv, DOI arXiv:2207.13085
[3]  
Cheng G, 2022, Arxiv, DOI arXiv:2110.01931
[4]   Dual-Aligned Oriented Detector [J].
Cheng, Gong ;
Yao, Yanqing ;
Li, Shengyang ;
Li, Ke ;
Xie, Xingxing ;
Wang, Jiabao ;
Yao, Xiwen ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[5]   AO2-DETR: Arbitrary-Oriented Object Detection Transformer [J].
Dai, Linhui ;
Liu, Hong ;
Tang, Hao ;
Wu, Zhiwei ;
Song, Pinhao .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (05) :2342-2356
[6]   Learning RoI Transformer for Oriented Object Detection in Aerial Images [J].
Ding, Jian ;
Xue, Nan ;
Long, Yang ;
Xia, Gui-Song ;
Lu, Qikai .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2844-2853
[7]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[8]   Convex-Hull Feature Adaptation for Oriented and Densely Packed Object Detection [J].
Guo, Zonghao ;
Zhang, Xiaosong ;
Liu, Chang ;
Ji, Xiangyang ;
Jiao, Jianbin ;
Ye, Qixiang .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (08) :5252-5265
[9]   Align Deep Features for Oriented Object Detection [J].
Han, Jiaming ;
Ding, Jian ;
Li, Jie ;
Xia, Gui-Song .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[10]   ReDet: A Rotation-equivariant Detector for Aerial Object Detection [J].
Han, Jiaming ;
Ding, Jian ;
Xue, Nan ;
Xia, Gui-Song .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :2785-2794