A Refined Hybrid Network for Object Detection in Aerial Images

被引:2
作者
Yu, Ying [1 ]
Yang, Xi [2 ]
Li, Jie [1 ]
Gao, Xinbo [1 ,3 ]
机构
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
[2] Xidian Univ, Sch Telecommun Engn, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
基金
中国国家自然科学基金;
关键词
Detectors; Object detection; Feature extraction; Training; Proposals; Task analysis; Transformers; Adaptive feature fusion (AFF); aerial images; dynamic query generation (DQG); hybrid network; mixed query sampling (MQS);
D O I
10.1109/TGRS.2023.3316833
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Aerial object detection is a challenging task that needs to detect objects with large variations in scale and orientation. Previous dense object detectors rely on heuristic nonmaximum suppression (NMS) to filter out redundant detections. This may reduce the recall rate for objects with arbitrary orientations and large aspect ratios. Recently proposed sparse object detectors treat object detection as a set prediction task, effectively eliminating the need for hand-crafted components. However, applying this paradigm directly to aerial images achieves inferior performance. In this article, we develop an effective refined hybrid network (RHNet) for object detection in aerial images. Our method combines the advantages of both dense and sparse detectors, achieving outstanding performance for aerial objects with large variations. Specifically, considering the highly diverse orientations of objects, we first apply a dynamic query generation (DQG) module to produce high-quality oriented queries. These queries can effectively locate the foreground objects in an image, ensuring a high recall rate. Then, the object queries are sent to a query decoder for further refinement. This refinement stage adopts one-to-one matching to eliminate the negative impact caused by NMS. Moreover, an adaptive feature fusion (AFF) module is designed to learn stronger modeling capabilities for rotated objects at different scales. In addition, we propose a practical mixed query sampling (MQS) strategy that uses many-to-one assignment as an auxiliary scheme to help detector training. Extensive experiments conducted on several aerial datasets demonstrate the superior performance of the proposed method in comparison to other state-of-the-art approaches.
引用
收藏
页数:15
相关论文
共 58 条
[1]   Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery [J].
Azimi, Seyed Majid ;
Vig, Eleonora ;
Bahmanyar, Reza ;
Koerner, Marco ;
Reinartz, Peter .
COMPUTER VISION - ACCV 2018, PT III, 2019, 11363 :150-165
[2]   End-to-End Object Detection with Transformers [J].
Carion, Nicolas ;
Massa, Francisco ;
Synnaeve, Gabriel ;
Usunier, Nicolas ;
Kirillov, Alexander ;
Zagoruyko, Sergey .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229
[3]  
Chen Q, 2023, Arxiv, DOI arXiv:2207.13085
[4]   PIoU Loss: Towards Accurate Oriented Object Detection in Complex Environments [J].
Chen, Zhiming ;
Chen, Kean ;
Lin, Weiyao ;
See, John ;
Yu, Hui ;
Ke, Yan ;
Yang, Cong .
COMPUTER VISION - ECCV 2020, PT V, 2020, 12350 :195-211
[5]  
Dai LH, 2022, Arxiv, DOI arXiv:2205.12785
[6]   Learning RoI Transformer for Oriented Object Detection in Aerial Images [J].
Ding, Jian ;
Xue, Nan ;
Long, Yang ;
Xia, Gui-Song ;
Lu, Qikai .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2844-2853
[7]  
Everingham M., 2007, P IEEE INT C COMP VI
[8]   OTA: Optimal Transport Assignment for Object Detection [J].
Ge, Zheng ;
Liu, Songtao ;
Liu, Zeming ;
Yoshie, Osamu ;
Sun, Jian .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :303-312
[9]   Align Deep Features for Oriented Object Detection [J].
Han, Jiaming ;
Ding, Jian ;
Li, Jie ;
Xia, Gui-Song .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[10]   ReDet: A Rotation-equivariant Detector for Aerial Object Detection [J].
Han, Jiaming ;
Ding, Jian ;
Xue, Nan ;
Xia, Gui-Song .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :2785-2794