A Refined Hybrid Network for Object Detection in Aerial Images

被引:2
作者
Yu, Ying [1 ]
Yang, Xi [2 ]
Li, Jie [1 ]
Gao, Xinbo [1 ,3 ]
机构
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
[2] Xidian Univ, Sch Telecommun Engn, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
基金
中国国家自然科学基金;
关键词
Detectors; Object detection; Feature extraction; Training; Proposals; Task analysis; Transformers; Adaptive feature fusion (AFF); aerial images; dynamic query generation (DQG); hybrid network; mixed query sampling (MQS);
D O I
10.1109/TGRS.2023.3316833
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Aerial object detection is a challenging task that needs to detect objects with large variations in scale and orientation. Previous dense object detectors rely on heuristic nonmaximum suppression (NMS) to filter out redundant detections. This may reduce the recall rate for objects with arbitrary orientations and large aspect ratios. Recently proposed sparse object detectors treat object detection as a set prediction task, effectively eliminating the need for hand-crafted components. However, applying this paradigm directly to aerial images achieves inferior performance. In this article, we develop an effective refined hybrid network (RHNet) for object detection in aerial images. Our method combines the advantages of both dense and sparse detectors, achieving outstanding performance for aerial objects with large variations. Specifically, considering the highly diverse orientations of objects, we first apply a dynamic query generation (DQG) module to produce high-quality oriented queries. These queries can effectively locate the foreground objects in an image, ensuring a high recall rate. Then, the object queries are sent to a query decoder for further refinement. This refinement stage adopts one-to-one matching to eliminate the negative impact caused by NMS. Moreover, an adaptive feature fusion (AFF) module is designed to learn stronger modeling capabilities for rotated objects at different scales. In addition, we propose a practical mixed query sampling (MQS) strategy that uses many-to-one assignment as an auxiliary scheme to help detector training. Extensive experiments conducted on several aerial datasets demonstrate the superior performance of the proposed method in comparison to other state-of-the-art approaches.
引用
收藏
页数:15
相关论文
共 58 条
[51]   CAD-Net: A Context-Aware Detection Network for Objects in Remote Sensing Imagery [J].
Zhang, Gongjie ;
Lu, Shijian ;
Zhang, Wei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (12) :10015-10024
[52]  
Zhang H, 2022, Arxiv, DOI [arXiv:2203.03605, 10.48550/arXiv.2203.03605, DOI 10.48550/ARXIV.2203.03605]
[53]  
Zhang S., 2022, arXiv
[54]   Toward Arbitrary-Oriented Ship Detection With Rotated Region Proposal and Discrimination Networks [J].
Zhang, Zenghui ;
Guo, Weiwei ;
Zhu, Shengnan ;
Yu, Wenxian .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2018, 15 (11) :1745-1749
[55]   IoU Loss for 2D/3D Object Detection [J].
Zhou, Dingfu ;
Fang, Jin ;
Song, Xibin ;
Guan, Chenye ;
Yin, Junbo ;
Dai, Yuchao ;
Yang, Ruigang .
2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, :85-94
[56]  
Zhou XY, 2019, Arxiv, DOI [arXiv:1904.07850, 10.48550/arXiv.1904.07850]
[57]  
Zhu HG, 2015, IEEE IMAGE PROC, P3735, DOI 10.1109/ICIP.2015.7351502
[58]  
Zhu XZ, 2021, Arxiv, DOI [arXiv:2010.04159, 10.48550/arXiv.2010.04159, DOI 10.48550/ARXIV.2010.04159]