Task-Specific Heterogeneous Network for Object Detection in Aerial Images

被引:3
作者
Yu, Ying [1 ]
Yang, Xi [2 ]
Li, Jie [1 ]
Gao, Xinbo [1 ,3 ]
机构
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
[2] Xidian Univ, Sch Telecommun Engn, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
基金
中国国家自然科学基金;
关键词
Aerial images; heterogeneous network; object detection;
D O I
10.1109/TGRS.2023.3311966
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Object detection in aerial images has attracted increasing attention in recent years. Due to the complex background and arbitrary-oriented objects, it is challenging to accurately locate the objects of interest in the images. Many methods have been developed for improving localization accuracy of oriented objects. However, classification and localization tasks require different features due to the unique characteristics of aerial images, which are still not fully considered in previous methods. Therefore, we propose a task-specific heterogeneous (TSH) network for aerial object detection. Specifically, we design an interference-suppression module (ISM) to reduce both the background and interclass interference, which can provide discriminative features for classification. To produce more reliable localization confidence, we propose a joint-learning quality estimation (JQE) module to adaptively combine the classification and regression features, thereby achieving accurate classification and localization quality estimation simultaneously. Moreover, we propose a point-based localization (PBL) branch. In the PBL, the learnable points can effectively adapt to objects with diverse shapes and orientations, and the dynamic information aggregation (DIA) module can enhance the relationships between the dispersible points to promote localization accuracy. The proposed TSH is evaluated extensively on four widely used aerial datasets, demonstrating its state-of-the-art performance. Ablation study and visualizations further verify the effectiveness of our method.
引用
收藏
页数:15
相关论文
共 95 条
[1]   Towards Multi-class Object Detection in Unconstrained Remote Sensing Imagery [J].
Azimi, Seyed Majid ;
Vig, Eleonora ;
Bahmanyar, Reza ;
Koerner, Marco ;
Reinartz, Peter .
COMPUTER VISION - ACCV 2018, PT III, 2019, 11363 :150-165
[2]  
Bochkovskiy A, 2020, Arxiv, DOI arXiv:2004.10934
[3]   MACS-Mar: a real-time remote sensing system for maritime security applications [J].
Brauchle, Joerg ;
Bayer, Steven ;
Hein, Daniel ;
Berger, Ralf ;
Pless, Sebastian .
CEAS SPACE JOURNAL, 2019, 11 (01) :35-44
[4]  
Cao YH, 2020, PROC CVPR IEEE, P11580, DOI 10.1109/CVPR42600.2020.01160
[5]  
Chen K, 2019, Arxiv, DOI arXiv:1906.07155
[6]  
Chen X., 2019, P INT C SENS INSTR I, P1
[7]  
Chen Z., 2020, P EUR C COMP VIS ECC, P195
[8]   DDOD: Dive Deeper into the Disentanglement of Object Detector [J].
Chen, Zehui ;
Yang, Chenhongyi ;
Chang, Jiahao ;
Zhao, Feng ;
Zha, Zheng-Jun ;
Wu, Feng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 :284-298
[9]   Disentangle Your Dense Object Detector [J].
Chen, Zehui ;
Yang, Chenhongyi ;
Li, Qiaofei ;
Zhao, Feng ;
Zha, Zheng-Jun ;
Wu, Feng .
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, :4939-4948
[10]   Anchor-Free Oriented Proposal Generator for Object Detection [J].
Cheng, Gong ;
Wang, Jiabao ;
Li, Ke ;
Xie, Xingxing ;
Lang, Chunbo ;
Yao, Yanqing ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60