Structure-Adaptive Oriented Object Detection Network for Remote Sensing Images

被引:1
作者
Xi, Yifan [1 ,2 ]
Lu, Ting [1 ,2 ]
Kang, Xudong [1 ,2 ]
Li, Shutao [1 ,2 ]
机构
[1] Hunan Univ, Coll Elect & Informat Engn, Changsha 410082, Peoples R China
[2] Hunan Univ, Key Lab Visual Percept & Artificial Intelligence H, Changsha 410082, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金;
关键词
Location awareness; Remote sensing; Estimation; Uncertainty; Predictive models; Object detection; Visualization; Oriented object detection (OOD); remote sensing image; rotation angle encoder (RAE); structure-adaptive confidence estimation (SACE); structure-adaptive label assignment (SALA);
D O I
10.1109/TGRS.2024.3432878
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Nowadays, high-resolution remote sensing images provide rich data sources and deep learning models show powerful feature representation capability for remote sensing object detection. However, due to the complex object structure as well as the changeable rotation angle, how to efficiently estimate the oriented bounding box regarding the accurate location of objects is still an open issue. Focused on this, a new one-stage structure-adaptive oriented object detection (SOOD) network is proposed, in this article. First, we designed a new rotation angle encoder (RAE), where an angle coordinate system is adopted and periodic angle correction is conducted. Different from the traditional longe-edge definition for angle estimation, the RAE can mitigate boundary discontinuity and square-like problems. Then, structure-adaptive label assignment (SALA) and confidence estimation (SACE) are introduced, to locate the position of objects more accurately. On the one hand, the anchor box determines the label assignment according to the affiliation relationship between the center point and the object's inner ellipse boundary. By constraining the ellipse boundary and employing non-parametric label assignment, high-quality anchor boxes are initially selected, and low-quality anchor boxes are suppressed. On the other hand, the integration of intersection over union (IoU) prediction and uncertainty prediction constructs a quality evaluation function to guide. In this manner, this function dynamically evaluates the localization and classification ability of each prediction box. Extensive experiments on publicly available datasets such as DOTA1.0, DOTA1.5, DIOR, and MAR20 demonstrate the effectiveness of the proposed model. The source code will be available at https://github.com/fan609/SOOD.
引用
收藏
页数:13
相关论文
共 60 条
[21]   Path Aggregation Network for Instance Segmentation [J].
Liu, Shu ;
Qi, Lu ;
Qin, Haifang ;
Shi, Jianping ;
Jia, Jiaya .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :8759-8768
[22]   Center-Boundary Dual Attention for Oriented Object Detection in Remote Sensing Images [J].
Liu, Shuai ;
Zhang, Lu ;
Lu, Huchuan ;
He, You .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[23]   SSD: Single Shot MultiBox Detector [J].
Liu, Wei ;
Anguelov, Dragomir ;
Erhan, Dumitru ;
Szegedy, Christian ;
Reed, Scott ;
Fu, Cheng-Yang ;
Berg, Alexander C. .
COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 :21-37
[24]   A ConvNet for the 2020s [J].
Liu, Zhuang ;
Mao, Hanzi ;
Wu, Chao-Yuan ;
Feichtenhofer, Christoph ;
Darrell, Trevor ;
Xie, Saining .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :11966-11976
[25]  
Lyu C, 2022, Arxiv, DOI [arXiv:2212.07784, 10.48550/arXiv.2212.07784abs/2212.07784, 10.48550/arXiv.2212.07784]
[26]   Arbitrary-Oriented Scene Text Detection via Rotation Proposals [J].
Ma, Jianqi ;
Shao, Weiyuan ;
Ye, Hao ;
Wang, Li ;
Wang, Hong ;
Zheng, Yingbin ;
Xue, Xiangyang .
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 20 (11) :3111-3122
[27]   Task interleaving and orientation estimation for high-precision oriented object detection in aerial images [J].
Ming, Qi ;
Miao, Lingjuan ;
Zhou, Zhiqiang ;
Song, Junjie ;
Dong, Yunpeng ;
Yang, Xue .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 196 :241-255
[28]   Dynamic Refinement Network for Oriented and Densely Packed Object Detection [J].
Pan, Xingjia ;
Ren, Yuqiang ;
Sheng, Kekai ;
Dong, Weiming ;
Yuan, Haolei ;
Guo, Xiaowei ;
Ma, Chongyang ;
Xu, Changsheng .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11204-11213
[29]   RSDet plus plus : Point-Based Modulated Loss for More Accurate Rotated Object Detection [J].
Qian, Wen ;
Yang, Xue ;
Peng, Silong ;
Zhang, Xiujuan ;
Yan, Junchi .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) :7869-7879
[30]  
Qian W, 2021, AAAI CONF ARTIF INTE, V35, P2458