Learning to detect extreme objects for remote sensing images

被引:2
作者
He, Qiang [1 ,2 ]
Li, Meng [1 ,2 ]
Huo, Lianzhi [3 ]
Chen, Linlin [1 ,2 ]
机构
[1] Beijing Univ Civil Engn & Architecture, Sch Sci, Beijing 100044, Peoples R China
[2] Beijing Univ Civil Engn & Architecture, Inst Big Data Modeling & Technol, Beijing 100044, Peoples R China
[3] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
基金
中国国家自然科学基金;
关键词
Extreme aspect ratio; Multi-scale; Object detection; Remote sensing images; Self-similarity; Shape sensitive;
D O I
10.1007/s13042-023-02059-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The scale and shape of objects in remote sensing images varies greatly, and thus the object detection of remote sensing image has always been a relatively challenging task. Focusing on the problem of multi-scale and extreme aspect ratio object detection, this paper proposes the extreme object detection net (EODNet). Firstly, we design a fractal-based feature fusion network (FFFN) by stacking different multilayer elementary convolutional blocks based on the self-similarity feature of fractals, so that the fused feature maps can fully retain the original features to facilitate the extraction of objects of different sizes. The parallel and serial asymmetric convolution block is fitted objects by forming receptive fields that approximate the shape of the object for feature maps used to detect large-sized objects. Following this, the width-height sensitive loss function is designed, which adapts to multi-scale objects and is sensitive to shape. Finally, the Extreme-soft- non maximum suppression (NMS) realizes the removal of extreme overlapping boxes and avoids the accidental deletion of dense object prediction boxes. The proposed method is tested on the NWPU VHR-10 and RSOD datasets, and the results are compared with mainstream object detection algorithms, verifying the effectiveness of the proposed method, as the mAP on the two datasets is 96.1% and 95.4%, respectively.
引用
收藏
页码:2719 / 2736
页数:18
相关论文
共 51 条
[1]  
Bochkovskiy A, 2020, Arxiv, DOI [arXiv:2004.10934, 10.48550/arXiv.2004.10934]
[2]   Soft-NMS - Improving Object Detection With One Line of Code [J].
Bodla, Navaneeth ;
Singh, Bharat ;
Chellappa, Rama ;
Davis, Larry S. .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5562-5570
[3]   Multi-class geospatial object detection and geographic image classification based on collection of part detectors [J].
Cheng, Gong ;
Han, Junwei ;
Zhou, Peicheng ;
Guo, Lei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 98 :119-132
[4]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[5]   CenterNet: Keypoint Triplets for Object Detection [J].
Duan, Kaiwen ;
Bai, Song ;
Xie, Lingxi ;
Qi, Honggang ;
Huang, Qingming ;
Tian, Qi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6568-6577
[6]   An Anchor-Free Method Based on Feature Balancing and Refinement Network for Multiscale Ship Detection in SAR Images [J].
Fu, Jiamei ;
Sun, Xian ;
Wang, Zhirui ;
Fu, Kun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (02) :1331-1344
[7]   Global to Local: A Scale-Aware Network for Remote Sensing Object Detection [J].
Gao, Tao ;
Niu, Qianqian ;
Zhang, Jing ;
Chen, Ting ;
Mei, Shaohui ;
Jubair, Ahmad .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[8]   AugFPN: Improving Multi-scale Feature Learning for Object Detection [J].
Guo, Chaoxu ;
Fan, Bin ;
Zhang, Qian ;
Xiang, Shiming ;
Pan, Chunhong .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :12592-12601
[9]   M-FCCL: Memory-based concept-cognitive learning for dynamic fuzzy data classification and knowledge fusion [J].
Guo, Doudou ;
Xu, Weihua ;
Qian, Yuhua ;
Ding, Weiping .
INFORMATION FUSION, 2023, 100
[10]   Fuzzy-based concept-cognitive learning: An investigation of novel approach to tumor diagnosis analysis [J].
Guo, Doudou ;
Xu, Weihua .
INFORMATION SCIENCES, 2023, 639