Attention and Feature Fusion SSD for Remote Sensing Object Detection

被引:173
作者
Lu, Xiaocong [1 ]
Ji, Jian [1 ]
Xing, Zhiqi [1 ]
Miao, Qiguang [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional neural network; dual-path attention mechanism; feature fusion; image processing; object detection; remote sensing;
D O I
10.1109/TIM.2021.3052575
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Object detection is a basic topic in the field of remote sensing. However, remote sensing images usually suffer in complicated backgrounds, object scale variations, and small objects, which make remote sensing object detection still difficult. Although the existing two-stage methods have higher accuracy, their detection speed is slow. While the one-stage object detection algorithms such as single-shot detector (SSD) and YOLO, although they can achieve real-time detection, they have poor detection performance, especially for small objects. In this article, in order to further improve the remote sensing object detection performance of one-stage methods, we propose an end-to-end network named attention and feature fusion SSD. First, a multilayer feature fusion structure is designed to enhance the semantic information of the shallow features. Next, a dual-path attention module is introduced to screen the feature information. This module uses spatial attention and channel attention to suppress the background noise and highlight the key feature. Then, the feature representation ability of the network is further enhanced by a multiscale receptive field module. Finally, the loss function is optimized to alleviate the imbalance between the positive and negative samples. The experimental results on the DOTA and NWPU VHR-10 data sets verify the effectiveness of our method.
引用
收藏
页数:9
相关论文
共 33 条
[1]   Feature-Fused SSD: Fast Detection for Small Objects [J].
Cao, Guimei ;
Xie, Xuemei ;
Yang, Wenzhe ;
Liao, Quan ;
Shi, Guangming ;
Wu, Jinjian .
NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
[2]   D2Det: Towards High Quality Object Detection and Instance Segmentation [J].
Cao, Jiale ;
Cholakkal, Hisham ;
Anwer, Rao Muhammad ;
Khan, Fahad Shahbaz ;
Pang, Yanwei ;
Shao, Ling .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11482-11491
[3]   Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images [J].
Cheng, Gong ;
Zhou, Peicheng ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12) :7405-7415
[4]   Multi-class geospatial object detection and geographic image classification based on collection of part detectors [J].
Cheng, Gong ;
Han, Junwei ;
Zhou, Peicheng ;
Guo, Lei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 98 :119-132
[5]  
Fu C.-Y., 2017, ARXIV
[6]   Rotation-aware and multi-scale convolutional neural network for object detection in remote sensing images [J].
Fu, Kun ;
Chang, Zhonghan ;
Zhang, Yue ;
Xu, Guangluan ;
Zhang, Keshu ;
Sun, Xian .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 161 :294-308
[7]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[8]  
Hu J., P IEEE C COMP VIS PA, P7132
[9]  
Jaderberg M, 2015, ADV NEUR IN, V28
[10]   R3-Net: A Deep Network for Multioriented Vehicle Detection in Aerial Images and Videos [J].
Li, Qingpeng ;
Mou, Lichao ;
Xu, Qizhi ;
Zhang, Yun ;
Zhu, Xiao Xiang .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (07) :5028-5042