Laplacian Feature Pyramid Network for Object Detection in VHR Optical Remote Sensing Images

Cited by: 77
Authors
Zhang, Wenhua [1]
Jiao, Licheng [1]
Li, Yuxuan [1]
Huang, Zhongjian [1]
Wang, Haoran [1]
Affiliations
[1] Xidian Univ, Sch Artificial Intelligence, Minist Educ, Key Lab Intelligent Percept & Image Understanding, Xian 710071, Peoples R China
Source
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2022 / Vol. 60
Funding
National Natural Science Foundation of China;
Keywords
Convolutional neural networks (CNNs); feature pyramid (FP) networks; Laplacian FP; object detection; very high resolution optical remote sensing (VHR-ORS) images; SHIP DETECTION; MODEL;
DOI
10.1109/TGRS.2021.3072488
Chinese Library Classification number
P3 [Geophysics]; P59 [Geochemistry];
Discipline classification code
0708 ; 070902 ;
Abstract
In addition to multiscale features, high-frequency features are also crucial for identifying many objects in object detection for very high resolution optical remote sensing (VHR-ORS) images, but they have not yet been considered. Because the Laplacian pyramid contains high-frequency information at each level, we propose a Laplacian feature pyramid (FP) network (LFPN) that considers both low-frequency and high-frequency features within an FP structure to improve object detection performance on VHR-ORS images. FP-based structures represent multiscale features efficiently. However, general FP-based structures do not specifically consider high-frequency features. Such features are important for distinguishing many ground objects with sufficient detail. For example, texture features are critical for distinguishing basketball_court from tennis_court. LFPN consists of a bottom-up pathway, a Laplacian pathway, and a fusion pathway, which generate a low-frequency pyramid, a high-frequency pyramid, and a compound pyramid, respectively. The bottom-up pathway follows the computation flow of the backbone convolutional neural network (CNN), as in general FP-based structures. The Laplacian pathway extracts the high-frequency features of objects through a trainable Laplacian operator. Finally, the low-frequency and high-frequency FPs are fused efficiently to generate the compound pyramid. To evaluate LFPN, we embed it into both two-stage object detection systems (T-LFPN) and single-stage object detection systems (S-LFPN). Experiments on the challenging public ten-class data set NWPU VHR-10 demonstrate the superior performance of LFPN in both T-LFPN and S-LFPN systems and the state-of-the-art performance of LFPN-based detectors.
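As background for the low-frequency and high-frequency pyramids mentioned in the abstract, the following is a minimal sketch of a classic, non-trainable Laplacian pyramid on a small 2-D grid. It is not the LFPN architecture: the record does not describe the trainable Laplacian operator or the fusion pathway in detail. All function names are hypothetical, and two simplifications are assumed: 2x2 mean pooling replaces Gaussian blur plus subsampling, and nearest-neighbour repetition replaces interpolation.

```python
# Illustrative sketch only: a classic (non-trainable) Laplacian pyramid.
# Assumptions: 2x2 mean pooling stands in for Gaussian blur + subsampling,
# nearest-neighbour repetition stands in for interpolation, and the image
# height and width are divisible by 2 ** levels. This is NOT the paper's LFPN.

def downsample(img):
    """Halve each dimension by averaging non-overlapping 2x2 blocks."""
    h, w = len(img), len(img[0])
    return [
        [(img[r][c] + img[r][c + 1] + img[r + 1][c] + img[r + 1][c + 1]) / 4.0
         for c in range(0, w, 2)]
        for r in range(0, h, 2)
    ]

def upsample(img):
    """Double each dimension by repeating every value in a 2x2 block."""
    out = []
    for row in img:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out

def subtract(a, b):
    """Element-wise a - b for two grids of the same shape."""
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def add(a, b):
    """Element-wise a + b for two grids of the same shape."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def laplacian_pyramid(img, levels):
    """Return (high_frequency_bands, low_frequency_residual)."""
    bands = []
    current = img
    for _ in range(levels):
        smaller = downsample(current)
        # The band keeps the detail that is lost by going to the coarser level.
        bands.append(subtract(current, upsample(smaller)))
        current = smaller
    return bands, current

def reconstruct(bands, residual):
    """Invert laplacian_pyramid by adding the bands back, coarse to fine."""
    current = residual
    for band in reversed(bands):
        current = add(upsample(current), band)
    return current

image = [
    [1.0, 1.0, 5.0, 5.0],
    [1.0, 1.0, 5.0, 5.0],
    [2.0, 4.0, 0.0, 8.0],
    [6.0, 8.0, 8.0, 0.0],
]
bands, residual = laplacian_pyramid(image, levels=2)
restored = reconstruct(bands, residual)
```

In this sketch, flat regions of `image` produce zeros in the finest band while textured regions produce non-zero values, which is the sense in which each Laplacian level carries high-frequency information. Adding the bands back to the low-frequency residual recovers the input.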
Pages: 14