Nested Network With Two-Stream Pyramid for Salient Object Detection in Optical Remote Sensing Images

被引:268
作者
Li, Chongyi [1 ]
Cong, Runmin [2 ,3 ]
Hou, Junhui [1 ,4 ]
Zhang, Sanyi [5 ]
Qian, Yue [1 ]
Kwong, Sam [1 ,4 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong 999077, Peoples R China
[2] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[3] Beijing Jiaotong Univ, Beijing Key Lab Adv Informat Sci & Network Techno, Beijing 100044, Peoples R China
[4] City Univ Hong Kong, Shenzhen Res Inst, Shenzhen 51800, Peoples R China
[5] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2019年 / 57卷 / 11期
基金
中国国家自然科学基金;
关键词
Object detection; Optical imaging; Saliency detection; Optical sensors; Feature extraction; Optical network units; Nested connections; optical remote sensing images (RSIs); salient object detection; two-stream pyramid module;
D O I
10.1109/TGRS.2019.2925070
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Arising from the various object types and scales, diverse imaging orientations, and cluttered backgrounds in optical remote sensing image (RSI), it is difficult to directly extend the success of salient object detection for nature scene image to the optical RSI. In this paper, we propose an end-to-end deep network called LV-Net based on the shape of network architecture, which detects salient objects from optical RSIs in a purely data-driven fashion. The proposed LV-Net consists of two key modules, i.e., a two-stream pyramid module (L-shaped module) and an encoder-decoder module with nested connections (V-shaped module). Specifically, the L-shaped module extracts a set of complementary information hierarchically by using a two-stream pyramid structure, which is beneficial to perceiving the diverse scales and local details of salient objects. The V-shaped module gradually integrates encoder detail features with decoder semantic features through nested connections, which aims at suppressing the cluttered backgrounds and highlighting the salient objects. In addition, we construct the first publicly available optical RSI data set for salient object detection, including 800 images with varying spatial resolutions, diverse saliency types, and pixel-wise ground truth. Experiments on this benchmark data set demonstrate that the proposed method outperforms the state-of-the-art salient object detection methods both qualitatively and quantitatively.
引用
收藏
页码:9156 / 9166
页数:11
相关论文
共 55 条
[1]  
Achanta R, 2009, PROC CVPR IEEE, P1597, DOI 10.1109/CVPRW.2009.5206596
[2]  
Borji A, 2015, REMOTE SENS
[3]   Salient Object Detection: A Benchmark [J].
Borji, Ali ;
Sihite, Dicky N. ;
Itti, Laurent .
COMPUTER VISION - ECCV 2012, PT II, 2012, 7573 :414-429
[4]   Saliency-Aware Nonparametric Foreground Annotation Based on Weakly Labeled Data [J].
Cao, Xiaochun ;
Zhang, Changqing ;
Fu, Huazhu ;
Guo, Xiaojie ;
Tian, Qi .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (06) :1253-1265
[5]   Remote Sensing Image Scene Classification: Benchmark and State of the Art [J].
Cheng, Gong ;
Han, Junwei ;
Lu, Xiaoqiang .
PROCEEDINGS OF THE IEEE, 2017, 105 (10) :1865-1883
[6]   Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images [J].
Cheng, Gong ;
Zhou, Peicheng ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12) :7405-7415
[7]   A survey on object detection in optical remote sensing images [J].
Cheng, Gong ;
Han, Junwei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 117 :11-28
[8]   Multi-class geospatial object detection and geographic image classification based on collection of part detectors [J].
Cheng, Gong ;
Han, Junwei ;
Zhou, Peicheng ;
Guo, Lei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2014, 98 :119-132
[9]  
Cheng MM, 2021, INT J COMPUT VISION, V129, P2622, DOI [10.1109/ICCV.2017.487, 10.1007/s11263-021-01490-8]
[10]   Review of Visual Saliency Detection With Comprehensive Information [J].
Cong, Runmin ;
Lei, Jianjun ;
Fu, Huazhu ;
Cheng, Ming-Ming ;
Lin, Weisi ;
Huang, Qingming .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (10) :2941-2959