Nested Network With Two-Stream Pyramid for Salient Object Detection in Optical Remote Sensing Images

被引:268
作者
Li, Chongyi [1 ]
Cong, Runmin [2 ,3 ]
Hou, Junhui [1 ,4 ]
Zhang, Sanyi [5 ]
Qian, Yue [1 ]
Kwong, Sam [1 ,4 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong 999077, Peoples R China
[2] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[3] Beijing Jiaotong Univ, Beijing Key Lab Adv Informat Sci & Network Techno, Beijing 100044, Peoples R China
[4] City Univ Hong Kong, Shenzhen Res Inst, Shenzhen 51800, Peoples R China
[5] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2019年 / 57卷 / 11期
基金
中国国家自然科学基金;
关键词
Object detection; Optical imaging; Saliency detection; Optical sensors; Feature extraction; Optical network units; Nested connections; optical remote sensing images (RSIs); salient object detection; two-stream pyramid module;
D O I
10.1109/TGRS.2019.2925070
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Arising from the various object types and scales, diverse imaging orientations, and cluttered backgrounds in optical remote sensing image (RSI), it is difficult to directly extend the success of salient object detection for nature scene image to the optical RSI. In this paper, we propose an end-to-end deep network called LV-Net based on the shape of network architecture, which detects salient objects from optical RSIs in a purely data-driven fashion. The proposed LV-Net consists of two key modules, i.e., a two-stream pyramid module (L-shaped module) and an encoder-decoder module with nested connections (V-shaped module). Specifically, the L-shaped module extracts a set of complementary information hierarchically by using a two-stream pyramid structure, which is beneficial to perceiving the diverse scales and local details of salient objects. The V-shaped module gradually integrates encoder detail features with decoder semantic features through nested connections, which aims at suppressing the cluttered backgrounds and highlighting the salient objects. In addition, we construct the first publicly available optical RSI data set for salient object detection, including 800 images with varying spatial resolutions, diverse saliency types, and pixel-wise ground truth. Experiments on this benchmark data set demonstrate that the proposed method outperforms the state-of-the-art salient object detection methods both qualitatively and quantitatively.
引用
收藏
页码:9156 / 9166
页数:11
相关论文
共 55 条
[21]   Deeply Supervised Salient Object Detection with Short Connections [J].
Hou, Qibin ;
Cheng, Ming-Ming ;
Hu, Xiaowei ;
Borji, Ali ;
Tu, Zhuowen ;
Torr, Philip .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5300-5309
[22]  
Hu XW, 2018, AAAI CONF ARTIF INTE, P6943
[23]   Salient Object Detection via Multiple Instance Learning [J].
Huang, Fang ;
Qi, Jinqing ;
Lu, Huchuan ;
Zhang, Lihe ;
Ruan, Xiang .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (04) :1911-1922
[24]  
Huang G., 2017, P CVPR, P4548
[25]   A video summarization approach based on the emulation of bottom-up mechanisms of visual attention [J].
Jacob, Hugo ;
Padua, Flavio L. C. ;
Lacerda, Anisio ;
Pereira, Adriano C. M. .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2017, 49 (02) :193-211
[26]   Salient Region Detection via High-Dimensional Color Transform and Local Spatial Support [J].
Kim, Jiwhan ;
Han, Dongyoon ;
Tai, Yu-Wing ;
Kim, Junmo .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (01) :9-23
[27]  
Kingma DP, 2014, ARXIV
[28]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[29]   ELD-Net: An Efficient Deep Learning Architecture for Accurate Saliency Detection [J].
Lee, Gayoung ;
Tai, Yu-Wing ;
Kim, Junmo .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (07) :1599-1610
[30]  
Li CY, 2015, PROC CVPR IEEE, P2710, DOI 10.1109/CVPR.2015.7298887