Heterogeneous Feature Collaboration Network for Salient Object Detection in Optical Remote Sensing Images

被引:10
作者
Liu, Yutong [1 ]
Xu, Mingzhu [1 ]
Xiao, Tianxiang [1 ]
Tang, Haoyu [1 ]
Hu, Yupeng [1 ]
Nie, Liqiang [2 ]
机构
[1] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
[2] Harbin Inst Technol Shenzhen, Sch Comp Sci & Technol, Shenzhen 518055, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金;
关键词
Optical imaging; Optical sensors; Feature extraction; Decoding; Transformers; Topology; Context modeling; Attention mechanism; feature alignment; heterogeneous feature fusion; optical remote sensing images; salient object detection (SOD);
D O I
10.1109/TGRS.2024.3439401
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In recent years, the task of salient object detection in optical remote sensing images (RSI-SOD) has gained increasing interest. Despite some advancement in current methods, challenges such as the irregular topology of salient objects and cluttered backgrounds in optical RSI remain. To tackle these issues, we propose a novel heterogeneous feature collaboration network (HFCNet). Specifically, we design a new hybrid heterogeneous encoder that combines CNN and transformer to extract a set of heterogeneous features, famous in modeling local and global information, respectively. Subsequently, the adaptive global-local integration (AGLI) module is devised to integrate these complementary heterogeneous features through our feature alignment methods at global and local levels, so the global irregular topology structure and local details can be well-modeled. Furthermore, the proposed saliency-guided attention enhanced decoder (SGAED) leverages deep salient cues to guide the shallow decoders to pay more attention to important areas and suppress irrelevant areas, reducing the interference of cluttered backgrounds. Extensive experiments on three benchmark datasets have confirmed the significant superiority of our method compared with 18 state-of-the-art methods. All codes and results of our method are available at https://github.com/xumingzhu989/HFCNet-TGRS.
引用
收藏
页数:14
相关论文
共 52 条
[1]  
Achanta R, 2009, PROC CVPR IEEE, P1597, DOI 10.1109/CVPRW.2009.5206596
[2]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[3]  
Chen ZY, 2020, AAAI CONF ARTIF INTE, V34, P10599
[4]   RRNet: Relational Reasoning Network With Parallel Multiscale Attention for Salient Object Detection in Optical Remote Sensing Images [J].
Cong, Runmin ;
Zhang, Yumo ;
Fang, Leyuan ;
Li, Jun ;
Zhao, Yao ;
Kwong, Sam .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[5]  
Deng ZJ, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P684
[6]   PGA-Net: Pyramid Feature Fusion and Global Context Attention Network for Automated Surface Defect Detection [J].
Dong, Hongwen ;
Song, Kechen ;
He, Yu ;
Xu, Jing ;
Yan, Yunhui ;
Meng, Qinggang .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (12) :7448-7458
[7]   Structure-measure: A New Way to Evaluate Foreground Maps [J].
Fan, Deng-Ping ;
Cheng, Ming-Ming ;
Liu, Yun ;
Li, Tao ;
Borji, Ali .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4558-4567
[8]  
Fan DP, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P698
[9]   Adaptive Spatial Tokenization Transformer for Salient Object Detection in Optical Remote Sensing Images [J].
Gao, Lina ;
Liu, Bing ;
Fu, Ping ;
Xu, Mingzhu .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778