Spectrum-Driven Mixed-Frequency Network for Hyperspectral Salient Object Detection

被引:11
作者
Liu, Peifu [1 ]
Xu, Tingfa [1 ,2 ,3 ]
Chen, Huan [1 ]
Zhou, Shiyun [1 ]
Qin, Haolin [1 ]
Li, Jianan [1 ]
机构
[1] Beijing Inst Technol, Beijing 10081, Peoples R China
[2] Minist Educ China, Key Lab Photoelect Imaging Technol & Syst, Beijing 100081, Peoples R China
[3] Beijing Inst Technol, Chongqing Innovat Ctr, Big Data & Artificial Intelligence Lab, Chongqing 401151, Peoples R China
关键词
Image edge detection; Feature extraction; Object detection; Hyperspectral imaging; Head; Neural networks; Image color analysis; Hyperspectral salient object detection (HSOD); mixed-frequency attention; spectrum; DATASET; MODEL;
D O I
10.1109/TMM.2023.3331196
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Hyperspectral salient object detection (HSOD) aims to detect spectrally salient objects in hyperspectral images (HSIs). However, existing methods inadequately utilize spectral information by either converting HSIs into false-color images or converging neural networks with clustering. We propose a novel approach that fully leverages the spectral characteristics by extracting two distinct frequency components from the spectrum: low-frequency Spectral Saliency and high-frequency Spectral Edge. The Spectral Saliency approximates the region of salient objects, while the Spectral Edge captures edge information of salient objects. These two complementary components, crucial for HSOD, are derived by computing from the inter-layer spectral angular distance of the Gaussian pyramid and the intra-neighborhood spectral angular gradients, respectively. To effectively utilize this dual-frequency information, we introduce a novel lightweight Spectrum-driven Mixed-frequency Network (SMN). SMN incorporates two parameter-free plug-and-play operators, namely Spectral Saliency Generator and Spectral Edge Operator, to extract the Spectral Saliency and Spectral Edge components from the input HSI independently. Subsequently, the Mixed-frequency Attention module, comprised of two frequency-dependent heads, intelligently combines the embedded features of edge and saliency information, resulting in a mixed-frequency feature representation. Furthermore, a saliency-edge-aware decoder progressively scales up the mixed-frequency feature while preserving rich detail and saliency information for accurate salient object prediction. Extensive experiments conducted on the HS-SOD benchmark and our custom dataset HSOD-BIT demonstrate that our SMN outperforms state-of-the-art methods regarding HSOD performance.
引用
收藏
页码:5296 / 5310
页数:15
相关论文
共 49 条
[1]  
Alexe B, 2010, PROC CVPR IEEE, P73, DOI 10.1109/CVPR.2010.5540226
[2]  
[Anonymous], 2016, Trans Assoc Comput Linguist, DOI [DOI 10.1162/TACLA00097, 10.1162/tacl_a_00097]
[3]   Sparse Recovery of Hyperspectral Signal from Natural RGB Images [J].
Arad, Boaz ;
Ben-Shahar, Ohad .
COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 :19-34
[4]   Salient Object Detection: A Benchmark [J].
Borji, Ali ;
Cheng, Ming-Ming ;
Jiang, Huaizu ;
Li, Jia .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (12) :5706-5722
[5]   Salient object detection: A survey [J].
Borji, Ali ;
Cheng, Ming-Ming ;
Hou, Qibin ;
Jiang, Huaizu ;
Li, Jia .
COMPUTATIONAL VISUAL MEDIA, 2019, 5 (02) :117-150
[6]   What is a Salient Object? A Dataset and a Baseline Model for Salient Object Detection [J].
Borji, Ali .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (02) :742-756
[7]   What Do Different Evaluation Metrics Tell Us About Saliency Models? [J].
Bylinskii, Zoya ;
Judd, Tilke ;
Oliva, Aude ;
Torralba, Antonio ;
Durand, Fredo .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (03) :740-757
[8]  
Cao Y, 2015, 2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, P1086, DOI 10.1109/ChinaSIP.2015.7230572
[9]   Does Thermal Really Always Matter for RGB-T Salient Object Detection? [J].
Cong, Runmin ;
Zhang, Kepu ;
Zhang, Chen ;
Zheng, Feng ;
Zhao, Yao ;
Huang, Qingming ;
Kwong, Sam .
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 :6971-6982
[10]  
Dosovitskiy A, 2020, INT C LEARN REPR