Object Detection in Hyperspectral Image via Unified Spectral-Spatial Feature Aggregation

被引：0

作者：

He X. ^{[1
]}

Tang C. ^{[1
]}

Liu X. ^{[2
]}

Zhang W. ^{[3
]}

Sun K. ^{[1
]}

Xu J. ^{[4
]}

机构：

[1] China University of Geosciences, The School of Computer, Wuhan

[2] National University of Defense Technology, The School of Computer, Changsha

[3] Qilu University of Technology (Shandong Academy of Sciences), The Shandong Provincial Key Laboratory of Computer Networks, Shandong Computer Science Center (National Supercomputing Center in Jinan), Jinan

[4] Hexagon Ab, Qingdao

来源：

IEEE Transactions on Geoscience and Remote Sensing | 2023年 / 61卷

关键词：

Deep learning; feature fusion; hyperspectral image (HSI) object detection; spectral-spatial aggregation (SSA);

D O I：

10.1109/TGRS.2023.3307288

中图分类号：

学科分类号：

摘要：

Deep learning-based hyperspectral image (HSI) classification and object detection techniques have gained significant attention due to their vital role in image content analysis, interpretation, and broader HSI applications. However, current hyperspectral object detection approaches predominantly emphasize spectral or spatial information, overlooking the valuable complementary relationship between these two aspects. In this study, we present a novel spectral-spatial aggregation (S2ADet) object detector that effectively harnesses the rich spectral and spatial complementary information inherent in the HSI. S2ADet comprises a hyperspectral information decoupling (HID) module, a two-stream feature extraction network, and a one-stage detection head. The HID module processes hyperspectral data by aggregating spectral and spatial information via band selection and principal components analysis, consequently reducing redundancy. Based on the acquired spectral and spatial aggregation information, we propose a feature aggregation two-stream network for interacting spectral-spatial features. Furthermore, to address the limitations of existing databases, we annotate an extensive dataset, designated as HOD3K, containing 3242 HSIs captured across diverse real-world scenes, and encompassing three object classes. These images possess a resolution of 512 × 256 pixels and cover 16 bands ranging from 470 to 620 nm. Comprehensive experiments on two datasets demonstrate that S2ADet surpasses existing state-of-the-art methods, achieving robust, and reliable results. The demo code and dataset of this work are publicly available at https://github.com/hexiao-cs/S2ADet. © 1980-2012 IEEE.

引用