Supervised-unsupervised combined transformer for spectral compressive imaging reconstruction

Cited by: 6
Authors
Zhou, Han [1]
Lian, Yusheng [1]
Li, Jin [2]
Liu, Zilong [3]
Cao, Xuheng [4]
Ma, Chao [1]
Affiliations
[1] Beijing Inst Graph Commun, Sch Printing & Packaging Engn, Beijing 102600, Peoples R China
[2] Beihang Univ, Sch Instrumentat & Optoelect Engn, Beijing 100191, Peoples R China
[3] Natl Inst Metrol, Data Ctr, Beijing 100029, Peoples R China
[4] Tongji Univ, Sch Phys Sci & Engn, Shanghai 200092, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
compressive hyperspectral reconstruction; image priors learning; computational imaging; DESIGN;
DOI
10.1016/j.optlaseng.2024.108030
Chinese Library Classification
O43 [Optics];
Subject classification code
070207 ; 0803 ;
Abstract
Spectral compressive imaging (SCI) systems have recently attracted growing attention as a way to overcome the low spatial and/or temporal resolution from which conventional hyperspectral cameras often suffer. Recovering a hyperspectral image (HSI) from its corresponding 2D coded image is an ill-posed inverse problem, and learning accurate priors from the HSI and the 2D coded image is essential to solving it. Existing methods use either supervised networks, which learn a generalized prior from training datasets, or unsupervised networks, which learn a specific prior from the 2D coded image, and therefore cannot learn both generalized and specific priors. In addition, when learning the priors, existing methods cannot simultaneously account for both global and local scales and both spatial and spectral dimensions. To address these problems, we propose a Supervised-Unsupervised Combined Transformer Network (SUCTNet) composed of a supervised Spatio-spectral Transformer network (SSTNet) and an Unsupervised Multi-level Feature Refinement network (UMFRNet). Specifically, we first develop SSTNet to learn a generalized prior and obtain a preliminary HSI. In SSTNet, the proposed spatial-encoding and spectral-decoding architecture allows it to consider both spatial and spectral dimensions simultaneously, and the proposed Global and Local Multi-head Self-Attention block (GL-MSA) allows it to consider both global and local scales simultaneously. The preliminary HSI is then fed into the proposed UMFRNet to learn a specific prior and obtain the target HSI. In UMFRNet, a proposed multi-level feature refinement mechanism and the physical imaging model of SCI are used to improve reconstruction accuracy and generalization performance. Extensive experiments show that our method significantly outperforms state-of-the-art (SOTA) methods on simulated and real datasets.
Code will be available at https://github.com/Vzhouhan/SUCTNet.
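The abstract refers to the physical imaging model of SCI without stating it. As a rough illustration only, the sketch below assumes a single-disperser, CASSI-style system: each spectral band is modulated by a coded aperture, shifted by a fixed number of columns per band, and summed on the detector. The function names `sci_forward` and `measurement_residual`, the `step` parameter, and the toy array sizes are hypothetical and are not taken from the paper or its repository. The residual shows one way an unsupervised stage could score an HSI estimate against the 2D coded image without ground truth.

```python
import numpy as np

def sci_forward(hsi, mask, step=1):
    """Hypothetical CASSI-style forward model: mask each band, shift it, sum."""
    h, w, bands = hsi.shape
    # The detector is wider than the scene because of the dispersive shift.
    meas = np.zeros((h, w + step * (bands - 1)))
    for b in range(bands):
        # Band b is modulated by the coded aperture and shifted b*step columns.
        meas[:, b * step : b * step + w] += mask * hsi[:, :, b]
    return meas

def measurement_residual(hsi_estimate, measurement, mask, step=1):
    """Mean squared error between the re-projected estimate and the coded image."""
    reprojected = sci_forward(hsi_estimate, mask, step)
    return float(np.mean((reprojected - measurement) ** 2))

# Toy data (assumed sizes): an 8x8 scene with 4 bands and a random binary mask.
rng = np.random.default_rng(0)
hsi = rng.random((8, 8, 4))
mask = (rng.random((8, 8)) > 0.5).astype(float)
y = sci_forward(hsi, mask)
```

Under these assumptions the residual is zero for the true HSI and grows as the estimate drifts from it. Many different HSIs produce the same coded image, which is why the abstract describes the reconstruction as ill-posed and relies on learned priors.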
Pages: 11