End-to-End Low Cost Compressive Spectral Imaging with Spatial-Spectral Self-Attention

Cited by: 181
Authors
Meng, Ziyi [1, 2]
Ma, Jiawei [3]
Yuan, Xin [4]
Affiliations
[1] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China
[2] New Jersey Inst Technol, Newark, NJ 07102 USA
[3] Columbia Univ, New York, NY 10027 USA
[4] Nokia Bell Labs, Murray Hill, NJ 07974 USA
Source
COMPUTER VISION - ECCV 2020, PT XXIII | 2020 / Vol. 12368
Keywords
Compressive spectral imaging; Spatial-Spectral Self-Attention; Large-scale real data; RECONSTRUCTION; VIDEO; DESIGN; NOISE; MODEL
DOI
10.1007/978-3-030-58592-1_12
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Coded aperture snapshot spectral imaging (CASSI) is an effective tool for capturing real-world 3D hyperspectral images. While a considerable body of work has addressed hardware and algorithm design, we take a step towards a low-cost solution that delivers video-rate, high-quality reconstruction. To make solid progress on this challenging yet under-investigated task, we reproduce a stable single disperser (SD) CASSI system to gather large-scale real-world CASSI data and propose a novel deep convolutional network that uses self-attention to perform real-time reconstruction. To jointly capture self-attention across the different dimensions of hyperspectral images (i.e., channel-wise spectral correlation and non-local spatial regions), we propose Spatial-Spectral Self-Attention (TSA), which processes each dimension sequentially yet in an order-independent manner. We employ TSA in an encoder-decoder network, dubbed TSA-Net, to reconstruct the desired 3D cube. Furthermore, we investigate how noise affects the results and propose adding shot noise during model training, which significantly improves the results on real data. We hope our large-scale CASSI data will serve as a benchmark for future research and our TSA model as a baseline for deep-learning-based reconstruction algorithms. Our code and data are available at https://github.com/mengziyi64/TSA-Net.
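To make the sensing and noise ideas in the abstract concrete, here is a minimal NumPy sketch of a single disperser CASSI measurement followed by Poisson shot noise. It is an illustration under stated assumptions, not the authors' implementation: the function names, the 2-pixel dispersion step, and the `peak_photons` parameter are hypothetical choices, and shot noise is modeled with a plain Poisson distribution.

```python
import numpy as np


def sd_cassi_measurement(cube, mask, step=2):
    """Simulate a noise-free SD-CASSI snapshot (illustrative model).

    cube: (H, W, B) non-negative hyperspectral cube.
    mask: (H, W) coded aperture pattern.
    step: assumed dispersion shift in pixels between adjacent bands.
    """
    h, w, b = cube.shape
    meas = np.zeros((h, w + step * (b - 1)), dtype=np.float64)
    for k in range(b):
        # Modulate band k with the mask, shift it by step * k columns,
        # and integrate it onto the 2D detector.
        meas[:, step * k: step * k + w] += mask * cube[:, :, k]
    return meas


def add_shot_noise(meas, peak_photons=1000.0, rng=None):
    """Apply Poisson shot noise (assumed noise model).

    The measurement is scaled so that its brightest pixel corresponds to
    `peak_photons` expected photons, sampled, and scaled back.
    """
    rng = np.random.default_rng() if rng is None else rng
    scale = peak_photons / max(float(meas.max()), 1e-12)
    return rng.poisson(meas * scale) / scale


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    cube = rng.random((8, 8, 4))                      # toy 4-band cube
    mask = (rng.random((8, 8)) > 0.5).astype(float)   # random binary mask
    clean = sd_cassi_measurement(cube, mask)
    noisy = add_shot_noise(clean, peak_photons=500.0, rng=rng)
    print(clean.shape, noisy.shape)
```

In a training loop, noise of this kind would be applied to each simulated measurement before it is fed to the reconstruction network, so that the network sees signal-dependent noise similar to that of a real sensor.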
Pages: 187-204
Number of pages: 18