MFormer: Taming Masked Transformer for Unsupervised Spectral Reconstruction

被引:14
作者
Li, Jiaojiao [1 ,2 ]
Leng, Yihong [1 ]
Song, Rui [1 ]
Liu, Wei [3 ]
Li, Yunsong [1 ]
Du, Qian [4 ]
机构
[1] Xidian Univ, Sch Telecommun Engn, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[2] Chinese Acad Sci, Key Lab Spectral Imaging Technol, Xian 710119, Peoples R China
[3] State Key Lab Geoinformat Engn, Xian 710054, Peoples R China
[4] Mississippi State Univ, Dept Elect & Comp Engn, Starkville, MS 39762 USA
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Transformers; Image reconstruction; Task analysis; Spatial resolution; Hyperspectral imaging; Imaging; Feature extraction; Band masking; dual multihead self-attention (MSA); spectral reconstruction (SR); spectral structural similarity; transformer; unsupervised learning; DEEP FOREST; CNN;
D O I
10.1109/TGRS.2023.3264976
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Spectral reconstruction (SR) aims to recover the hyperspectral images (HSIs) from the corresponding RGB images directly. Most SR studies based on supervised learning require massive data annotations to achieve superior reconstruction performance, which is limited by complicated imaging techniques and laborious annotation calibration in practice. Thus, unsupervised strategies attract the attention of the community, however, existing unsupervised SR works still face a fatal bottleneck from low accuracy. Besides, traditional convolutional neural network (CNN)-based models are good at capturing local features but experience difficulty in global features. To ameliorate these drawbacks, we propose an unsupervised SR architecture with strong constraints, especially constructing a novel Masked Transformer (MFormer) to excavate latent hyperspectral characteristics to restore realistic HSIs further. Concretely, a dual spectralwise multihead self-attention (DSSA) mechanism embedded in transformer is proposed to firmly associate multihead and channel dimensions and then capture the spectral representation in the implicit solution spaces. Furthermore, a plug-and-play mask-guided band augment (MBA) module is presented to extract and further enhance the bandwise correlation and continuity to boost the robustness of the model. Innovatively, a customized loss based on the intrinsic mapping from HSIs to RGB images and the inherent spectral structural similarity is designed to restrain spectral distortion. Extensive experimental results on three benchmarks verify that our MFormer achieves superior performance over other state-of-the-art (SOTA) supervised and unsupervised methods under a no-label training process equally.
引用
收藏
页数:12
相关论文
共 61 条
  • [1] Detection and Analysis of the Intestinal Ischemia Using Visible and Invisible Hyperspectral Imaging
    Akbari, Hamed
    Kosugi, Yukio
    Kojima, Kazuyuki
    Tanaka, Naofumi
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2010, 57 (08) : 2011 - 2017
  • [2] Arad B., 2018, P C COMP VIS PATT RE, P18
  • [3] NTIRE 2022 Spectral Recovery Challenge and Data Set
    Arad, Boaz
    Timofte, Radu
    Yahel, Rony
    Morag, Nimrod
    Bernat, Amir
    Cai, Yuanhao
    Lin, Jing
    Lin, Zudi
    Wang, Haoqian
    Zhang, Yulun
    Pfister, Hanspeter
    Van Gool, Luc
    Liu, Shuai
    Li, Yongqiang
    Feng, Chaoyu
    Lei, Lei
    Li, Jiaojiao
    Du, Songcheng
    Wu, Chaoxiong
    Leng, Yihong
    Song, Rui
    Zhang, Mingwei
    Song, Chongxing
    Zhao, Shuyi
    Lang, Zhiqiang
    Wei, Wei
    Zhang, Lei
    Dian, Renwei
    Shan, Tianci
    Guo, Anjing
    Feng, Chengguo
    Liu, Jinyang
    Agarla, Mirko
    Bianco, Simone
    Buzzelli, Marco
    Celona, Luigi
    Schettini, Raimondo
    He, Jiang
    Xiao, Yi
    Xiao, Jiajun
    Yuan, Qiangqiang
    Li, Jie
    Zhang, Liangpei
    Kwon, Taesung
    Ryu, Dohoon
    Bae, Hyokyoung
    Yang, Hao-Hsiang
    Chang, Hua-En
    Huang, Zhi-Kai
    Chen, Wei-Ting
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 862 - 880
  • [4] NTIRE 2020 Challenge on Spectral Reconstruction from an RGB Image
    Arad, Boaz
    Timofte, Radu
    Ben-Shahar, Ohad
    Lin, Yi-Tun
    Finlayson, Graham
    Givati, Shai
    Li, Jiaojiao
    Wu, Chaoxiong
    Song, Rui
    Li, Yunsong
    Liu, Fei
    Lang, Zhiqiang
    Wei, Wei
    Zhang, Lei
    Nie, Jiangtao
    Zhao, Yuzhi
    Po, Lai-Man
    Yan, Qiong
    Liu, Wei
    Lin, Tingyu
    Kim, Youngjung
    Shin, Changyeop
    Rho, Kyeongha
    Kim, Sungho
    Zhu, Zhiyu
    Hou, Junhui
    Sun, He
    Ren, Jinchang
    Fang, Zhenyu
    Yan, Yijun
    Peng, Hao
    Chen, Xiaomei
    Zhao, Jie
    Stiebel, Tarek
    Koppers, Simon
    Merhof, Dorit
    Gupta, Honey
    Mitra, Kaushik
    Fubara, Biebele Joslyn
    Sedky, Mohamed
    Dyke, Dave
    Banerjee, Atmadeep
    Palrecha, Akash
    Sabarinathan
    Uma, K.
    Vinothini, D. Synthiya
    Bama, B. Sathya
    Roomi, S. M. Md Mansoor
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 1806 - 1822
  • [5] Filter Selection for Hyperspectral Estimation
    Arad, Boaz
    Ben-Shahar, Ohad
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 3172 - 3180
  • [6] Sparse Recovery of Hyperspectral Signal from Natural RGB Images
    Arad, Boaz
    Ben-Shahar, Ohad
    [J]. COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 19 - 34
  • [7] Arjovsky M, 2017, PR MACH LEARN RES, V70
  • [8] Hyperspectral Remote Sensing Data Analysis and Future Challenges
    Bioucas-Dias, Jose M.
    Plaza, Antonio
    Camps-Valls, Gustavo
    Scheunders, Paul
    Nasrabadi, Nasser M.
    Chanussot, Jocelyn
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2013, 1 (02) : 6 - 36
  • [9] Mask-guided Spectral-wise Transformer for Efficient Hyperspectral Image Reconstruction
    Cai, Yuanhao
    Lin, Jing
    Hu, Xiaowan
    Wang, Haoqian
    Yuan, Xin
    Zhang, Yulun
    Timofte, Radu
    Van Gool, Luc
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17481 - 17490
  • [10] MST plus plus : Multi-stage Spectral-wise Transformer for Efficient Spectral Reconstruction
    Cai, Yuanhao
    Lin, Jing
    Lin, Zudi
    Wang, Haoqian
    Zhang, Yulun
    Pfister, Hanspeter
    Timofte, Radu
    Van Gool, Luc
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 744 - 754