The Paradigm Shift in Hyperspectral Image Compression: A Neural Video Representation Methodology

被引:0
|
作者
Zhao, Nan [1 ]
Pan, Tianpeng [2 ]
Li, Zhitong [1 ]
Chen, Enke [1 ]
Zhang, Lili [2 ]
机构
[1] Jiangsu Automat Res Inst, Lianyungang 222061, Peoples R China
[2] Shenyang Aerosp Univ, Sch Elect & Informat Engn, Shenyang 110136, Peoples R China
关键词
deep learning; hyperspectral image; neural video representation; deep learning compression methods; MODEL;
D O I
10.3390/rs17040679
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In recent years, with the continuous development of deep learning, the scope of neural networks that can be expressed is becoming wider and their expressive ability stronger. Traditional deep learning methods based on extracting latent representations have achieved satisfactory results. However, in the field of hyperspectral image compression, the high computational cost and the degradation of their generalization ability reduce their application. We analyze the objective formulation of traditional learning-based methods and draw the conclusion that rather than treating the hyperspectral image as an entire tensor to extract the latent representation, it is preferred to view it as a stream of video data, where each spectral band represents a frame of information and variances between spectral bands represent transformations between frames. Moreover, in order to compress the hyperspectral image of this video representation, neural video representation that decouples the spectral and spatial dimensions from each other for representation learning is employed so that the information about the data is preserved in the neural network parameters. Specifically, the network utilizes the spectral band index and the spatial coordinate index encoded with positional encoding as its input to perform network overfitting, which can output the image information of the corresponding spectral band based on the index of that spectral band. The experimental results indicate that the proposed method achieves approximately a 5 dB higher PSNR compared with traditional deep learning-based compression methods and outperforms another neural video representation method by 0.5 dB when using only the spectral band index as input.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Hyperspectral image compression on reconfigurable platforms
    Fry, TW
    Hauck, S
    10TH ANNUAL IEEE SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES, PROCEEDINGS, 2002, : 251 - 260
  • [42] Data characterization for hyperspectral image compression
    Simmons, RE
    Brower, BV
    Schott, JR
    MULTISPECTRAL IMAGING FOR TERRESTRIAL APPLICATIONS II, 1997, 3119 : 172 - 183
  • [43] HIERARCHICAL COMPRESSION FOR HYPERSPECTRAL IMAGE STORAGE
    Gashnikov, M. V.
    Glumov, N. I.
    COMPUTER OPTICS, 2014, 38 (03) : 482 - 488
  • [44] AN OPERATIONAL APPROACH FOR HYPERSPECTRAL IMAGE COMPRESSION
    Du, Qian
    Ly, Nam
    Fowler, James E.
    2012 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2012, : 1357 - 1360
  • [45] Progressively Expanded Neural Network (PEN Net) for hyperspectral image classification: A new neural network paradigm for remote sensing image analysis
    Sidike, Paheding
    Asari, Vijayan K.
    Sagan, Vasit
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 146 : 161 - 181
  • [46] Hybrid Adaptive Prediction Mechanisms with Multilayer Propagation Neural Network for Hyperspectral Image Compression
    Xiao, Rui
    Paul, Manoranjan
    IMAGE AND VIDEO TECHNOLOGY (PSIVT 2017), 2018, 10749 : 162 - 173
  • [47] A Unified Attention Paradigm for Hyperspectral Image Classification
    Liu, Qian
    Wu, Zebin
    Xu, Yang
    Wei, Zhihui
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [48] Mamba Collaborative Implicit Neural Representation for Hyperspectral and Multispectral Remote Sensing Image Fusion
    Zhu, Chunyu
    Deng, Shangqi
    Song, Xuan
    Li, Yachao
    Wang, Qi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [49] Image and video compression: A survey
    Clarke, RJ
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 1999, 10 (01) : 20 - 32
  • [50] An FPGA implementation of a neural optimization of block truncation coding for image/video compression
    Saif, Sherif
    Abbas, Hazent M.
    Nassar, Salwa M.
    Wahdan, Abdelmonem A.
    MICROPROCESSORS AND MICROSYSTEMS, 2007, 31 (08) : 477 - 486