Adaptive Token Mixer for Hyperspectral Image Classification

被引:0
作者
Lei, Shuhan [1 ]
Zhang, Meng [1 ]
Wang, Yuhang [1 ]
Tang, Nan [1 ]
Jia, Ni [1 ]
Fu, Lihua [2 ]
机构
[1] Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China
[2] China Univ Geosci, Sch Math & Phys, Wuhan 430074, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptation models; Feature extraction; Computer architecture; Mixers; Computational modeling; Overfitting; Hyperspectral imaging; Convolution; Strips; Fuses; Adaptive token mixer (ATM); cross-shaped convolutional operator (COSTCO); hyperspectral image (HSI) classification; MLP-like models;
D O I
10.1109/JSTARS.2025.3552817
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
MLP-like models have shown strong potential in hyperspectral image (HSI) classification. However, their dense connections among all neurons (tokens) lead to large model sizes, high computational costs, and increased risk of overfitting. To address these issues, researchers have proposed sparse connectivity strategies to create more compact MLP models by selecting and mixing only a subset of tokens. However, most token selection rules overlook image patch content, often introducing task-irrelevant tokens with little valuable class distribution information. This problem is particularly severe in HSIs, which contain rich spatial and spectral information. To overcome this, we propose an adaptive token mixer (ATM) to effectively integrate spatial information in HSIs. ATM adaptively learns token positions based on their content, enabling the model to identify relevant tokens and capture global spatial information across the entire spatial domain. In addition, we introduce a cross-shaped convolutional operator (COSTCO) to enhance local spatial feature extraction. The combination of ATM and COSTCO enables comprehensive token mixing by integrating both global and local spatial information. Experimental results show that this proposed adaptive MLP focuses on the most informative, task-relevant regions during decision-making, offering interpretability to help users understand its predictions. Moreover, the adaptive MLP achieves state-of-the-art performance on HSI classification tasks across four publicly available datasets.
引用
收藏
页码:8882 / 8896
页数:15
相关论文
共 30 条
[11]   Recent Advances on Spectral-Spatial Hyperspectral Image Classification: An Overview and New Guidelines [J].
He, Lin ;
Li, Jun ;
Liu, Chenying ;
Li, Shutao .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (03) :1579-1597
[12]   SpectralFormer: Rethinking Hyperspectral Image Classification With Transformers [J].
Hong, Danfeng ;
Han, Zhu ;
Yao, Jing ;
Gao, Lianru ;
Zhang, Bing ;
Plaza, Antonio ;
Chanussot, Jocelyn .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[13]   Estimation of crop LAI using hyperspectral vegetation indices and a hybrid inversion method [J].
Liang, Liang ;
Di, Liping ;
Zhang, Lianpeng ;
Deng, Meixia ;
Qin, Zhihao ;
Zhao, Shuhe ;
Lin, Hui .
REMOTE SENSING OF ENVIRONMENT, 2015, 165 :123-134
[14]   Probabilistic-Kernel Collaborative Representation for Spatial-Spectral Hyperspectral Image Classification [J].
Liu, Jianjun ;
Wu, Zebin ;
Li, Jun ;
Plaza, Antonio ;
Yuan, Yunhao .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (04) :2371-2384
[15]  
Lou YX, 2022, Arxiv, DOI arXiv:2109.02008
[16]   Classification of hyperspectral remote sensing images with support vector machines [J].
Melgani, F ;
Bruzzone, L .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2004, 42 (08) :1778-1790
[17]   Limitations of Principal Components Analysis for Hyperspectral Target Recognition [J].
Prasad, Saurabh ;
Bruce, Lori Mann .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2008, 5 (04) :625-629
[18]   HybridSN: Exploring 3-D-2-D CNN Feature Hierarchy for Hyperspectral Image Classification [J].
Roy, Swalpa Kumar ;
Krishna, Gopal ;
Dubey, Shiv Ram ;
Chaudhuri, Bidyut B. .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (02) :277-281
[19]   Supervised classification of remotely sensed imagery using a modified k-NN technique [J].
Samaniego, Luis ;
Bardossy, Andras ;
Schulz, Karsten .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2008, 46 (07) :2112-2125
[20]   Grad-CAM: Visual Explanations from Deep Networks via Gradient-based Localization [J].
Selvaraju, Ramprasaath R. ;
Cogswell, Michael ;
Das, Abhishek ;
Vedantam, Ramakrishna ;
Parikh, Devi ;
Batra, Dhruv .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :618-626