A Cross-Attention-Based Multi-Information Fusion Transformer for Hyperspectral Image Classification

Cited by: 1

Authors:

Yang, Jinghui [1]

Li, Anqi [1]

Qian, Jinxi [2]

Qin, Jia [1]

Wang, Liguo [3]

Affiliations:

[1] China Univ Geosci, Sch Informat Engn, Beijing 100083, Peoples R China

[2] China Acad Space Technol, Inst Telecommun & Nav Satellites, Beijing 100094, Peoples R China

[3] Dalian Minzu Univ, Coll Informat & Commun Engn, Dalian 116600, Peoples R China

Source:

IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING | 2024 / Vol. 17

Funding:

National Natural Science Foundation of China; National Key Research and Development Program of China;

Keywords:

Transformers; Feature extraction; Hyperspectral imaging; Convolution; Image classification; Convolutional neural networks; Computational modeling; Classification; cross-attention; hyperspectral image (HSI); multi-information fusion; transformer; VISION TRANSFORMER; NETWORK;

DOI:

10.1109/JSTARS.2024.3429492

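Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification code:

0808 ; 0809 ;

Abstract: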

In recent years, deep-learning-based classification methods have been widely used for hyperspectral images (HSIs). However, existing transformer-based HSI classification methods still leave room for improvement in using the rich information effectively and comprehensively; for example, when multiple types of image information are used, the interaction between them is insufficiently considered. To address these issues, cross-attention interaction, class token and patch token information, and multiscale spatial information are handled in a unified framework, and a cross-attention-based multi-information fusion transformer (CAMFT) for HSI classification is proposed. CAMFT comprises a multiscale patch embedding module, a residual connection-based DeepViT (RCD) module, and a double-branch cross-attention (DBCA) module. First, the multiscale patch embedding module performs multi-information preprocessing by building processing branches at different scales and adding learnable class tokens. Second, the RCD module is designed to utilize rich information from different layers; this module includes reattention and residual connection. Third, the DBCA module is constructed to obtain more representative multi-information fusion features; this module not only integrates multiscale patch information but also effectively utilizes the complementary information between class tokens and patch tokens in the interaction of the two branches. Moreover, numerous experiments demonstrate that, compared with other state-of-the-art classification methods, the proposed CAMFT method achieves the best classification performance and retains excellent performance even with a small training sample size.
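The interaction between a class token of one branch and the patch tokens of the other branch can be illustrated with a minimal single-head cross-attention sketch in NumPy. This is not the authors' DBCA implementation: the abstract gives no layer details, so the function name `cross_attention`, the weight matrices, the token dimension, and the CrossViT-style choice of letting the class token attend to itself plus the other branch's patches are all assumptions made for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(cls_token, patch_tokens, w_q, w_k, w_v):
    """Hypothetical single-head cross-attention.

    cls_token:    (1, d) class token from one branch (the query).
    patch_tokens: (n, d) patch tokens from the other branch.
    Returns the fused class token (1, d) and attention weights (1, n + 1).
    """
    # Assumption: keys/values are the class token plus the other branch's patches.
    context = np.concatenate([cls_token, patch_tokens], axis=0)  # (n + 1, d)
    q = cls_token @ w_q                                          # (1, d)
    k = context @ w_k                                            # (n + 1, d)
    v = context @ w_v                                            # (n + 1, d)
    scores = q @ k.T / np.sqrt(q.shape[-1])                      # scaled dot product
    weights = softmax(scores, axis=-1)
    fused = cls_token + weights @ v                              # residual connection
    return fused, weights

rng = np.random.default_rng(0)
d = 8                                     # illustrative token dimension
cls_small = rng.normal(size=(1, d))       # class token of the small-scale branch
patches_large = rng.normal(size=(9, d))   # patch tokens of the large-scale branch
w_q, w_k, w_v = (rng.normal(size=(d, d)) * 0.1 for _ in range(3))

fused, weights = cross_attention(cls_small, patches_large, w_q, w_k, w_v)
print(fused.shape, weights.shape)
```

In a two-branch setting the same function would be called a second time with the roles of the branches swapped, so that each class token gathers information from the other scale.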

Citation

Pages: 13358-13375

Page count: 18

61 entries in total
