MATR: Multimodal Medical Image Fusion via Multiscale Adaptive Transformer

Cited by: 209
Authors
Tang, Wei [1 ]
He, Fazhi [1 ]
Liu, Yu [2 ]
Duan, Yansong [3 ]
Affiliations
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
[2] Hefei Univ Technol, Dept Biomed Engn, Hefei 230009, Peoples R China
[3] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Transformers; Image fusion; Single photon emission computed tomography; Magnetic resonance imaging; Transforms; Medical diagnostic imaging; Task analysis; biomedical image; transformer; adaptive convolution; deep learning; TOP-DOWN INFLUENCES; INFORMATION; PERFORMANCE; SIMILARITY; PROTEIN; MODEL;
DOI
10.1109/TIP.2022.3193288
Chinese Library Classification (CLC)
TP18 [Artificial Intelligence Theory];
Subject Classification Codes
081104; 0812; 0835; 1405;
Abstract
Owing to the limitations of imaging sensors, it is challenging to obtain a medical image that simultaneously contains functional metabolic information and structural tissue details. Multimodal medical image fusion, an effective way to merge the complementary information in different modalities, has become a significant technique to facilitate clinical diagnosis and surgical navigation. With powerful feature representation ability, deep learning (DL)-based methods have improved such fusion results but still have not achieved satisfactory performance. Specifically, existing DL-based methods generally depend on convolutional operations, which extract local patterns well but have limited capability in preserving global context information. To compensate for this defect and achieve accurate fusion, we propose a novel unsupervised method to fuse multimodal medical images via a multiscale adaptive Transformer termed MATR. In the proposed method, instead of directly employing vanilla convolution, we introduce an adaptive convolution for adaptively modulating the convolutional kernel based on the global complementary context. To further model long-range dependencies, an adaptive Transformer is employed to enhance the global semantic extraction capability. Our network architecture is designed in a multiscale fashion so that useful multimodal information can be adequately acquired from the perspective of different scales. Moreover, an objective function composed of a structural loss and a region mutual information loss is devised to construct constraints for information preservation at both the structural level and the feature level. Extensive experiments on a mainstream database demonstrate that the proposed method outperforms other representative and state-of-the-art methods in terms of both visual quality and quantitative evaluation. We also extend the proposed method to address other biomedical image fusion issues, and the pleasing fusion results illustrate that MATR has good generalization capability. The code of the proposed method is available at https://github.com/tthinking/MATR.
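The abstract describes two key ingredients: a convolution whose kernel is modulated by a global context descriptor, and an adaptive Transformer for long-range dependencies. Below is a minimal PyTorch sketch of the kernel-modulation idea only; it is not the authors' released implementation (see the linked GitHub repository for that), and all names here (AdaptiveConv2d, context, the 2-channel toy input) are illustrative assumptions.

```python
# Minimal sketch (assumption, not the authors' code) of an "adaptive
# convolution": a shared kernel is re-weighted per sample by a globally
# pooled context descriptor, so local filtering depends on global content.
import torch
import torch.nn as nn
import torch.nn.functional as F


class AdaptiveConv2d(nn.Module):
    def __init__(self, in_ch: int, out_ch: int, k: int = 3):
        super().__init__()
        # Base kernel shared across all samples.
        self.weight = nn.Parameter(torch.randn(out_ch, in_ch, k, k) * 0.02)
        # Maps a global average-pooled descriptor to a per-output-channel
        # modulation factor in (0, 1).
        self.context = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(in_ch, out_ch, kernel_size=1),
            nn.Sigmoid(),
        )
        self.pad = k // 2

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.shape
        scale = self.context(x)                          # (B, out_ch, 1, 1)
        # Per-sample kernels: (B, out_ch, in_ch, k, k).
        kernel = self.weight.unsqueeze(0) * scale.view(b, -1, 1, 1, 1)
        # Apply a different kernel to each sample via a grouped convolution.
        x = x.reshape(1, b * c, h, w)
        kernel = kernel.reshape(-1, c, kernel.shape[-2], kernel.shape[-1])
        out = F.conv2d(x, kernel, padding=self.pad, groups=b)
        return out.reshape(b, -1, h, w)


if __name__ == "__main__":
    # Toy usage: a 2-channel input, e.g., an MRI slice stacked with the
    # luminance channel of a functional image (purely illustrative).
    feats = AdaptiveConv2d(2, 16)(torch.randn(4, 2, 64, 64))
    print(feats.shape)  # torch.Size([4, 16, 64, 64])
```

The full MATR network additionally applies an adaptive Transformer over such features at multiple scales and is trained with a structural loss plus a region mutual information loss; those components are omitted from this sketch.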
Pages: 5134-5149
Page count: 16