IC-CViT: Inverse-Consistent Convolutional Vision Transformer for Diffeomorphic Image Registration

被引：0

作者：

Xu, Tao ^{[1
]}

Jiang, Ting ^{[1
]}

Li, Xiaoning ^{[1
,2
,3
]}

机构：

[1] Sichuan Normal Univ, Coll Comp Sci, Chengdu, Peoples R China

[2] Visual Comp & Virtual Real Key Lab Sichuan Prov, Chengdu, Peoples R China

[3] Sichuan 2011 Collaborat Innovat Ctr Educ Big Data, Chengdu, Peoples R China

来源：

2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023年

关键词：

Diffeomorphic registration; Inverse-consistent; Convolutional neural networks; Vision Transformer; 3D brain MRI;

D O I：

10.1109/IJCNN54540.2023.10191209

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Diffeomorphic registration plays a crucial role in medical image analysis due to the invertible and one-to-one mapping transformation. In recent years, with the development of deep learning technology, convolutional neural networks (CNNs) have been a broad focus of research in medical image registration, and CNN-based methods have made great progress. However, the results of most existing methods generally are not necessarily diffeomorphic, generating implausibly bijective mappings between images due to the interpolation and discrete representation. Furthermore, the performances of CNNs may be limited by a lack of precise comprehension of global and long-range cross-image spatial relevance. Vision Transformer (ViT) is capable of enhancing the long-distance information interaction ability to identify the semantically anatomically correspondences of medical images. Compared with CNN, ViT has weak local feature extraction ability due to less inductive bias, especially in small-scale training datasets, meaning that the samples between adjacent pixels cannot be exploited adequately. To address the above challenges, we propose a novel Inverse-Consistent Convolutional Vision Transformer (IC-CViT) network for diffeomorphic image registration. Specifically, image pairs can explicitly conduct bi-directional registration through the predicted deformation filed, generated within the diffeomorphic mappings space and restricted by the proposed inverse consistent loss term. We verify our method on two 3D brain MRI scan datasets including OASIS and LPBA40. Comprehensive results demonstrate that IC-CViT achieves state-of-the-art registration accuracy while maintaining desired diffeomorphic properties.

引用

页数：10

共 10 条

[1] IConDiffNet: an unsupervised inverse-consistent diffeomorphic network for medical image registration
Liao, Rui
Williamson, Jeffrey F.
Xia, Tianyu
Ge, Tao
O'Sullivan, Joseph A.
PHYSICS IN MEDICINE AND BIOLOGY, 2025, 70 (05):
[2] CvTMorph: Improving Local Feature Extraction in Medical Image Registration for Respiratory Motion Modeling with Convolutional Vision Transformer
Chen, Peizhi
Zou, Xupeng
Gou, Yifan
CURRENT MEDICAL IMAGING, 2024, 20
[3] PIViT: Large Deformation Image Registration with Pyramid-Iterative Vision Transformer
Ma, Tai
Dai, Xinru
Zhang, Suwei
Wen, Ying
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT X, 2023, 14229 : 602 - 612
[4] CAEVT: Convolutional Autoencoder Meets Lightweight Vision Transformer for Hyperspectral Image Classification
Zhang, Zhiwen
Li, Teng
Tang, Xuebin
Hu, Xiang
Peng, Yuanxi
SENSORS, 2022, 22 (10)
[5] Image Retrieval Using Convolutional Autoencoder, InfoGAN, and Vision Transformer Unsupervised Models
Sabry, Eman S.
Elagooz, Salah S.
Abd El-Samie, Fathi E.
El-Shafai, Walid
El-Bahnasawy, Nirmeen A.
El-Banby, Ghada M.
Algarni, Abeer D.
Soliman, Naglaa F.
Ramadan, Rabie A.
IEEE ACCESS, 2023, 11 : 20445 - 20477
[6] Hyperspectral Image Classification Using Groupwise Separable Convolutional Vision Transformer Network
Zhao, Zhuoyi
Xu, Xiang
Li, Shutao
Plaza, Antonio
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 17
[7] A 3-D Convolutional Vision Transformer for PolSAR Image Classification and Change Detection
Wang, Lei
Gui, Rong
Hong, Hanyu
Hu, Jun
Ma, Lei
Shi, Yu
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 11503 - 11520
[8] Res-MGCA-SE: a lightweight convolutional neural network based on vision transformer for medical image classification
Soleimani-Fard S.
Ko S.-B.
Neural Computing and Applications, 2024, 36 (28) : 17631 - 17644
[9] PViT-AIR: Puzzling vision transformer-based affine image registration for multi histopathology and faxitron images of breast tissue
Golestani, Negar
Wang, Aihui
Moallem, Golnaz
Bean, Gregory R.
Rusu, Mirabela
MEDICAL IMAGE ANALYSIS, 2025, 99
[10] Convolutional Neural Network and Vision Transformer-driven Cross-layer Multi-scale Fusion Network for Hyperspectral Image Classification
Zhao F.
Geng M.
Liu H.
Zhang J.
Yu J.
Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2024, 46 (05): : 2237 - 2248

← 1 →