IC-CViT: Inverse-Consistent Convolutional Vision Transformer for Diffeomorphic Image Registration

被引:0
|
作者
Xu, Tao [1 ]
Jiang, Ting [1 ]
Li, Xiaoning [1 ,2 ,3 ]
机构
[1] Sichuan Normal Univ, Coll Comp Sci, Chengdu, Peoples R China
[2] Visual Comp & Virtual Real Key Lab Sichuan Prov, Chengdu, Peoples R China
[3] Sichuan 2011 Collaborat Innovat Ctr Educ Big Data, Chengdu, Peoples R China
来源
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN | 2023年
关键词
Diffeomorphic registration; Inverse-consistent; Convolutional neural networks; Vision Transformer; 3D brain MRI;
D O I
10.1109/IJCNN54540.2023.10191209
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Diffeomorphic registration plays a crucial role in medical image analysis due to the invertible and one-to-one mapping transformation. In recent years, with the development of deep learning technology, convolutional neural networks (CNNs) have been a broad focus of research in medical image registration, and CNN-based methods have made great progress. However, the results of most existing methods generally are not necessarily diffeomorphic, generating implausibly bijective mappings between images due to the interpolation and discrete representation. Furthermore, the performances of CNNs may be limited by a lack of precise comprehension of global and long-range cross-image spatial relevance. Vision Transformer (ViT) is capable of enhancing the long-distance information interaction ability to identify the semantically anatomically correspondences of medical images. Compared with CNN, ViT has weak local feature extraction ability due to less inductive bias, especially in small-scale training datasets, meaning that the samples between adjacent pixels cannot be exploited adequately. To address the above challenges, we propose a novel Inverse-Consistent Convolutional Vision Transformer (IC-CViT) network for diffeomorphic image registration. Specifically, image pairs can explicitly conduct bi-directional registration through the predicted deformation filed, generated within the diffeomorphic mappings space and restricted by the proposed inverse consistent loss term. We verify our method on two 3D brain MRI scan datasets including OASIS and LPBA40. Comprehensive results demonstrate that IC-CViT achieves state-of-the-art registration accuracy while maintaining desired diffeomorphic properties.
引用
收藏
页数:10
相关论文
共 10 条
  • [1] IConDiffNet: an unsupervised inverse-consistent diffeomorphic network for medical image registration
    Liao, Rui
    Williamson, Jeffrey F.
    Xia, Tianyu
    Ge, Tao
    O'Sullivan, Joseph A.
    PHYSICS IN MEDICINE AND BIOLOGY, 2025, 70 (05):
  • [2] CvTMorph: Improving Local Feature Extraction in Medical Image Registration for Respiratory Motion Modeling with Convolutional Vision Transformer
    Chen, Peizhi
    Zou, Xupeng
    Gou, Yifan
    CURRENT MEDICAL IMAGING, 2024, 20
  • [3] PIViT: Large Deformation Image Registration with Pyramid-Iterative Vision Transformer
    Ma, Tai
    Dai, Xinru
    Zhang, Suwei
    Wen, Ying
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT X, 2023, 14229 : 602 - 612
  • [4] CAEVT: Convolutional Autoencoder Meets Lightweight Vision Transformer for Hyperspectral Image Classification
    Zhang, Zhiwen
    Li, Teng
    Tang, Xuebin
    Hu, Xiang
    Peng, Yuanxi
    SENSORS, 2022, 22 (10)
  • [5] Image Retrieval Using Convolutional Autoencoder, InfoGAN, and Vision Transformer Unsupervised Models
    Sabry, Eman S.
    Elagooz, Salah S.
    Abd El-Samie, Fathi E.
    El-Shafai, Walid
    El-Bahnasawy, Nirmeen A.
    El-Banby, Ghada M.
    Algarni, Abeer D.
    Soliman, Naglaa F.
    Ramadan, Rabie A.
    IEEE ACCESS, 2023, 11 : 20445 - 20477
  • [6] Hyperspectral Image Classification Using Groupwise Separable Convolutional Vision Transformer Network
    Zhao, Zhuoyi
    Xu, Xiang
    Li, Shutao
    Plaza, Antonio
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 17
  • [7] A 3-D Convolutional Vision Transformer for PolSAR Image Classification and Change Detection
    Wang, Lei
    Gui, Rong
    Hong, Hanyu
    Hu, Jun
    Ma, Lei
    Shi, Yu
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 11503 - 11520
  • [8] Res-MGCA-SE: a lightweight convolutional neural network based on vision transformer for medical image classification
    Soleimani-Fard S.
    Ko S.-B.
    Neural Computing and Applications, 2024, 36 (28) : 17631 - 17644
  • [9] PViT-AIR: Puzzling vision transformer-based affine image registration for multi histopathology and faxitron images of breast tissue
    Golestani, Negar
    Wang, Aihui
    Moallem, Golnaz
    Bean, Gregory R.
    Rusu, Mirabela
    MEDICAL IMAGE ANALYSIS, 2025, 99
  • [10] Convolutional Neural Network and Vision Transformer-driven Cross-layer Multi-scale Fusion Network for Hyperspectral Image Classification
    Zhao F.
    Geng M.
    Liu H.
    Zhang J.
    Yu J.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2024, 46 (05): : 2237 - 2248