Autoencoder-Based Collaborative Attention GAN for Multi-Modal Image Synthesis

被引:9
作者
Cao, Bing [1 ,2 ]
Cao, Haifang [1 ,3 ]
Liu, Jiaxu [1 ,3 ]
Zhu, Pengfei [1 ,3 ]
Zhang, Changqing [1 ,3 ]
Hu, Qinghua [1 ,3 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300403, Peoples R China
[2] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710000, Peoples R China
[3] Tianjin Univ, Haihe Lab Informat echnol Applicat Innovat, Tianjin 300403, Peoples R China
关键词
Image synthesis; Collaboration; Task analysis; Generative adversarial networks; Feature extraction; Data models; Image reconstruction; Multi-modal image synthesis; collaborative attention; single-modal attention; multi-modal attention; TRANSLATION; NETWORK;
D O I
10.1109/TMM.2023.3274990
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-modal images are required in a wide range of practical scenarios, from clinical diagnosis to public security. However, certain modalities may be incomplete or unavailable because of the restricted imaging conditions, which commonly leads to decision bias in many real-world applications. Despite the significant advancement of existing image synthesis techniques, learning complementary information from multi-modal inputs remains challenging. To address this problem, we propose an autoencoder-based collaborative attention generative adversarial network (ACA-GAN) that uses available multi-modal images to generate the missing ones. The collaborative attention mechanism deploys a single-modal attention module and a multi-modal attention module to effectively extract complementary information from multiple available modalities. Considering the significant modal gap, we further developed an autoencoder network to extract the self-representation of target modality, guiding the generative model to fuse target-specific information from multiple modalities. This considerably improves cross-modal consistency with the desired modality, thereby greatly enhancing the image synthesis performance. Quantitative and qualitative comparisons for various multi-modal image synthesis tasks highlight the superiority of our approach over several prior methods by demonstrating more precise and realistic results.
引用
收藏
页码:995 / 1010
页数:16
相关论文
共 76 条
  • [1] [Anonymous], 2010, INT C MACHINE LEARNI
  • [2] Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
  • [3] Bakas S, 2019, Arxiv, DOI [arXiv:1811.02629, 10.48550/arXiv.1811.02629, DOI 10.48550/ARXIV.1811.02629], Patent No. [US10294234B2, 10294234]
  • [4] Data Descriptor: Advancing The Cancer Genome Atlas glioma MRI collections with expert segmentation labels and radiomic features
    Bakas, Spyridon
    Akbari, Hamed
    Sotiras, Aristeidis
    Bilello, Michel
    Rozycki, Martin
    Kirby, Justin S.
    Freymann, John B.
    Farahani, Keyvan
    Davatzikos, Christos
    [J]. SCIENTIFIC DATA, 2017, 4
  • [5] Learning a Prototype Discriminator With RBF for Multimodal Image Synthesis
    Bi, Zhiwei
    Cao, Bing
    Zuo, Wangmeng
    Hu, Qinghua
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6664 - 6678
  • [6] AUTO-ASSOCIATION BY MULTILAYER PERCEPTRONS AND SINGULAR VALUE DECOMPOSITION
    BOURLARD, H
    KAMP, Y
    [J]. BIOLOGICAL CYBERNETICS, 1988, 59 (4-5) : 291 - 294
  • [7] Attenuation Correction Synthesis for Hybrid PET-MR Scanners: Application to Brain Studies
    Burgos, Ninon
    Cardoso, M. Jorge
    Thielemans, Kris
    Modat, Marc
    Pedemonte, Stefano
    Dickson, John
    Barnes, Anna
    Ahmed, Rebekah
    Mahoney, Colin J.
    Schott, Jonathan M.
    Duncan, John S.
    Atkinson, David
    Arridge, Simon R.
    Hutton, Brian F.
    Ourselin, Sebastien
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2014, 33 (12) : 2332 - 2341
  • [8] Feature-Based Fusion of Medical Imaging Data
    Calhoun, Vince D.
    Adali, Tuelay
    [J]. IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2009, 13 (05): : 711 - 720
  • [9] Cao B., 2023, IEEE Trans. Neural Netw. Learn. Syst.
  • [10] Face photo-sketch synthesis via full-scale identity supervision
    Cao, Bing
    Wang, Nannan
    Li, Jie
    Hu, Qinghua
    Gao, Xinbo
    [J]. PATTERN RECOGNITION, 2022, 124