Deep Cross-View Reconstruction GAN Based on Correlated Subspace for Multi-View Transformation

被引:1
|
作者
Mi, Jian-Xun [1 ,2 ]
He, Junchang [1 ,2 ]
Li, Weisheng [1 ,2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Key Lab Big Data Intelligent Comp, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Coll Comp Sci & Technol, Chongqing 400065, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Image reconstruction; Generative adversarial networks; Face recognition; Data mining; Correlation; Image transformation; Image-to-image transformation; generative adversarial network; multi-view learning; subspace learning; canonical correlation analysis;
D O I
10.1109/TIP.2024.3442610
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In scenarios where identifying face information in the visible spectrum (VIS) is challenging due to poor lighting conditions, the use of near-infrared (NIR) and thermal (TH) cameras can provide viable alternatives. However, the unique data distribution of images captured by these cameras compared to VIS images presents challenges in matching face identities. To address these challenges, we propose a novel image transformation framework. The framework includes feature extraction from the input image, followed by a transformation network that generates target domain images with perceptual fidelity. Additionally, a reconstruction network preserves original information by reconstructing the original domain image from the extracted features. By considering the correlation between features from both domains, our framework utilizes paired data obtained from the same individual. We apply this framework to two well-established image-to-image transformation models, pix2pix and CycleGAN, known as CRC-pix2pix and CRC-CycleGAN respectively. The versatility of our approach allows extension to other models based on pix2pix or CycleGAN architectures. Our models generate high-quality images while preserving the identity information of the original face. Performance evaluation on TFW and BUAA NIR-VIS datasets demonstrates the superiority of our models in terms of generated image face matching and evaluation metrics such as SSIM, MSE, PSNR, and LPIPS. Moreover, we introduce the CQUPT-VIS-TH dataset, which enriches the paired dataset with thermal-visual face data capturing various angles and expressions.
引用
收藏
页码:4614 / 4626
页数:13
相关论文
共 50 条
  • [1] Multi-view Deep Network for Cross-view Classification
    Kan, Meina
    Shan, Shiguang
    Chen, Xilin
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4847 - 4855
  • [2] Deep cross-view autoencoder network for multi-view learning
    Mi, Jian-Xun
    Fu, Chang-Qing
    Chen, Tao
    Gou, Tingting
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 24645 - 24664
  • [3] Deep cross-view autoencoder network for multi-view learning
    Jian-Xun Mi
    Chang-Qing Fu
    Tao Chen
    Tingting Gou
    Multimedia Tools and Applications, 2022, 81 : 24645 - 24664
  • [4] CDD: Multi-view Subspace Clustering via Cross-view Diversity Detection
    Huang, Shudong
    Tsang, Ivor W.
    Xu, Zenglin
    Lv, Jiancheng
    Liu, Quanhui
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2308 - 2316
  • [5] Cross-view Topology Based Consistent and Complementary Information for Deep Multi-view Clustering
    Dong, Zhibin
    Wang, Siwei
    Jin, Jiaqi
    Liu, Xinwang
    Zhu, En
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 19383 - 19394
  • [6] Cross-View Fusion for Multi-View Clustering
    Huang, Zhijie
    Huang, Binqiang
    Zheng, Qinghai
    Yu, Yuanlong
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 621 - 625
  • [7] Cross-view and multi-view gait recognitions based on view transformation model using multi-layer perceptron
    Kusakunniran, Worapan
    Wu, Qiang
    Zhang, Jian
    Li, Hongdong
    PATTERN RECOGNITION LETTERS, 2012, 33 (07) : 882 - 889
  • [8] Cross-view Transformer for enhanced multi-view 3D reconstruction
    Shi, Wuzhen
    Yin, Aixue
    Li, Yingxiang
    Qian, Bo
    VISUAL COMPUTER, 2024,
  • [9] BACVC: Bi-adaptive and cross-view consistency for incomplete multi-view subspace clustering
    Zhan, Jiaqiyu
    Zhu, Yuesheng
    Luo, Guibo
    ALEXANDRIA ENGINEERING JOURNAL, 2025, 119 : 623 - 633
  • [10] Consensus Low-Rank Multi-View Subspace Clustering With Cross-View Diversity Preserving
    Kang, Kehan
    Chen, Chenglizhao
    Peng, Chong
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1512 - 1516