Instance-wise multi-view visual fusion for zero-shot learning

被引:0
|
作者
Tang, Long [1 ,2 ]
Zhao, Jingtao [1 ]
Tian, Yingjie [3 ]
Yao, Changhua [4 ]
Pardalos, Panos M. [5 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Artificial Intelligence, Nanjing 210044, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Res Inst Talent Big Data, Nanjing 210044, Peoples R China
[3] Chinese Acad Sci, Res Ctr Fictitious Econ & Data Sci, Beijing 100190, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, Sch Elect & Informat Engn, Nanjing 210044, Peoples R China
[5] Univ Florida, Ctr Appl Optimizat, Dept Ind & Syst Engn, Gainesville, FL 32611 USA
关键词
Zero shot learning; Multi-view visual fusion; Consensus principle; Complementary principle; Multi-view manifold regularization;
D O I
10.1016/j.asoc.2024.112339
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot learning (ZSL) has become increasing popular in computer vision due to its ability to recognize categories unobserved in the training data. So far, most existing ZSL approaches adopt visual representations that are either derived from pretrained networks or learned using an end-to-end architecture. However, a single group of visual representations can hardly capture all features hidden in the images, yielding incomplete visual information. In numerous real-life scenarios, multi-view visual representations are often accessible which describe the instances more comprehensively and are potential for better learning performance. In this paper, we introduce an instance-wise multi-view visual fusion (IMVF) for zero-shot learning (ZSL) model. In accordance with the consensus principle, a multi-view visual-semantic mapping is created by minimizing the disparities of seen-class semantic projections on different views. Meanwhile, a straightforward linear constraint is performed on each seen-class instance to adhere to the complementary principle so that the cross-view information exchange is well motivated. In order to mitigate the domain shift problem, the predicted unseen-class semantic projections are further refined through a multi-view manifold alignment under the consensus principle. Our proposed IMVFZSL is compared with the State-of-the-Art ZSL methods on AwA2, CUB and SUN datasets. Exciting experimental results validate the effectiveness of the IMVF mechanism. To the best of our understanding, this is an initial attempt to fuse multi-view visual representations in ZSL, which will stimulate valuable contemplation in this field.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Instance-wise multi-view representation learning
    Li, Dan
    Wang, Haibao
    Wang, Yufeng
    Wang, Shengpei
    INFORMATION FUSION, 2023, 91 : 612 - 622
  • [2] Class-wise and instance-wise contrastive learning for zero-shot learning based on VAEGAN
    Zheng, Baolong
    Li, Zhanshan
    Li, Jingyao
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 272
  • [3] Transductive Multi-View Zero-Shot Learning
    Fu, Yanwei
    Hospedales, Timothy M.
    Xiang, Tao
    Gong, Shaogang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (11) : 2332 - 2345
  • [4] Fusion by synthesizing: A multi-view deep neural network for zero-shot recognition
    Xu, Xing
    Zhou, Xiang
    Shen, Fumin
    Gao, Lianli
    Shen, Heng Tao
    Li, Xuelong
    SIGNAL PROCESSING, 2019, 164 : 354 - 367
  • [5] Multi-view enhanced zero-shot node classification
    Wang, Jiahui
    Wu, Likang
    Zhao, Hongke
    Jia, Ning
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (06)
  • [6] Multi-view graph representation with similarity diffusion for general zero-shot learning
    Yu, Beibei
    Xie, Cheng
    Tang, Peng
    Duan, Haoran
    NEURAL NETWORKS, 2023, 166 : 38 - 50
  • [7] Transductive Multi-view Embedding for Zero-Shot Recognition and Annotation
    Fu, Yanwei
    Hospedales, Timothy M.
    Xiang, Tao
    Fu, Zhenyong
    Gong, Shaogang
    COMPUTER VISION - ECCV 2014, PT II, 2014, 8690 : 584 - 599
  • [8] Zero-Shot Instance Segmentation
    Zheng, Ye
    Wu, Jiahong
    Qin, Yongqiang
    Zhang, Faen
    Cui, Li
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2593 - 2602
  • [9] Prioritized Semantic Learning for Zero-Shot Instance Navigation
    Sun, Xinyu
    Liu, Lizhao
    Zhi, Hongyan
    Qiu, Ronghe
    Liang, Junwei
    COMPUTER VISION - ECCV 2024, PT XII, 2025, 15070 : 161 - 178
  • [10] Zero-Shot Neural Decoding with Semi-Supervised Multi-View Embedding
    Akamatsu, Yusuke
    Maeda, Keisuke
    Ogawa, Takahiro
    Haseyama, Miki
    SENSORS, 2023, 23 (15)