Instance-wise multi-view visual fusion for zero-shot learning

Cited by: 0

Authors:

Tang, Long [1,2]

Zhao, Jingtao [1]

Tian, Yingjie [3]

Yao, Changhua [4]

Pardalos, Panos M. [5]

Affiliations:

[1] Nanjing Univ Informat Sci & Technol, Sch Artificial Intelligence, Nanjing 210044, Peoples R China

[2] Nanjing Univ Informat Sci & Technol, Res Inst Talent Big Data, Nanjing 210044, Peoples R China

[3] Chinese Acad Sci, Res Ctr Fictitious Econ & Data Sci, Beijing 100190, Peoples R China

[4] Nanjing Univ Informat Sci & Technol, Sch Elect & Informat Engn, Nanjing 210044, Peoples R China

[5] Univ Florida, Ctr Appl Optimizat, Dept Ind & Syst Engn, Gainesville, FL 32611 USA

Source:

APPLIED SOFT COMPUTING | 2024 / Volume 167

Keywords:

Zero shot learning; Multi-view visual fusion; Consensus principle; Complementary principle; Multi-view manifold regularization;

DOI:

10.1016/j.asoc.2024.112339

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Zero-shot learning (ZSL) has become increasingly popular in computer vision due to its ability to recognize categories unobserved in the training data. So far, most existing ZSL approaches adopt visual representations that are either derived from pretrained networks or learned using an end-to-end architecture. However, a single group of visual representations can hardly capture all features hidden in the images, yielding incomplete visual information. In numerous real-life scenarios, multi-view visual representations are often accessible; they describe the instances more comprehensively and have the potential to improve learning performance. In this paper, we introduce an instance-wise multi-view visual fusion (IMVF) model for zero-shot learning. In accordance with the consensus principle, a multi-view visual-semantic mapping is created by minimizing the disparities among the seen-class semantic projections from different views. Meanwhile, a straightforward linear constraint is imposed on each seen-class instance to adhere to the complementary principle, so that cross-view information exchange is promoted. To mitigate the domain shift problem, the predicted unseen-class semantic projections are further refined through a multi-view manifold alignment under the consensus principle. The proposed IMVFZSL is compared with state-of-the-art ZSL methods on the AwA2, CUB and SUN datasets. The experimental results validate the effectiveness of the IMVF mechanism. To the best of our knowledge, this is an initial attempt to fuse multi-view visual representations in ZSL, which may stimulate further thinking in this field.
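
The consensus idea in the abstract (keeping the semantic projections of different views close to each other) can be illustrated with a small sketch. This is not the authors' IMVF formulation, which the abstract does not specify. It assumes linear visual-semantic maps, a ridge penalty, a pairwise squared-difference consensus term, and alternating closed-form updates. All function names and the parameters lam and gamma are hypothetical.

```python
import numpy as np


def ridge_maps(views, S, lam):
    """Fit one ridge-regression map per view, with no coupling between views."""
    return [
        np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ S)
        for X in views
    ]


def disagreement(views, maps):
    """Sum of squared differences between the projections of every view pair."""
    proj = [X @ W for X, W in zip(views, maps)]
    return sum(
        float(np.sum((proj[u] - proj[v]) ** 2))
        for u in range(len(proj))
        for v in range(u + 1, len(proj))
    )


def fit_consensus_maps(views, S, lam=1.0, gamma=1.0, n_iter=20):
    """Assumed objective (illustrative only):
        sum_v ||X_v W_v - S||^2 + lam * sum_v ||W_v||^2
        + gamma * sum_{u<v} ||X_u W_u - X_v W_v||^2
    minimised by exact block updates, one view at a time.
    """
    maps = ridge_maps(views, S, lam)  # start from the uncoupled solution
    n_views = len(views)
    for _ in range(n_iter):
        for v, X in enumerate(views):
            # Projections of the other views act as extra regression targets.
            others = sum(views[u] @ maps[u] for u in range(n_views) if u != v)
            lhs = (1.0 + gamma * (n_views - 1)) * (X.T @ X) + lam * np.eye(X.shape[1])
            rhs = X.T @ (S + gamma * others)
            maps[v] = np.linalg.solve(lhs, rhs)
    return maps


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    S = rng.normal(size=(50, 5))  # synthetic semantic vectors
    views = [rng.normal(size=(50, 8)), rng.normal(size=(50, 12))]  # two synthetic views
    independent = ridge_maps(views, S, lam=1.0)
    coupled = fit_consensus_maps(views, S, lam=1.0, gamma=5.0)
    print(disagreement(views, independent), disagreement(views, coupled))
```

Because the coupled fit starts from the independent ridge solution and each block update minimises the assumed objective exactly, the cross-view disagreement cannot end up larger than in the uncoupled case. The complementary constraint and the manifold-alignment refinement mentioned in the abstract are not covered by this sketch.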


Pages: 14
