Audio-Visual Generalized Zero-Shot Learning Based on Variational Information Bottleneck

Cited by: 0

Authors:

Li, Yapeng

Luo, Yong [1]

Du, Bo [1]

Affiliations:

[1] Wuhan Univ, Inst Artificial Intelligence, Sch Comp Sci, Natl Engn Res Ctr Multimedia Software, Wuhan 430072, Peoples R China

Source:

2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME | 2023

Funding:

National Natural Science Foundation of China;

Keywords:

Audio-visual; generalized zero-shot learning; information bottleneck; multi-modality fusion;

DOI:

10.1109/ICME55011.2023.00084

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Audio-visual generalized zero-shot learning (GZSL) aims to train a model on seen classes so that it can classify data samples from both seen and unseen classes. Because no training samples are available for unseen classes, the model tends to misclassify unseen-class samples as seen classes. To mitigate this problem, we propose a method based on the variational information bottleneck for audio-visual GZSL. Specifically, we model the joint representations as a product-of-experts over marginal representations to integrate audio and visual information. In addition, we apply the variational information bottleneck to the learning of the audio-visual joint representations and of the marginal representations of the audio, visual, and text label modalities. This helps our model reduce the negative impact of information that cannot be generalized to unseen classes. Experiments on the UCF-GZSL, VGGSound-GZSL, and ActivityNet-GZSL benchmarks demonstrate the effectiveness and superiority of the proposed model for audio-visual GZSL.
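
The two ingredients named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes each modality's marginal representation is a diagonal Gaussian, that the bottleneck prior is a standard normal, and that the bottleneck enters the loss as a KL penalty weighted by a hypothetical coefficient `beta`. All function names are illustrative.

```python
import math

def product_of_experts(mus, logvars):
    """Fuse diagonal-Gaussian experts N(mu_i, var_i) into one Gaussian.

    mus, logvars: one list of per-dimension values per expert (e.g. audio, visual).
    The normalized product of Gaussians is Gaussian, with precision equal to
    the sum of expert precisions and a precision-weighted mean.
    """
    dim = len(mus[0])
    fused_mu, fused_logvar = [], []
    for d in range(dim):
        precisions = [math.exp(-lv[d]) for lv in logvars]
        total = sum(precisions)
        fused_mu.append(sum(p * m[d] for p, m in zip(precisions, mus)) / total)
        fused_logvar.append(-math.log(total))
    return fused_mu, fused_logvar

def kl_to_standard_normal(mu, logvar):
    """KL(N(mu, diag(exp(logvar))) || N(0, I)), summed over dimensions."""
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv for m, lv in zip(mu, logvar))

def vib_objective(task_loss, mu, logvar, beta=1e-3):
    """Task loss plus a beta-weighted KL penalty (assumed form of the bottleneck)."""
    return task_loss + beta * kl_to_standard_normal(mu, logvar)

# Hypothetical audio and visual experts with unit variance in two dimensions.
audio_mu, audio_logvar = [0.0, 2.0], [0.0, 0.0]
visual_mu, visual_logvar = [2.0, 2.0], [0.0, 0.0]
joint_mu, joint_logvar = product_of_experts(
    [audio_mu, visual_mu], [audio_logvar, visual_logvar]
)
```

In this toy case the joint mean falls between the two expert means where they disagree, and the joint variance is smaller than either expert's, which is the usual behavior of a product of Gaussian experts. The KL term penalizes representations that drift from the prior, which is one common way to discourage encoding information beyond what the task needs.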

Pages: 450-455

Number of pages: 6
