Audio-Visual Generalized Zero-Shot Learning Based on Variational Information Bottleneck

被引:0
作者
Li, Yapeng
Luo, Yong [1 ]
Du, Bo [1 ]
机构
[1] Wuhan Univ, Inst Artificial Intelligence, Sch Comp Sci, Natl Engn Res Ctr Multimedia Software, Wuhan 430072, Peoples R China
来源
2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME | 2023年
基金
中国国家自然科学基金;
关键词
Audio-visual; generalized zero-shot learning; information bottleneck; multi-modality fusion;
D O I
10.1109/ICME55011.2023.00084
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Audio-visual generalized zero-shot learning (GZSL) aims to train a model on seen classes for classifying data samples from both seen classes and unseen classes. Due to the absence of unseen training samples, the model tends to misclassify unseen class samples into seen classes. To mitigate this problem, in this paper, we propose a method based on variational information bottleneck for audio-visual GZSL. Specifically, we model the joint representations as a product-of-experts over marginal representations to integrate the information of audio and visual. Besides, we introduce variational information bottleneck to the learning of audio-visual joint representations and marginal representations of audio, visual, and text label modalities. This helps our model reduce the negative impact of information that cannot be generalized to unseen classes. Experimental results conducted on the UCF-GZSL, VGGSound-GZSL, and ActivityNet-GZSL benchmarks demonstrate the effectiveness and superiority of the proposed model for audio-visual GZSL.
引用
收藏
页码:450 / 455
页数:6
相关论文
共 50 条
  • [31] Contrastive embedding-based feature generation for generalized zero-shot learning
    Wang, Han
    Zhang, Tingting
    Zhang, Xiaoxuan
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (05) : 1669 - 1681
  • [32] Augmented semantic feature based generative network for generalized zero-shot learning
    Li, Zhiqun
    Chen, Qiong
    Liu, Qingfa
    NEURAL NETWORKS, 2021, 143 : 1 - 11
  • [33] Contrastive embedding-based feature generation for generalized zero-shot learning
    Han Wang
    Tingting Zhang
    Xiaoxuan Zhang
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 1669 - 1681
  • [34] Vision transformer-based generalized zero-shot learning with data criticizing
    Zhou, Quan
    Liang, Yucuan
    Zhang, Zhenqi
    Cao, Wenming
    APPLIED INTELLIGENCE, 2025, 55 (06)
  • [35] A Generalized Zero-Shot Learning Scheme for SSVEP-Based BCI System
    Wang, Xietian
    Liu, Aiping
    Wu, Le
    Li, Chang
    Liu, Yu
    Chen, Xun
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 : 863 - 874
  • [36] A Generalized Zero-Shot Learning Scheme for SSVEP-Based BCI System
    Wang, Xietian
    Liu, Aiping
    Wu, Le
    Li, Chang
    Liu, Yu
    Chen, Xun
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 : 863 - 874
  • [37] Estimation of Near-Instance-Level Attribute Bottleneck for Zero-Shot Learning
    Jiang, Chenyi
    Shen, Yuming
    Chen, Dubing
    Zhang, Haofeng
    Shao, Ling
    Torr, Philip H. S.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (08) : 2962 - 2988
  • [38] Vision transformer-based generalized zero-shot learning with data criticizingVision transformer-based generalized zero-shot learning with data criticizingQ. Zhou et al.
    Quan Zhou
    Yucuan Liang
    Zhenqi Zhang
    Wenming Cao
    Applied Intelligence, 2025, 55 (6)
  • [39] Adaptive Margin-based Contrastive Network for Generalized Zero-Shot Learning
    Lee, Jeong-Cheol
    Shibu, Athul
    Lee, Dong-Gyu
    2023 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, ICCE, 2023,
  • [40] Augmented Multimodality Fusion for Generalized Zero-Shot Sketch-Based Visual Retrieval
    Jing, Taotao
    Xia, Haifeng
    Hamm, Jihun
    Ding, Zhengming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 3657 - 3668