Zero-Shot Cross-Media Embedding Learning With Dual Adversarial Distribution Network

被引:36
|
作者
Chi, Jingze [1 ]
Peng, Yuxin [1 ]
机构
[1] Peking Univ, Inst Comp Sci & Technol, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
Gallium nitride; Semantics; Media; Correlation; Training; Dogs; Measurement; Cross-media retrieval; zero-shot learning; generative adversarial networks; maximum mean discrepancy; REPRESENTATION; RETRIEVAL;
D O I
10.1109/TCSVT.2019.2900171
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Existing cross-media retrieval methods are mainly based on the condition where the training set covers all the categories in the testing set, which lack extensibility to retrieve data of new categories. Thus, zero-shot cross-media retrieval has been a promising direction in practical application, aiming to retrieve data of new categories (unseen categories), only with data of limited known categories (seen categories) for training. It is challenging for not only the heterogeneous distributions across different media types, but also the inconsistent semantics across seen and unseen categories need to be handled. To address the above issues, we propose dual adversarial distribution network (DADN), to learn common embeddings and explore the knowledge from word-embeddings of different categories. The main contributions are as follows. First, zero-shot cross-media dual generative adversarial networks architecture is proposed, in which two kinds of generative adversarial networks (GANs) for common embedding generation and representation reconstruction form dual processes. The dual GANs mutually promote to model semantic and underlying structure information, which generalizes across different categories on heterogeneous distributions and boosts correlation learning. Second, distribution matching with maximum mean discrepancy criterion is proposed to combine with dual GANs, which enhances distribution matching between common embeddings and category word-embeddings. Finally, adversarial inter-media metric constraint is proposed with an inter-media loss and a quadruplet loss, which further model the inter-media correlation information and improve semantic ranking ability. The experiments on four widely used cross-media datasets demonstrate the effectiveness of our DADN approach.
引用
收藏
页码:1173 / 1187
页数:15
相关论文
共 50 条
  • [41] A Discriminative Cross-Aligned Variational Autoencoder for Zero-Shot Learning
    Liu, Yang
    Gao, Xinbo
    Han, Jungong
    Shao, Ling
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (06) : 3794 - 3805
  • [42] Language-Augmented Pixel Embedding for Generalized Zero-Shot Learning
    Wang, Ziyang
    Gou, Yunhao
    Li, Jingjing
    Zhu, Lei
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (03) : 1019 - 1030
  • [43] Investigating the Bilateral Connections in Generative Zero-Shot Learning
    Li, Jingjing
    Jing, Mengmeng
    Lu, Ke
    Zhu, Lei
    Shen, Heng Tao
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (08) : 8167 - 8178
  • [44] Contrast and Aggregation Network for Generalized Zero-shot Learning
    Li, Bin
    Xie, Cheng
    Yang, Jingqi
    Duan, Haoran
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 383 - 394
  • [45] Zero-shot Cross-modal Retrieval by Assembling AutoEncoder and Generative Adversarial Network
    Xu, Xing
    Tian, Jialin
    Lin, Kaiyi
    Lu, Huimin
    Shao, Jie
    Shen, Heng Tao
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (01)
  • [46] ZERO-SHOT LEARNING OF A CONDITIONAL GENERATIVE ADVERSARIAL NETWORK FOR DATA-FREE NETWORK QUANTIZATION
    Choi, Yoojin
    El-Khamy, Mostafa
    Lee, Jungwon
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3552 - 3556
  • [47] Cross-modal propagation network for generalized zero-shot learning
    Guo, Ting
    Liang, Jianqing
    Liang, Jiye
    Xie, Guo-Sen
    PATTERN RECOGNITION LETTERS, 2022, 159 : 125 - 131
  • [48] Ternary Adversarial Networks With Self-Supervision for Zero-Shot Cross-Modal Retrieval
    Xu, Xing
    Lu, Huimin
    Song, Jingkuan
    Yang, Yang
    Shen, Heng Tao
    Li, Xuelong
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (06) : 2400 - 2413
  • [49] SR-GAN: SEMANTIC RECTIFYING GENERATIVE ADVERSARIAL NETWORK FOR ZERO-SHOT LEARNING
    Ye, Zihan
    Lyu, Fan
    Li, Linyan
    Fu, Qiming
    Ren, Jinchang
    Hu, Fuyuan
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 85 - 90
  • [50] Distribution and gradient constrained embedding model for zero-shot learning with fewer seen samples
    Zhang, Jing
    Geng, YangLi-ao
    Wang, Wen
    Sun, Wenju
    Yang, Zhirong
    Li, Qingyong
    KNOWLEDGE-BASED SYSTEMS, 2022, 251