Semantic-diversity transfer network for generalized zero-shot learning via inner disagreement based OOD detector

被引:9
|
作者
Liu, Bo [1 ,2 ]
Dong, Qiulei [1 ,3 ,4 ]
Hu, Zhanyi [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Future Technol, Beijing 100049, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[4] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Shanghai 200031, Peoples R China
基金
中国国家自然科学基金;
关键词
Zero-shot learning; Visual-semantic embedding; Out-of-distribution detection;
D O I
10.1016/j.knosys.2021.107337
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot learning (ZSL) aims to recognize objects from unseen classes, where the key is to transfer knowledge from seen classes to unseen classes by establishing appropriate mappings between visual and semantic features. Currently, the knowledge transfer in many existing works is rather limited due to various factors: (i) the widely used visual features are global ones, and they are not completely consistent with semantic attributes; (ii) only one mapping is learned, which is not able to effectively model diverse visual-semantic relations; (iii) the bias problem in the generalized ZSL (GZSL) could not be effectively handled. In this paper, we propose two techniques to alleviate these limitations. Firstly, we propose a Semantic-diversity transfer Network (SetNet) addressing the first two limitations, where (1) a multiple-attention architecture and a diversity regularizer are proposed to learn multiple local visual features being more consistent with semantic attributes and (2) a projector ensemble which geometrically takes diverse local features as inputs is proposed to diversify visual-semantic relations. Secondly, we propose an inner disagreement based domain detection module (ID3M) for GZSL to alleviate the bias problem, which picks out unseen-class data before class-level classification. Due to the lack of unseen-class data in the training stage, ID3M employs a novel self-contained training scheme and detects out unseen-class data based on a proposed inner disagreement criterion. Experimental results on three public datasets show that the proposed SetNet with the explored ID3M achieves a significant improvement against many state-of-the-art methods. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Dual-focus transfer network for zero-shot learning
    Jia, Zhen
    Zhang, Zhang
    Shan, Caifeng
    Wang, Liang
    Tan, Tieniu
    NEUROCOMPUTING, 2023, 541
  • [42] Generative Model with Semantic Embedding and Integrated Classifier for Generalized Zero-Shot Learning
    Pambala, Ayyappa Kumar
    Dutta, Titir
    Biswas, Soma
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 1226 - 1235
  • [43] A generalized zero-shot semantic learning model for batch process fault diagnosis
    Liu, Kai
    Zhao, Xiaoqiang
    Mou, Miao
    Hui, Yongyong
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (01)
  • [44] Semantic-guided Reinforced Region Embedding for Generalized Zero-Shot Learning
    Ge, Jiannan
    Xie, Hongtao
    Min, Shaobo
    Zhang, Yongdong
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1406 - 1414
  • [45] A Semantic Encoding Out-of-Distribution Classifier for Generalized Zero-Shot Learning
    Ding, Jiayu
    Hu, Xiao
    Zhong, Xiaorong
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1395 - 1399
  • [46] Progressive Semantic-Visual Mutual Adaption for Generalized Zero-Shot Learning
    Liu, Man
    Li, Feng
    Zhang, Chunjie
    Wei, Yunchao
    Bai, Huihui
    Zhao, Yao
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15337 - 15346
  • [47] Dual-level contrastive learning network for generalized zero-shot learning
    Jiaqi Guan
    Min Meng
    Tianyou Liang
    Jigang Liu
    Jigang Wu
    The Visual Computer, 2022, 38 : 3087 - 3095
  • [48] How to Transfer? Zero-Shot Object Recognition via Hierarchical Transfer of Semantic Attributes
    Al-Halah, Ziad
    Stiefelhagen, Rainer
    2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 837 - 843
  • [49] Dual-level contrastive learning network for generalized zero-shot learning
    Guan, Jiaqi
    Meng, Min
    Liang, Tianyou
    Liu, Jigang
    Wu, Jigang
    VISUAL COMPUTER, 2022, 38 (9-10): : 3087 - 3095
  • [50] Alleviating Domain Shift via Discriminative Learning for Generalized Zero-Shot Learning
    Ye, Yalan
    He, Yukun
    Pan, Tongjie
    Li, Jingjing
    Shen, Heng Tao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1325 - 1337