Semantic-diversity transfer network for generalized zero-shot learning via inner disagreement based OOD detector

被引:9
|
作者
Liu, Bo [1 ,2 ]
Dong, Qiulei [1 ,3 ,4 ]
Hu, Zhanyi [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Future Technol, Beijing 100049, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[4] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Shanghai 200031, Peoples R China
基金
中国国家自然科学基金;
关键词
Zero-shot learning; Visual-semantic embedding; Out-of-distribution detection;
D O I
10.1016/j.knosys.2021.107337
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot learning (ZSL) aims to recognize objects from unseen classes, where the key is to transfer knowledge from seen classes to unseen classes by establishing appropriate mappings between visual and semantic features. Currently, the knowledge transfer in many existing works is rather limited due to various factors: (i) the widely used visual features are global ones, and they are not completely consistent with semantic attributes; (ii) only one mapping is learned, which is not able to effectively model diverse visual-semantic relations; (iii) the bias problem in the generalized ZSL (GZSL) could not be effectively handled. In this paper, we propose two techniques to alleviate these limitations. Firstly, we propose a Semantic-diversity transfer Network (SetNet) addressing the first two limitations, where (1) a multiple-attention architecture and a diversity regularizer are proposed to learn multiple local visual features being more consistent with semantic attributes and (2) a projector ensemble which geometrically takes diverse local features as inputs is proposed to diversify visual-semantic relations. Secondly, we propose an inner disagreement based domain detection module (ID3M) for GZSL to alleviate the bias problem, which picks out unseen-class data before class-level classification. Due to the lack of unseen-class data in the training stage, ID3M employs a novel self-contained training scheme and detects out unseen-class data based on a proposed inner disagreement criterion. Experimental results on three public datasets show that the proposed SetNet with the explored ID3M achieves a significant improvement against many state-of-the-art methods. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Augmented semantic feature based generative network for generalized zero-shot learning
    Li, Zhiqun
    Chen, Qiong
    Liu, Qingfa
    NEURAL NETWORKS, 2021, 143 : 1 - 11
  • [2] Semantic Contrastive Embedding for Generalized Zero-Shot Learning
    Zongyan Han
    Zhenyong Fu
    Shuo Chen
    Jian Yang
    International Journal of Computer Vision, 2022, 130 : 2606 - 2622
  • [3] Semantic Feature Extraction for Generalized Zero-Shot Learning
    Kim, Junhan
    Shim, Kyuhong
    Shim, Byonghyo
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1166 - 1173
  • [4] Semantic Contrastive Embedding for Generalized Zero-Shot Learning
    Han, Zongyan
    Fu, Zhenyong
    Chen, Shuo
    Yang, Jian
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (11) : 2606 - 2622
  • [5] Transfer Increment for Generalized Zero-Shot Learning
    Feng, Liangjun
    Zhao, Chunhui
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (06) : 2506 - 2520
  • [6] Visual-semantic consistency matching network for generalized zero-shot learning
    Zhang, Zhenqi
    Cao, Wenming
    NEUROCOMPUTING, 2023, 536 : 30 - 39
  • [7] Attributes learning network for generalized zero-shot learning
    Yun, Yu
    Wang, Sen
    Hou, Mingzhen
    Gao, Quanxue
    NEURAL NETWORKS, 2022, 150 : 112 - 118
  • [8] Improving generalized zero-shot learning via cluster-based semantic disentangling representation
    Gao, Yi
    Feng, Wentao
    Xiao, Rong
    He, Lihuo
    He, Zhenan
    Lv, Jiancheng
    Tang, Chenwei
    PATTERN RECOGNITION, 2024, 150
  • [9] Attentive Semantic Preservation Network for Zero-Shot Learning
    Lu, Ziqian
    Yu, Yunlong
    Lu, Zhe-Ming
    Shen, Feng-Li
    Zhang, Zhongfei
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 2919 - 2925
  • [10] Learning Unbiased Zero-Shot Semantic Segmentation Networks Via Transductive Transfer
    Lv, Fengmao
    Liu, Haiyang
    Wang, Yichen
    Zhao, Jiayi
    Yang, Guowu
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 1640 - 1644