Dynamic visual-guided selection for zero-shot learning

被引:2
|
作者
Zhou, Yuan [1 ]
Xiang, Lei [1 ]
Liu, Fan [1 ]
Duan, Haoran [2 ]
Long, Yang [2 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Artificial Intelligence, Nanjing 210044, Jiangsu, Peoples R China
[2] Univ Durham, Dept Comp Sci, Durham, England
来源
JOURNAL OF SUPERCOMPUTING | 2024年 / 80卷 / 03期
关键词
Visual-guided selection; Class prototype refinement; Task-relevant regions; Zero-shot learning;
D O I
10.1007/s11227-023-05625-1
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Zero-shot learning (ZSL) methods currently employed to identify seen or unseen classes rely on semantic attribute prototypes or class information. However, hand-annotated attributes are only for the category rather than for each image belonging to that category. Furthermore, attribute information is inconsistent across different images of the same category due to variant views. Therefore, we propose a dynamic visual-guided selection (DVGS) which helps dynamically focus on different regions and refines class prototype on each image. Instead of directly aligning an image's global feature with its semantic class vector or its local features with all attribute vectors, the proposed method learns a vision-guided soft mask to refine the class prototype for each image. Additionally, it discovers the most task-relevant regions for fine-grained recognition with the refined class prototype. Extensive experiments on three benchmarks verify the effectiveness of our DVGS and achieve the new state-of-the-art. Our DVGS achieved the best results on fine-grained datasets within both the conventional zero-shot learning (CZSL) and generalized zero-shot learning (GZSL) settings. In particular, on the SUN dataset, our DVGS demonstrates a significant superiority of 10.2% in the CZSL setting compared with the second-best approach. Similarly, our method outperforms the second-best method by an average of 4% on CUB in both the CZSL and GZSL settings. Despite securing the second-best result on the AWA2 dataset, DVGS remains closely competitive, trailing the best performance by a mere 3.4% in CZSL and 1.2% in GZSL.
引用
收藏
页码:4401 / 4419
页数:19
相关论文
共 50 条
  • [31] Hierarchical Visual Primitive Experts for Compositional Zero-Shot Learning
    Kim, Hanjae
    Lee, Jiyoung
    Park, Seongheon
    Sohn, Kwanghoon
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5652 - 5662
  • [32] Towards Visual Explainable Active Learning for Zero-Shot Classification
    Jia, Shichao
    Li, Zeyu
    Chen, Nuo
    Zhang, Jiawan
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2022, 28 (01) : 791 - 801
  • [33] Rethinking Zero-Shot Learning: A Conditional Visual Classification Perspective
    Li, Kai
    Min, Martin Renqiang
    Fu, Yun
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3582 - 3591
  • [34] Transductive Zero-Shot Learning via Visual Center Adaptation
    Wan, Ziyu
    Li, Yan
    Yang, Min
    Zhang, Junge
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 10059 - 10060
  • [35] Transductive Visual-Semantic Embedding for Zero-shot Learning
    Xu, Xing
    Shen, Fumin
    Yang, Yang
    Shao, Jie
    Huang, Zi
    PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 41 - 49
  • [36] Adversarial unseen visual feature synthesis for Zero-shot Learning
    Zhang, Haofeng
    Long, Yang
    Liu, Li
    Shao, Ling
    NEUROCOMPUTING, 2019, 329 : 12 - 20
  • [37] Visual Structure Constraint for Transductive Zero-Shot Learning in the Wild
    Wan, Ziyu
    Chen, Dongdong
    Liao, Jing
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (06) : 1893 - 1909
  • [38] Feature Selection Methods for Zero-Shot Learning of Neural Activity
    Caceres, Carlos A.
    Roos, Matthew J.
    Rupp, Kyle M.
    Milsap, Griffin
    Crone, Nathan E.
    Wolmetz, Michael E.
    Ratto, Christopher R.
    FRONTIERS IN NEUROINFORMATICS, 2017, 11
  • [39] Zero-Shot Transfer Learning Based on Visual and Textual Resemblance
    Yang, Gang
    Xu, Jieping
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT III, 2019, 11955 : 353 - 362
  • [40] Ordinal Zero-Shot Learning
    Huo, Zengwei
    Geng, Xin
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1916 - 1922