A Cross-Modal Alignment for Zero-Shot Image Classification

Cited by: 5
Authors
Wu, Lu [1, 2]
Wu, Chenyu [2 ]
Guo, Han [3 ]
Zhao, Zhihao [2 ]
Affiliations
[1] Minist Nat Resources, Key Lab Urban Land Resources Monitoring & Simulat, Shenzhen 518000, Peoples R China
[2] Wuhan Univ Technol, Sch Informat Engn, Wuhan 430070, Peoples R China
[3] Wuhan Univ, Sch Resource & Environm Sci, Wuhan 430079, Peoples R China
Keywords
Visualization; Semantics; Training data; Feature extraction; Object recognition; Monitoring; Image classification; Cross-modal alignment; Zero-shot image classification; Text attribute query; Cosine similarity
DOI
10.1109/ACCESS.2023.3237966
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Unlike mainstream classification methods, which rely on large amounts of annotated data, we introduce a cross-modal alignment method for zero-shot image classification. The key idea is to use text attribute queries learned from the seen classes to guide local feature responses in unseen classes. First, an encoder aligns visual features with their corresponding text attributes through semantic matching. Second, an attention module produces response maps from the feature maps activated by the text attribute queries. Finally, a cosine distance metric measures how well each text attribute matches its corresponding feature response. Experimental results show that the method outperforms existing embedding-based zero-shot learning methods, as well as other generative methods, on the CUB-200-2011 dataset.
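The three steps in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the encoder is assumed to have already produced the local visual features and the attribute queries, the scaled dot-product form of the attention is an assumption, and the names attribute_responses, predict_class, and the class_attributes matrix (used here only to turn attribute scores into a class prediction) are hypothetical.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attribute_responses(features, queries):
    """Attention-pool local features with each attribute query.

    features: (R, D) local visual features, one row per image region.
    queries:  (A, D) text attribute queries in the same embedding space.
    Returns an (A, D) array holding one feature response per attribute.
    """
    d = features.shape[1]
    # Assumed attention form: scaled dot product, softmax over regions.
    attn = softmax(queries @ features.T / np.sqrt(d), axis=1)  # (A, R)
    return attn @ features

def cosine_similarity(a, b, eps=1e-8):
    # Row-wise cosine similarity between two (A, D) arrays.
    num = (a * b).sum(axis=1)
    den = np.linalg.norm(a, axis=1) * np.linalg.norm(b, axis=1) + eps
    return num / den

def predict_class(features, queries, class_attributes):
    """Score classes from attribute matches.

    class_attributes: (C, A) per-class attribute strengths (hypothetical
    input; the abstract does not say how class scores are formed).
    """
    responses = attribute_responses(features, queries)
    match = cosine_similarity(queries, responses)  # (A,) matching degrees
    scores = class_attributes @ match              # (C,) class scores
    return int(np.argmax(scores)), match
```

Under these assumptions, an unseen class needs only a row in class_attributes; no images of that class are required when the queries are learned.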
Pages: 9067-9073
Number of pages: 7