A Cross-Modal Alignment for Zero-Shot Image Classification

被引：5

作者：

Wu, Lu ^{[1
,2
]}

Wu, Chenyu ^{[2
]}

Guo, Han ^{[3
]}

Zhao, Zhihao ^{[2
]}

机构：

[1] Minist Nat Resources, Key Lab Urban Land Resources Monitoring & Simulat, Shenzhen 518000, Peoples R China

[2] Wuhan Univ Technol, Sch Informat Engn, Wuhan 430070, Peoples R China

[3] Wuhan Univ, Sch Resource & Environm Sci, Wuhan 430079, Peoples R China

来源：

IEEE ACCESS | 2023年 / 11卷

关键词：

Visualization; Semantics; Training data; Feature extraction; Object recognition; Monitoring; Image classification; Cross-modal alignment; zero-shot image classification; text attribute query; cosine similarity;

D O I：

10.1109/ACCESS.2023.3237966

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Different from major classification methods based on large amounts of annotation data, we introduce a cross-modal alignment for zero-shot image classification.The key is utilizing the query of text attribute learned from the seen classes to guide local feature responses in unseen classes. First, an encoder is used to align semantic matching between visual features and their corresponding text attribute. Second, an attention module is used to get response maps through feature maps activated by the query of text attribute. Finally, the cosine distance metric is used to measure the matching degree of the text attribute and its corresponding feature response. The experiment results show that the method get better performance than existing Zero-shot Learning in embedding-based methods as well as other generative methods in CUB-200-2011 dataset.

引用

页码：9067 / 9073

页数：7

共 50 条

[11] Zero-Shot Image Classification Based on Attribute
Zhang, Wei
Chen, Wenbai
Chen, Xiangfeng
Han, Hu
2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 25 - 30
[12] Denoised and Dynamic Alignment Enhancement for Zero-Shot Learning
Ge, Jiannan
Liu, Zhihang
Li, Pandeng
Xie, Lingxi
Zhang, Yongdong
Tian, Qi
Xie, Hongtao
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 1501 - 1515
[13] Side-Scan Sonar Image Classification With Zero-Shot and Style Transfer
Bai, Zhongyu
Xu, Hongli
Ding, Qichuan
Zhang, Xiangyue
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 15
[14] Enhanced VAEGAN: a zero-shot image classification method
Ding, Bo
Fan, Yufei
He, Yongjun
Zhao, Jing
APPLIED INTELLIGENCE, 2023, 53 (08) : 9235 - 9246
[15] Enhanced VAEGAN: a zero-shot image classification method
Bo Ding
Yufei Fan
Yongjun He
Jing Zhao
Applied Intelligence, 2023, 53 : 9235 - 9246
[16] MCCE-REC: MLLM-Driven Cross-Modal Contrastive Entropy Model for Zero-Shot Referring Expression Comprehension
Qiu, Heqian
Wang, Lanxiao
Zhao, Taijin
Meng, Fanman
Wu, Qingbo
Li, Hongliang
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (01) : 754 - 768
[17] Multi-Modal Multi-Grained Embedding Learning for Generalized Zero-Shot Video Classification
Hong, Mingyao
Zhang, Xinfeng
Li, Guorong
Huang, Qingming
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 5959 - 5972
[18] Heterogeneous Feature Alignment and Fusion in Cross-Modal Augmented Space for Composed Image Retrieval
Pang, Huaxin
Wei, Shikui
Zhang, Gangjian
Zhang, Shiyin
Qiu, Shuang
Zhao, Yao
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 6446 - 6457
[19] Zero-Shot Image Classification Method Based on Attribute Weighting
Chen, Wenbai
Chen, Xiangfeng
Liu, Chang
Wu, Hao
Li, Denghua
PROCEEDINGS OF 2019 6TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2019, : 84 - 88
[20] Zero-Shot Image Classification: Recent Status and Future Trends
Feng, Xiaodong
Liu, Ying
Chiew, Tuan Kiang
2024 6TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING, ICNLP 2024, 2024, : 609 - 618

← 1 2 3 4 5 →