Semantic-Aligned Attention With Refining Feature Embedding for Few-Shot Image Classification

被引：2

作者：

Xu, Xianda ^{[1
,2
,3
]}

Xu, Xing ^{[1
,2
]}

Shen, Fumin ^{[1
,2
]}

Li, Yujie ^{[4
]}

机构：

[1] Univ Elect Sci & Technol China, Ctr Future Multimedia, Chengdu 611731, Peoples R China

[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China

[3] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA

[4] Yangzhou Univ, Sch Informat Engn, Yangzhou 225002, Jiangsu, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2022年 / 23卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Semantics; Task analysis; Visualization; Training; Feature extraction; Autonomous vehicles; Real-time systems; Autonomous driving; few-shot image classification; zero-shot image classification; attention mechanism; visual-semantic alignment; RECOGNITION; NETWORKS;

D O I：

10.1109/TITS.2021.3127632

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Autonomous driving relies on trusty visual recognition of surrounding objects. Few-shot image classification is used in autonomous driving to help recognize objects that are rarely seen. Successful embedding and metric-learning approaches to this task normally learn a feature comparison framework between an unseen image and the labeled images. However, these approaches usually have problems with ambiguous feature embedding because they tend to ignore important local visual and semantic information when extracting intra-class common features from the images. In this paper, we introduce a Semantic-Aligned Attention (SAA) mechanism to refine feature embedding and it can be applied to most of the existing embedding and metric-learning approaches. The mechanism highlights pivotal local visual information with attention mechanism and aligns the attentive map with semantic information to refine the extracted features. Incorporating the proposed mechanism into the prototypical network, evaluation results reveal competitive improvements in both few-shot and zero-shot classification tasks on various benchmark datasets.

引用

页码：25458 / 25468

页数：11

共 65 条

[1] Akata Z, 2015, PROC CVPR IEEE, P2927, DOI 10.1109/CVPR.2015.7298911
[2] Label-Embedding for Attribute-Based Classification
Akata, Zeynep
Perronnin, Florent
Harchaoui, Zaid
Schmid, Cordelia
[J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 819 - 826
[3] Antoniou Antreas, 2018, INT C LEARN REPR
[4] Bin Liu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12349), P438, DOI 10.1007/978-3-030-58548-8_26
[5] Improving Semantic Embedding Consistency by Metric Learning for Zero-Shot Classiffication
Bucher, Maxime
Herbin, Stephane
Jurie, Frederic
[J]. COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 730 - 746
[6] Memory Matching Networks for One-Shot Image Recognition
Cai, Qi
Pan, Yingwei
Yao, Ting
Yan, Chenggang
Mei, Tao
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4080 - 4088
[7] Chen H., ARXIV210311383, V2021
[8] Chen ZT, 2019, AAAI CONF ARTIF INTE, P3379
[9] Finn C, 2017, PR MACH LEARN RES, V70
[10] Finney D. J., 1952, J AM STAT ASSOC, DOI [10.2307/2280787, DOI 10.2307/2280787]

← 1 2 3 4 5 6 7 →