Semantic-Aligned Attention With Refining Feature Embedding for Few-Shot Image Classification

被引:2
作者
Xu, Xianda [1 ,2 ,3 ]
Xu, Xing [1 ,2 ]
Shen, Fumin [1 ,2 ]
Li, Yujie [4 ]
机构
[1] Univ Elect Sci & Technol China, Ctr Future Multimedia, Chengdu 611731, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[3] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[4] Yangzhou Univ, Sch Informat Engn, Yangzhou 225002, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Task analysis; Visualization; Training; Feature extraction; Autonomous vehicles; Real-time systems; Autonomous driving; few-shot image classification; zero-shot image classification; attention mechanism; visual-semantic alignment; RECOGNITION; NETWORKS;
D O I
10.1109/TITS.2021.3127632
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Autonomous driving relies on trusty visual recognition of surrounding objects. Few-shot image classification is used in autonomous driving to help recognize objects that are rarely seen. Successful embedding and metric-learning approaches to this task normally learn a feature comparison framework between an unseen image and the labeled images. However, these approaches usually have problems with ambiguous feature embedding because they tend to ignore important local visual and semantic information when extracting intra-class common features from the images. In this paper, we introduce a Semantic-Aligned Attention (SAA) mechanism to refine feature embedding and it can be applied to most of the existing embedding and metric-learning approaches. The mechanism highlights pivotal local visual information with attention mechanism and aligns the attentive map with semantic information to refine the extracted features. Incorporating the proposed mechanism into the prototypical network, evaluation results reveal competitive improvements in both few-shot and zero-shot classification tasks on various benchmark datasets.
引用
收藏
页码:25458 / 25468
页数:11
相关论文
共 65 条
  • [1] Akata Z, 2015, PROC CVPR IEEE, P2927, DOI 10.1109/CVPR.2015.7298911
  • [2] Label-Embedding for Attribute-Based Classification
    Akata, Zeynep
    Perronnin, Florent
    Harchaoui, Zaid
    Schmid, Cordelia
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 819 - 826
  • [3] Antoniou Antreas, 2018, INT C LEARN REPR
  • [4] Bin Liu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12349), P438, DOI 10.1007/978-3-030-58548-8_26
  • [5] Improving Semantic Embedding Consistency by Metric Learning for Zero-Shot Classiffication
    Bucher, Maxime
    Herbin, Stephane
    Jurie, Frederic
    [J]. COMPUTER VISION - ECCV 2016, PT V, 2016, 9909 : 730 - 746
  • [6] Memory Matching Networks for One-Shot Image Recognition
    Cai, Qi
    Pan, Yingwei
    Yao, Ting
    Yan, Chenggang
    Mei, Tao
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4080 - 4088
  • [7] Chen H., ARXIV210311383, V2021
  • [8] Chen ZT, 2019, AAAI CONF ARTIF INTE, P3379
  • [9] Finn C, 2017, PR MACH LEARN RES, V70
  • [10] Finney D. J., 1952, J AM STAT ASSOC, DOI [10.2307/2280787, DOI 10.2307/2280787]