COUPLED PATCH SIMILARITY NETWORK FOR ONE-SHOT FINE-GRAINED IMAGE RECOGNITION

被引:9
作者
Tian, Sheng [1 ]
Tang, Hao [1 ]
Dai, Longquan [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing, Peoples R China
来源
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2021年
基金
中国国家自然科学基金;
关键词
one-shot; fine-grained; image recognition;
D O I
10.1109/ICIP42928.2021.9506685
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
One-shot fine-grained image recognition (OSFG) aims to distinguish different fine-grained categories with only one training sample per category. Previous works mainly focus on learning a global feature representation through only a using single similarity metric branch, which is unsuitable for OSFG to effectively capture subtle and local differences under limited supervision. In this work, we propose a Coupled Patch Similarity Network (CPSN) for OSFG. Firstly, we propose a Feature Enhancement Module (FEM) to extract more discriminative features of the fine-grained samples. Then, we develop two coupled and symmetrical branches to capture discriminative parts of the samples and reduce the deviation of the distance metric. For each branch, we design a Patch Similarity Module (PSM) to calculate the patch similarity map for the sample pair. Especially, a PatchWeight Generator (PWG) is proposed to generate the patch weight map, which indicates the degree of importance for each position in the patch similarity map, so that the model can focus on diverse and informative parts. We analyze the effect of the different components in the proposed network, and extensive experimental results demonstrate the effectiveness and superiority of the proposed method on two fine-grained benchmark datasets.
引用
收藏
页码:2478 / 2482
页数:5
相关论文
共 22 条
  • [1] Afrasiyabi Arman, 2020, Computer Vision - ECCV 2020 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12350), P18, DOI 10.1007/978-3-030-58558-7_2
  • [2] [Anonymous], 2020, ACM INT C MULT
  • [3] Chen W.-Y., 2019, INT C LEARN REPR
  • [4] Hou RB, 2019, ADV NEUR IN, V32
  • [5] Huang Huaxi, 2020, TMM
  • [6] Central retina changes in Parkinson's disease: a systematic review and meta-analysis
    Huang, Lele
    Zhang, Dan
    Ji, Jianling
    Wang, Yujie
    Zhang, Ruijun
    [J]. JOURNAL OF NEUROLOGY, 2021, 268 (12) : 4646 - 4654
  • [7] 3D Object Representations for Fine-Grained Categorization
    Krause, Jonathan
    Stark, Michael
    Deng, Jia
    Li Fei-Fei
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, : 554 - 561
  • [8] Meta-Learning with Differentiable Convex Optimization
    Lee, Kwonjoon
    Maji, Subhransu
    Ravichandran, Avinash
    Soatto, Stefano
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10649 - 10657
  • [9] Revisiting Local Descriptor based Image-to-Class Measure for Few-shot Learning
    Li, Wenbin
    Wang, Lei
    Xu, Jinglin
    Huo, Jing
    Gao, Yang
    Luo, Jiebo
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7253 - 7260
  • [10] Li Xiaoxu, 2020, TIP