Low-resolution few-shot learning via multi-space knowledge distillation

被引:0
作者
Liu, Ke [1 ,2 ]
Ye, Xinchen [1 ,2 ]
Sun, Baoli [1 ,2 ]
Yang, Hairui [1 ,2 ]
Li, Haojie [3 ]
Xu, Rui [1 ,2 ]
Wang, Zhihui [1 ,2 ]
机构
[1] Dalian Univ Technol, DUT Sch Software Technol, Dalian, Peoples R China
[2] Dalian Univ Technol, DUT RU Int Sch Informat Sci Engn, Dalian, Peoples R China
[3] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao, Peoples R China
关键词
Few-shot learning; Low-resolution classification; Multi-space knowledge distillation;
D O I
10.1016/j.ins.2024.120968
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Existing few -shot classification models usually rely on limited known support images to form class centers, and classify query images based on the distance between their embedding and the class centers. However, these models assume that the query image is high -resolution (HR), and thus suffer from significant performance degradation when applied to low -resolution (LR) images. Due to the lack of discriminative information in LR images, there is a noticeable discrepancy between the embeddings of LR query images and the class centers formed by HR support images. To address this issue, we first formulate the problem of Low -Resolution Few -Shot Learning (LRFSL), where the support images are HR while the query images are only available in LR. Then, we propose an end -to -end pipeline that leverages mutual learning between a super -resolution (SR) network and a few -shot classification network. To further reduce the domain discrepancy between the embeddings of the SR images and HR class centers, we introduce a multi -space knowledge distillation strategy that aims to transfer pixel -level, feature -level, and logit-level knowledge of the HR domain to the SR domain. We conduct extensive experiments on classic few -shot datasets: miniImageNet, tieredImageNet, and the fine-grained few -shot dataset CUB. Experimental results show that our method can handle few -shot classification with LR input, and achieve performance that is almost comparable to using HR images as input. Specifically, our method achieves an average accuracy improvement of 26.47% with the Meta -Baseline Model and 7.44% with the Meta DeepBDC Model across all datasets compared to LR Query.
引用
收藏
页数:12
相关论文
共 41 条
  • [1] Variational Information Distillation for Knowledge Transfer
    Ahn, Sungsoo
    Hu, Shell Xu
    Damianou, Andreas
    Lawrence, Neil D.
    Dai, Zhenwen
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9155 - 9163
  • [2] Convolutional low-resolution fine-grained classification
    Cai, Dingding
    Chen, Ke
    Qian, Yanlin
    Kamarainen, Joni-Kristian
    [J]. PATTERN RECOGNITION LETTERS, 2019, 119 : 166 - 171
  • [3] Memory Matching Networks for One-Shot Image Recognition
    Cai, Qi
    Pan, Yingwei
    Yao, Ting
    Yan, Chenggang
    Mei, Tao
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4080 - 4088
  • [4] Video anomaly detection with spatio-temporal dissociation
    Chang, Yunpeng
    Tu, Zhigang
    Xie, Wei
    Luo, Bin
    Zhang, Shifu
    Sui, Haigang
    Yuan, Junsong
    [J]. PATTERN RECOGNITION, 2022, 122
  • [5] Meta-Baseline: Exploring Simple Meta-Learning for Few-Shot Learning
    Chen, Yinbo
    Liu, Zhuang
    Xu, Huijuan
    Darrell, Trevor
    Wang, Xiaolong
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9042 - 9051
  • [6] CAD: Co-Adapting Discriminative Features for Improved Few-Shot Classification
    Chikontwe, Philip
    Kim, Soopil
    Park, Sang Hyun
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 14534 - 14543
  • [7] On the Efficacy of Knowledge Distillation
    Cho, Jang Hyun
    Hariharan, Bharath
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 4793 - 4801
  • [8] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [9] Diversity with Cooperation: Ensemble Methods for Few-Shot Classification
    Dvornik, Nikita
    Schmid, Cordelia
    Mairal, Julien
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3722 - 3730
  • [10] Finn C, 2017, PR MACH LEARN RES, V70