FS-BAN: Born-Again Networks for Domain Generalization Few-Shot Classification

Cited by: 8
Authors
Zhao, Yunqing [1]
Cheung, Ngai-Man [1]
Affiliations
[1] Singapore University of Technology and Design, Information Systems Technology and Design Pillar, Singapore 487372, Singapore
Funding
National Research Foundation, Singapore;
Keywords
Training; Power capacitors; Task analysis; Data models; Knowledge engineering; Adaptation models; Training data; Few-shot classification; domain generalization; born-again network; episodic training; meta-learning;
DOI
10.1109/TIP.2023.3266172
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Conventional few-shot classification (FSC) aims to recognize samples from novel classes given limited labeled data. Recently, domain generalization FSC (DG-FSC) has been proposed with the goal of recognizing novel-class samples from unseen domains. DG-FSC poses considerable challenges to many models due to the domain shift between base classes (used in training) and novel classes (encountered in evaluation). In this work, we make two novel contributions to tackle DG-FSC. Our first contribution is to propose Born-Again Network (BAN) episodic training and comprehensively investigate its effectiveness for DG-FSC. As a specific form of knowledge distillation, BAN has been shown to achieve improved generalization in conventional supervised classification with a closed-set setup. This improved generalization motivates us to study BAN for DG-FSC, and we show that BAN is a promising approach to addressing the domain shift encountered in DG-FSC. Building on these encouraging findings, our second (major) contribution is to propose Few-Shot BAN (FS-BAN), a novel BAN approach for DG-FSC. Our proposed FS-BAN includes novel multi-task learning objectives: Mutual Regularization, Mismatched Teacher, and Meta-Control Temperature, each specifically designed to overcome central and unique challenges in DG-FSC, namely overfitting and domain discrepancy. We analyze different design choices of these techniques. We conduct comprehensive quantitative and qualitative analysis and evaluation over six datasets and three baseline models. The results suggest that our proposed FS-BAN consistently improves the generalization performance of baseline models and achieves state-of-the-art accuracy for DG-FSC. Project Page: yunqing-me.github.io/Born-Again-FS/.
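For readers unfamiliar with the knowledge-distillation idea that the abstract builds on, the following is a minimal sketch of a generic temperature-scaled distillation loss. It is not the FS-BAN objective from the paper. The function names, the fixed `temperature`, and the mixing weight `alpha` are illustrative assumptions. In a born-again setup the student shares the teacher's architecture, so the two logit vectors here would come from two generations of the same model.

```python
import math

def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits, softened by a temperature."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Generic knowledge-distillation loss (illustrative, not the paper's loss).

    Combines cross-entropy on the hard label with KL(teacher || student)
    computed on temperature-softened distributions. `alpha` and
    `temperature` are hypothetical hyperparameters chosen for the sketch.
    """
    # Hard-label term: standard cross-entropy at temperature 1.
    p_student = softmax(student_logits)
    cross_entropy = -math.log(p_student[label])

    # Soft-label term: KL divergence between softened distributions.
    p_teacher_soft = softmax(teacher_logits, temperature)
    p_student_soft = softmax(student_logits, temperature)
    kl = sum(t * math.log(t / s)
             for t, s in zip(p_teacher_soft, p_student_soft) if t > 0)

    # The T^2 factor keeps the soft-term gradient scale comparable across temperatures.
    return (1 - alpha) * cross_entropy + alpha * (temperature ** 2) * kl
```

In this sketch, a student whose logits match the teacher's incurs no soft-label penalty, so only the hard-label term remains. According to the abstract, FS-BAN replaces the fixed temperature with a Meta-Control Temperature and adds further objectives, none of which are modeled here.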
Pages: 2252-2266
Number of pages: 15