Open-Set Likelihood Maximization for Few-Shot Learning

被引:8
作者
Boudiaf, Malik [1 ]
Bennequin, Etienne [2 ,3 ]
Tami, Myriam [3 ]
Toubhans, Antoine [2 ]
Piantanida, Pablo [4 ,5 ,6 ,7 ]
Hudelot, Celine [3 ]
Ben Ayed, Ismail [1 ]
机构
[1] ETS Montreal, Montreal, PQ, Canada
[2] Sicara, Paris, France
[3] Univ Paris Saclay, MICS, Cent Supelec, Gif Sur Yvette, France
[4] ILLS, Paris, France
[5] MILA, Montreal, PQ, Canada
[6] McGill, Montreal, PQ, Canada
[7] CNRS, Paris, France
来源
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2023年
关键词
D O I
10.1109/CVPR52729.2023.02299
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We tackle the Few-Shot Open-Set Recognition (FSOSR) problem, i.e. classifying instances among a set of classes for which we only have a few labeled samples, while simultaneously detecting instances that do not belong to any known class. We explore the popular transductive setting, which leverages the unlabelled query instances at inference. Motivated by the observation that existing transductive methods perform poorly in open-set scenarios, we propose a generalization of the maximum likelihood principle, in which latent scores down-weighing the influence of potential outliers are introduced alongside the usual parametric model. Our formulation embeds supervision constraints from the support set and additional penalties discouraging overconfident predictions on the query set. We proceed with a block-coordinate descent, with the latent scores and parametric model co-optimized alternately, thereby benefiting from each other. We call our resulting formulation Open-Set Likelihood Optimization (OSLO). OSLO is interpretable and fully modular; it can be applied on top of any pre-trained model seamlessly. Through extensive experiments, we show that our method surpasses existing inductive and transductive methods on both aspects of open-set recognition, namely inlier classification and outlier detection. Code is available at https://github.com/ebennequin/fewshot-open-set.
引用
收藏
页码:24007 / 24016
页数:10
相关论文
共 52 条
[1]  
[Anonymous], 2019, SIMPLESHOT REVISITIN, DOI DOI 10.1109/ICCV.2019.00362
[2]  
Bendale Abhijit, 2016, COMP VIS PATT REC C
[3]  
Bennequin Etienne, 2021, EUR C MACH LEARN PRI
[4]  
Boudiaf Malik, 2020, Advances in Neural Information Processing Systems (NeurIPS)
[5]  
Boudiaf Malik, 2021, COMP VIS PATT REC CV
[6]   Emerging Properties in Self-Supervised Vision Transformers [J].
Caron, Mathilde ;
Touvron, Hugo ;
Misra, Ishan ;
Jegou, Herve ;
Mairal, Julien ;
Bojanowski, Piotr ;
Joulin, Armand .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9630-9640
[7]  
Chen W.-Y., 2019, INT C LEARN REPR, P1, DOI DOI 10.1109/MSR.2015.54
[8]  
Chen X., 2022, PROC 10 INT C LEARN
[9]  
Chen Yanbei, 2020, C ART INT AAAI
[10]  
Dhillon G S, 2020, INT C LEARN REPR ICL