Few-shot Learning via Saliency-guided Hallucination of Samples

被引:201
作者
Zhang, Hongguang [1 ,2 ]
Zhang, Jing [1 ,2 ]
Koniusz, Piotr [1 ,2 ]
机构
[1] Australian Natl Univ, Canberra, ACT, Australia
[2] CSIRO, Data61, Sydney, NSW, Australia
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
EXAMPLE;
D O I
10.1109/CVPR.2019.00288
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning new concepts from a few of samples is a standard challenge in computer vision. The main directions to improve the learning ability of few-shot training models include (i) a robust similarity learning and (ii) generating or hallucinating additional data from the limited existing samples. In this paper, we follow the latter direction and present a novel data hallucination model. Currently, most datapoint generators contain a specialized network (i.e., GAN) tasked with hallucinating new datapoints, thus requiring large numbers of annotated data for their training in the first place. In this paper, we propose a novel less-costly hallucination method for few-shot learning which utilizes saliency maps. To this end, we employ a saliency network to obtain the foregrounds and backgrounds of available image samples and feed the resulting maps into a two-stream network to hallucinate datapoints directly in the feature space from viable foreground-background combinations. To the best of our knowledge, we are the first to leverage saliency maps for such a task and we demonstrate their usefulness in hallucinating additional datapoints for few-shot learning. Our proposed network achieves the state of the art on publicly available datasets.
引用
收藏
页码:2765 / 2774
页数:10
相关论文
共 47 条
[1]   Label-Embedding for Attribute-Based Classification [J].
Akata, Zeynep ;
Perronnin, Florent ;
Harchaoui, Zaid ;
Schmid, Cordelia .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :819-826
[2]  
[Anonymous], CORR180201093
[3]  
[Anonymous], 2017, ADV NEURAL INFORM PR
[4]  
[Anonymous], 2011, COGSCI
[5]  
[Anonymous], 2018, ICLR
[6]  
[Anonymous], ECCV
[7]  
[Anonymous], 2015, IEEE T IMAGE PROCESS, DOI DOI 10.1109/TIP.2015.2487833
[8]  
[Anonymous], 2010, ICML
[9]  
[Anonymous], 2012, ECCV
[10]  
[Anonymous], ICML