Hallucinating Saliency Maps for Fine-grained Image Classification for Limited Data Domains

被引:3
作者
Figueroa-Flores, Carola [1 ,2 ]
Raducanu, Bogdan [1 ]
Berga, David [1 ]
van de Weijer, Joost [1 ]
机构
[1] Comp Vis Ctr, Edifici O,Campus UAB, Bellaterra 8193, Barcelona, Spain
[2] Univ Bio Bio, Dept Comp Sci & Informat Technol, Concepcion, Chile
来源
VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 4: VISAPP | 2021年
关键词
Fine-grained Image Classification; Saliency Detection; Convolutional Neural Networks; VISUAL-ATTENTION;
D O I
10.5220/0010299501630171
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It has been shown that saliency maps can be used to improve the performance of object recognition systems, especially on datasets that have only limited training data. However, a drawback of such an approach is that it requires a pre-trained saliency network. In the current paper, we propose an approach which does not require explicit saliency maps to improve image classification, but they are learned implicitely, during the training of an end-to-end image classification task. We show that our approach obtains similar results as the case when the saliency maps are provided explicitely. We validate our method on several datasets for fine-grained classification tasks (Flowers, Birds and Cars), and show that especially for domains with limited data the proposed method significantly improves the results.
引用
收藏
页码:163 / 171
页数:9
相关论文
共 44 条
[11]   Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-grained Image Recognition [J].
Fu, Jianlong ;
Zheng, Heliang ;
Mei, Tao .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4476-4484
[12]   Compact Bilinear Pooling [J].
Gao, Yang ;
Beijbom, Oscar ;
Zhang, Ning ;
Darrell, Trevor .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :317-326
[13]  
Glorot X., 2010, P 13 INT C ART INT S, P249
[14]   Biologically plausible saliency mechanisms improve feedforward object recognition [J].
Han, Sunhyoung ;
Vasconcelos, Nuno .
VISION RESEARCH, 2010, 50 (22) :2295-2307
[15]   Low-shot Visual Recognition by Shrinking and Hallucinating Features [J].
Hariharan, Bharath ;
Girshick, Ross .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :3037-3046
[16]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[17]   Learning with Side Information through Modality Hallucination [J].
Hoffman, Judy ;
Gupta, Saurabh ;
Darrell, Trevor .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :826-834
[18]   Part-Stacked CNN for Fine-Grained Visual Categorization [J].
Huang, Shaoli ;
Xu, Zhe ;
Tao, Dacheng ;
Zhang, Ya .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1173-1182
[19]   A model of saliency-based visual attention for rapid scene analysis [J].
Itti, L ;
Koch, C ;
Niebur, E .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (11) :1254-1259
[20]   Low-rank Bilinear Pooling for Fine-Grained Classification [J].
Kong, Shu ;
Fowlkes, Charless .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :7025-7034