Picking Neural Activations for Fine-Grained Recognition

被引:32
作者
Zhang, Xiaopeng [1 ]
Xiong, Hongkai [1 ]
Zhou, Wengang [2 ]
Lin, Weiyao [1 ]
Tian, Qi [3 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Anhui, Peoples R China
[3] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
基金
美国国家科学基金会;
关键词
Fine-grained recognition; regularized multiple instance leaning; spatially weighted Fisher Vectors (SWFV); weakly supervised part discovery; IMAGE CLASSIFICATION; FISHER VECTOR;
D O I
10.1109/TMM.2017.2710803
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is a challenging task to recognize fine-grained subcategories due to the highly localized and subtle differences among them. Different from most previous methods that rely on object/part annotations, this paper proposes an automatic fine-grained recognition approach, which is free of any object/part annotation at both training and testing stages. The key idea includes two steps of picking neural activations computed from the convolutional neural networks, one for localization, and the other for description. The first picking step is to find distinctive neurons that are sensitive to specific patterns significantly and consistently. Based on these picked neurons, we initialize positive samples and formulate the localization as a regularized multiple instance learning task, which aims at refining the detectors via iteratively alternating between new positive sample mining and part model retraining. The second picking step is to pool deep neural activations via a spatially weighted combination of Fisher Vectors coding. We conditionally select activations to encode them into the final representation, which considers the importance of each activation. Integrating the above techniques produces a powerful framework, and experiments conducted on several extensive fine-grained benchmarks demonstrate the superiority of our proposed algorithm over the existing methods.
引用
收藏
页码:2736 / 2750
页数:15
相关论文
共 53 条
  • [1] Aditya Khosla, 2011, PROC CVPR WORKSHOP F
  • [2] Andrews S., 2002, NIPS, P561
  • [3] [Anonymous], P BRIT MACH VIS C
  • [4] [Anonymous], 2012, Advances in Neural Information Processing Systems
  • [5] POOF: Part-Based One-vs-One Features for Fine-Grained Categorization, Face Verification, and Attribute Estimation
    Berg, Thomas
    Belhumeur, Peter N.
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 955 - 962
  • [6] BOUREAU YL, 2010, PROC CVPR IEEE, P2559, DOI DOI 10.1109/CVPR.2010.5539963
  • [7] The Ignorant Led by the Blind: A Hybrid Human-Machine Vision System for Fine-Grained Categorization
    Branson, Steve
    Van Horn, Grant
    Wah, Catherine
    Perona, Pietro
    Belongie, Serge
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 108 (1-2) : 3 - 29
  • [8] Symbiotic Segmentation and Part Localization for Fine-Grained Categorization
    Chai, Yuning
    Lempitsky, Victor
    Zisserman, Andrew
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 321 - 328
  • [9] CIMPOI M, 2015, PROC CVPR IEEE, P3828, DOI DOI 10.1109/CVPR.2015.7299007
  • [10] Collobert R., 2004, RR0442 ID RES I