Weakly Supervised Fine-Grained Categorization With Part-Based Image Representation

被引:149
作者
Zhang, Yu [1 ]
Wei, Xiu-Shen [2 ]
Wu, Jianxin [2 ]
Cai, Jianfei [3 ]
Lu, Jiangbo [4 ]
Nguyen, Viet-Anh [4 ]
Do, Minh N. [5 ]
机构
[1] ASTAR, Bioinformat Inst, Singapore 138671, Singapore
[2] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Jiangsu, Peoples R China
[3] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
[4] Adv Digital Sci Ctr, Singapore 138632, Singapore
[5] Univ Illinois, Urbana, IL 61801 USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Fine-grained categorization; weakly-supervised; part selection; CLASSIFICATION; DETECTORS;
D O I
10.1109/TIP.2016.2531289
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a fine-grained image categorization system with easy deployment. We do not use any object/part annotation (weakly supervised) in the training or in the testing stage, but only class labels for training images. Fine-grained image categorization aims to classify objects with only subtle distinctions (e.g., two breeds of dogs that look alike). Most existing works heavily rely on object/part detectors to build the correspondence between object parts, which require accurate object or object part annotations at least for training images. The need for expensive object annotations prevents the wide usage of these methods. Instead, we propose to generate multi-scale part proposals from object proposals, select useful part proposals, and use them to compute a global image representation for categorization. This is specially designed for the weakly supervised fine-grained categorization task, because useful parts have been shown to play a critical role in existing annotation-dependent works, but accurate part detectors are hard to acquire. With the proposed image representation, we can further detect and visualize the key (most discriminative) parts in objects of different classes. In the experiments, the proposed weakly supervised method achieves comparable or better accuracy than the state-of-the-art weakly supervised methods and most existing annotation-dependent methods on three challenging datasets. Its success suggests that it is not always necessary to learn expensive object/part detectors in fine-grained image categorization.
引用
收藏
页码:1713 / 1725
页数:13
相关论文
共 53 条
[1]  
[Anonymous], PAMI
[2]  
[Anonymous], 2011, P CVPR WORKSHOP FINE
[3]  
[Anonymous], P BRIT MACH VIS C
[4]  
[Anonymous], PROC CVPR IEEE
[5]  
[Anonymous], IEEE T PATTERN ANAL
[6]  
[Anonymous], 2014, P 31 INT C INT C MAC
[7]  
[Anonymous], P BRIT MACH VIS C BM
[8]  
[Anonymous], FISHER KERNEL DEEP N
[9]   Birdsnap: Large-scale Fine-grained Visual Categorization of Birds [J].
Berg, Thomas ;
Liu, Jiongxin ;
Lee, Seung Woo ;
Alexander, Michelle L. ;
Jacobs, David W. ;
Belhumeur, Peter N. .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :2019-2026
[10]  
Bourdev L, 2010, LECT NOTES COMPUT SC, V6316, P168, DOI 10.1007/978-3-642-15567-3_13