PFNet: a novel part fusion network for fine-grained visual categorization

被引:0
作者
Jingyun Liang
Jinlin Guo
Yanming Guo
Songyang Lao
机构
[1] National University of Defense Technology,College of System Engineering
来源
Multimedia Tools and Applications | 2020年 / 79卷
关键词
Fine-grained visual categorization; Image classification; Convolutional neural network;
D O I
暂无
中图分类号
学科分类号
摘要
The existing methods in fine-grained visual categorization focus on integrating multiple deep CNN models or complicated attention mechanism, resulting in increasing cumbersome networks. In addition, most methods rely on part annotations which requires expensive expert guidance. In this paper, without extra annotation, we propose a novel part fusion network (PFNet) to effectively fuse discriminative image parts for classification. More specifically, PFNet consists of a part feature extractor to extract part features and a two-level classification network to utilize part-level and image-level features simultaneously. Part-level features are trained with the weighted part loss, which embeds a weighting mechanism based on different parts’ characteristics. Easy parts, hard parts and background parts are proposed and discriminatively used for classification. Moreover, part-level features are fused to form an image-level feature so as to introduce global supervision and generate final predictions. Experiments on three popular benchmark datasets show that our framework achieves competitive performance compared with the state-of-the-art. Code is available at https://github.com/MichaelLiang12/PFNet-FGVC.
引用
收藏
页码:33397 / 33416
页数:19
相关论文
共 21 条
[1]  
He K(2015)Spatial pyramid pooling in deep convolutional networks for visual recognition IEEE Trans Pattern Anal Mach Intell 37 1904-1916
[2]  
Zhang X(2012)Texture classification from random features IEEE Trans Pattern Anal Mach Intell 34 574-586
[3]  
Ren S(2004)Distinctive image features from scale-invariant keypoints Int J Comput Vis 60 91-110
[4]  
Sun J(2002)Multiresolution gray-scale and rotation invariant texture classification with local binary patterns IEEE Trans Pattern Anal Mach Intell 24 971-987
[5]  
Liu L(2012)Sparse ensemble learning for concept detection IEEE Trans Multimed 14 43-54
[6]  
Fieguth P(2016)Weakly supervised fine-grained categorization with part-based image representation IEEE Trans Image Process 25 1713-1725
[7]  
Lowe DG(undefined)undefined undefined undefined undefined-undefined
[8]  
Ojala T(undefined)undefined undefined undefined undefined-undefined
[9]  
Pietikainen M(undefined)undefined undefined undefined undefined-undefined
[10]  
Maenpaa T(undefined)undefined undefined undefined undefined-undefined