Refined probability distribution module for fine-grained visual categorization

被引:3
作者
Zhao, Peipei [1 ]
Miao, Qiguang [1 ]
Li, Hongsheng [2 ]
Liu, Ruyi [1 ]
Quan, Yining [1 ]
Song, Jianfeng [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Shaanxi, Peoples R China
[2] Chinese Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
基金
国家重点研发计划; 中国博士后科学基金;
关键词
Image -to -image similarity scores; Batch random walk; Deep learning; Fine-grained visual categorization; PERSON REIDENTIFICATION;
D O I
10.1016/j.neucom.2022.10.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained visual categorization is an important task in computer vision. Prior works on fine-grained visual categorization have paid much attention to addressing intra-class variation and inter-class similar-ity. However, they rarely study that task from the perspective of probability distribution. In this paper, we propose a novel refined probability distribution module based on deep convolutional neural network. Our module computes the probability of an image by fully utilizing the similarity information between images. Firstly, we use deep neural networks to obtain the initial probability distribution and extract fea-tures. Then, we build a network whose inputs are features for calculating image-to-image similarity scores. Finally, our module refines the initial probability distribution based on an effective batch random walk operation with similarity scores. Our module can be plugged into many deep convolutional neural networks. Experimental results show that our approach outperforms state-of-the-art methods on the CUB-200-2011, FGVC-Aircraft and Stanford Cars datasets respectively.CO 2022 Published by Elsevier B.V.
引用
收藏
页码:533 / 544
页数:12
相关论文
共 56 条
[1]  
Adam P., 2017, P NEURAL INFORM PROC
[2]  
Aldous D. J., 1999, J THEORETICAL PROBAB, V2, P91
[3]  
[Anonymous], 2018, P EUROPEAN C COMPUTE
[4]  
[Anonymous], 2013, P 21 ACM INT C MULTI
[5]  
Arandjelovic R, 2012, PROC CVPR IEEE, P2911, DOI 10.1109/CVPR.2012.6248018
[6]  
Bai S., 2017, P IEEE C COMP VIS PA, P2530
[7]   Sparse Contextual Activation for Efficient Visual Re-Ranking [J].
Bai, Song ;
Bai, Xiang .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (03) :1056-1069
[8]   Deep-Person: Learning discriminative deep features for person Re-Identification [J].
Bai, Xiang ;
Yang, Mingkun ;
Huang, Tengteng ;
Dou, Zhiyong ;
Yu, Rui ;
Xu, Yongchao .
PATTERN RECOGNITION, 2020, 98
[9]   Bag-of-Words Based Deep Neural Network for Image Retrieval [J].
Bai, Yalong ;
Yu, Wei ;
Xiao, Tianjun ;
Xu, Chang ;
Yang, Kuiyuan ;
Ma, Wei-Ying ;
Zhao, Tiejun .
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :229-232
[10]   Convolutional Random Walk Networks for Semantic Image Segmentation [J].
Bertasius, Gedas ;
Torresani, Lorenzo ;
Yu, Stella X. ;
Shi, Jianbo .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6137-6145