Exploration of Class Center for Fine-Grained Visual Classification

被引:5
作者
Yao, Hang [1 ,2 ]
Miao, Qiguang [1 ,2 ]
Zhao, Peipei [1 ,2 ]
Li, Chaoneng [1 ,2 ]
Li, Xin [3 ]
Feng, Guanwen [1 ,2 ]
Liu, Ruyi
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian Key Lab Big Data & Intelligent Vis, Xian 710071, Shaanxi, Peoples R China
[2] Xidian Univ, Key Lab Collaborat Intelligence Syst, Minist Educ, Xian 710071, Shaanxi, Peoples R China
[3] Yanshan Univ, Sch Mech Engn, Qinhuangdao 066004, Peoples R China
关键词
Visualization; Feature extraction; Predictive models; Task analysis; Training; Reliability; Optimization; Fine-grained visual classification; exploration of class center; class center; soft label;
D O I
10.1109/TCSVT.2024.3406443
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Different from large-scale classification tasks, fine-grained visual classification is a challenging task due to two critical problems: 1) evident intra-class variances and subtle inter-class differences, and 2) overfitting owing to fewer training samples in datasets. Most existing methods extract key features to reduce intra-class variances, but pay no attention to subtle inter-class differences in fine-grained visual classification. To address this issue, we propose a loss function named exploration of class center, which consists of a multiple class-center constraint and a class-center label generation. This loss function fully utilizes the information of the class center from the perspective of features and labels. From the feature perspective, the multiple class-center constraint pulls samples closer to the target class center, and pushes samples away from the most similar nontarget class center. Thus, the constraint reduces intra-class variances and enlarges inter-class differences. From the label perspective, the class-center label generation utilizes class-center distributions to generate soft labels to alleviate overfitting. Our method can be easily integrated with existing fine-grained visual classification approaches as a loss function, to further boost excellent performance with only slight training costs. Extensive experiments are conducted to demonstrate consistent improvements achieved by our method on four widely-used fine-grained visual classification datasets. In particular, our method achieves state-of-the-art performance on the FGVC-Aircraft and CUB-200-2011 datasets.
引用
收藏
页码:9954 / 9966
页数:13
相关论文
共 68 条
[21]   Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification [J].
Ji, Ruyi ;
Li, Jiaying ;
Zhang, Libo ;
Liu, Jing ;
Wu, Yanjun .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) :5009-5021
[22]   Attention Convolutional Binary Neural Tree for Fine-Grained Visual Categorization [J].
Ji, Ruyi ;
Wen, Longyin ;
Zhang, Libo ;
Du, Dawei ;
Wu, Yanjun ;
Zhao, Chen ;
Liu, Xianglong ;
Huang, Feiyue .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :10465-10474
[23]  
Jia C, 2021, PR MACH LEARN RES, V139
[24]   3D Object Representations for Fine-Grained Categorization [J].
Krause, Jonathan ;
Stark, Michael ;
Deng, Jia ;
Li Fei-Fei .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, :554-561
[25]   Frequency-aware Discriminative Feature Learning Supervised by Single-Center Loss for Face Forgery Detection [J].
Li, Jiaming ;
Xie, Hongtao ;
Li, Jiahong ;
Wang, Zhongyuan ;
Zhang, Yongdong .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :6454-6463
[26]   WDAN: A Weighted Discriminative Adversarial Network With Dual Classifiers for Fine-Grained Open-Set Domain Adaptation [J].
Li, Jing ;
Yang, Liu ;
Wang, Qilong ;
Hu, Qinghua .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) :5133-5147
[27]   A Simple Episodic Linear Probe Improves Visual Recognition in the Wild [J].
Liang, Yuanzhi ;
Zhu, Linchao ;
Wang, Xiaohan ;
Yang, Yi .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :9549-9559
[28]  
Lin D, 2015, PROC CVPR IEEE, P1666, DOI 10.1109/CVPR.2015.7298775
[29]   w Bilinear Convolutional Neural Networks for Fine-Grained Visual Recognition [J].
Lin, Tsung-Yu ;
RoyChowdhury, Aruni ;
Maji, Subhransu .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (06) :1309-1322
[30]   Swin Transformer: Hierarchical Vision Transformer using Shifted Windows [J].
Liu, Ze ;
Lin, Yutong ;
Cao, Yue ;
Hu, Han ;
Wei, Yixuan ;
Zhang, Zheng ;
Lin, Stephen ;
Guo, Baining .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9992-10002