Exploration of Class Center for Fine-Grained Visual Classification

被引：5

作者：

Yao, Hang ^{[1
,2
]}

Miao, Qiguang ^{[1
,2
]}

Zhao, Peipei ^{[1
,2
]}

Li, Chaoneng ^{[1
,2
]}

Li, Xin ^{[3
]}

Feng, Guanwen ^{[1
,2
]}

Liu, Ruyi

机构：

[1] Xidian Univ, Sch Comp Sci & Technol, Xian Key Lab Big Data & Intelligent Vis, Xian 710071, Shaanxi, Peoples R China

[2] Xidian Univ, Key Lab Collaborat Intelligence Syst, Minist Educ, Xian 710071, Shaanxi, Peoples R China

[3] Yanshan Univ, Sch Mech Engn, Qinhuangdao 066004, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2024年 / 34卷 / 10期

关键词：

Visualization; Feature extraction; Predictive models; Task analysis; Training; Reliability; Optimization; Fine-grained visual classification; exploration of class center; class center; soft label;

D O I：

10.1109/TCSVT.2024.3406443

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Different from large-scale classification tasks, fine-grained visual classification is a challenging task due to two critical problems: 1) evident intra-class variances and subtle inter-class differences, and 2) overfitting owing to fewer training samples in datasets. Most existing methods extract key features to reduce intra-class variances, but pay no attention to subtle inter-class differences in fine-grained visual classification. To address this issue, we propose a loss function named exploration of class center, which consists of a multiple class-center constraint and a class-center label generation. This loss function fully utilizes the information of the class center from the perspective of features and labels. From the feature perspective, the multiple class-center constraint pulls samples closer to the target class center, and pushes samples away from the most similar nontarget class center. Thus, the constraint reduces intra-class variances and enlarges inter-class differences. From the label perspective, the class-center label generation utilizes class-center distributions to generate soft labels to alleviate overfitting. Our method can be easily integrated with existing fine-grained visual classification approaches as a loss function, to further boost excellent performance with only slight training costs. Extensive experiments are conducted to demonstrate consistent improvements achieved by our method on four widely-used fine-grained visual classification datasets. In particular, our method achieves state-of-the-art performance on the FGVC-Aircraft and CUB-200-2011 datasets.

引用

页码：9954 / 9966

页数：13

共 68 条

[21] Dual Transformer With Multi-Grained Assembly for Fine-Grained Visual Classification [J].

Ji, Ruyi ;

Li, Jiaying ;

Zhang, Libo ;

Liu, Jing ;

Wu, Yanjun .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) :5009-5021

[22] Attention Convolutional Binary Neural Tree for Fine-Grained Visual Categorization [J].

Ji, Ruyi ;

Wen, Longyin ;

Zhang, Libo ;

Du, Dawei ;

Wu, Yanjun ;

Zhao, Chen ;

Liu, Xianglong ;

Huang, Feiyue .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :10465-10474

[23]

Jia C, 2021, PR MACH LEARN RES, V139

[24] 3D Object Representations for Fine-Grained Categorization [J].

Krause, Jonathan ;

Stark, Michael ;

Deng, Jia ;

Li Fei-Fei .

2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, :554-561

[25] Frequency-aware Discriminative Feature Learning Supervised by Single-Center Loss for Face Forgery Detection [J].

Li, Jiaming ;

Xie, Hongtao ;

Li, Jiahong ;

Wang, Zhongyuan ;

Zhang, Yongdong .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :6454-6463

[26] WDAN: A Weighted Discriminative Adversarial Network With Dual Classifiers for Fine-Grained Open-Set Domain Adaptation [J].

Li, Jing ;

Yang, Liu ;

Wang, Qilong ;

Hu, Qinghua .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) :5133-5147

[27] A Simple Episodic Linear Probe Improves Visual Recognition in the Wild [J].

Liang, Yuanzhi ;

Zhu, Linchao ;

Wang, Xiaohan ;

Yang, Yi .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :9549-9559

[28]

Lin D, 2015, PROC CVPR IEEE, P1666, DOI 10.1109/CVPR.2015.7298775

[29] w Bilinear Convolutional Neural Networks for Fine-Grained Visual Recognition [J].

Lin, Tsung-Yu ;

RoyChowdhury, Aruni ;

Maji, Subhransu .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (06) :1309-1322

[30] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows [J].

Liu, Ze ;

Lin, Yutong ;

Cao, Yue ;

Hu, Han ;

Wei, Yixuan ;

Zhang, Zheng ;

Lin, Stephen ;

Guo, Baining .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9992-10002

← 1 2 3 4 5 6 7 →