Global Information-Assisted Fine-Grained Visual Categorization in Internet of Things

被引：3

作者：

Li, Ang ^{[1
]}

Kang, Bin ^{[1
]}

Chen, Jianxin ^{[1
]}

Wu, Dan ^{[2
]}

Zhou, Liang ^{[1
]}

机构：

[1] Nanjing Univ Posts & Telecommun, Key Lab Broadband Wireless Commun & Sensor Network, Minist Educ, Nanjing 210003, Peoples R China

[2] Army Engn Univ PLA, Coll Commun Engn, Nanjing 210007, Peoples R China

来源：

IEEE INTERNET OF THINGS JOURNAL | 2023年 / 10卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Alternative knowledge distillation strategy; fine-grained visual categorization; global-local aggregation strategy;

D O I：

10.1109/JIOT.2022.3218150

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In fine-grained visual categorization (FGVC), most part-based frameworks do not work effectively in some extremely challenging scenarios such as partial occlusion. This limitation is due to the heavy disorder of local features extracted from such occluded targets. To address this issue, we propose a global information-assisted network (GIAN), where auxiliary global information can search the useful elements of local information and integrate with them for an efficient unified feature representation. In particular, in order to acquire the global information, we design a global attention-concentrated convolutional neural network (GAC-CNN) by extending a convolutional neural network with a nonlocal GCN module. Then, the unified feature representation is produced by two strategies. On the one hand, a global-local aggregation strategy is developed to selectively integrate global features with local features through consistency evaluation and reweighting method. On the other hand, an alternative knowledge distillation strategy is developed to help generate more powerful global and local features. Two strategies collaboratively make the unified features more robust and more discriminative than traditional part-based features. Experimental results show that the proposed GIAN can achieve accuracies of 92.8%, 93.8%, and 95.7% on CUB-200-2011, FGVC Aircraft, and Stanford Cars, respectively.

引用

页码：940 / 952

页数：13

共 60 条

[1] Anderson C, 2020, Arxiv, DOI arXiv:2006.13190
[2] Anil R, 2020, Arxiv, DOI arXiv:1804.03235
[3] Branson S, 2014, Arxiv, DOI [arXiv:1406.2952, DOI 10.48550/ARXIV.1406.2952]
[4] GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond
Cao, Yue
Xu, Jiarui
Lin, Stephen
Wei, Fangyun
Hu, Han
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1971 - 1980
[5] Destruction and Construction Learning for Fine-grained Image Recognition
Chen, Yue
Bai, Yalong
Zhang, Wei
Mei, Tao
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5152 - 5161
[6] Cheng Y, 2020, Arxiv, DOI arXiv:1710.09282
[7] Selective Sparse Sampling for Fine-grained Image Recognition
Ding, Yao
Zhou, Yanzhao
Zhu, Yi
Ye, Qixiang
Jiao, Jianbin
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6598 - 6607
[8] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[9] Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-grained Image Recognition
Fu, Jianlong
Zheng, Heliang
Mei, Tao
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4476 - 4484
[10] Gao Y, 2020, AAAI CONF ARTIF INTE, V34, P10818

← 1 2 3 4 5 6 →