Global Information-Assisted Fine-Grained Visual Categorization in Internet of Things

被引:3
作者
Li, Ang [1 ]
Kang, Bin [1 ]
Chen, Jianxin [1 ]
Wu, Dan [2 ]
Zhou, Liang [1 ]
机构
[1] Nanjing Univ Posts & Telecommun, Key Lab Broadband Wireless Commun & Sensor Network, Minist Educ, Nanjing 210003, Peoples R China
[2] Army Engn Univ PLA, Coll Commun Engn, Nanjing 210007, Peoples R China
基金
中国国家自然科学基金;
关键词
Alternative knowledge distillation strategy; fine-grained visual categorization; global-local aggregation strategy;
D O I
10.1109/JIOT.2022.3218150
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In fine-grained visual categorization (FGVC), most part-based frameworks do not work effectively in some extremely challenging scenarios such as partial occlusion. This limitation is due to the heavy disorder of local features extracted from such occluded targets. To address this issue, we propose a global information-assisted network (GIAN), where auxiliary global information can search the useful elements of local information and integrate with them for an efficient unified feature representation. In particular, in order to acquire the global information, we design a global attention-concentrated convolutional neural network (GAC-CNN) by extending a convolutional neural network with a nonlocal GCN module. Then, the unified feature representation is produced by two strategies. On the one hand, a global-local aggregation strategy is developed to selectively integrate global features with local features through consistency evaluation and reweighting method. On the other hand, an alternative knowledge distillation strategy is developed to help generate more powerful global and local features. Two strategies collaboratively make the unified features more robust and more discriminative than traditional part-based features. Experimental results show that the proposed GIAN can achieve accuracies of 92.8%, 93.8%, and 95.7% on CUB-200-2011, FGVC Aircraft, and Stanford Cars, respectively.
引用
收藏
页码:940 / 952
页数:13
相关论文
共 60 条
  • [1] Anderson C, 2020, Arxiv, DOI arXiv:2006.13190
  • [2] Anil R, 2020, Arxiv, DOI arXiv:1804.03235
  • [3] Branson S, 2014, Arxiv, DOI [arXiv:1406.2952, DOI 10.48550/ARXIV.1406.2952]
  • [4] GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond
    Cao, Yue
    Xu, Jiarui
    Lin, Stephen
    Wei, Fangyun
    Hu, Han
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1971 - 1980
  • [5] Destruction and Construction Learning for Fine-grained Image Recognition
    Chen, Yue
    Bai, Yalong
    Zhang, Wei
    Mei, Tao
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 5152 - 5161
  • [6] Cheng Y, 2020, Arxiv, DOI arXiv:1710.09282
  • [7] Selective Sparse Sampling for Fine-grained Image Recognition
    Ding, Yao
    Zhou, Yanzhao
    Zhu, Yi
    Ye, Qixiang
    Jiao, Jianbin
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6598 - 6607
  • [8] Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
  • [9] Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-grained Image Recognition
    Fu, Jianlong
    Zheng, Heliang
    Mei, Tao
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4476 - 4484
  • [10] Gao Y, 2020, AAAI CONF ARTIF INTE, V34, P10818