ConvNeXt-Based Fine-Grained Image Classification and Bilinear Attention Mechanism Model

Cited by: 14
Authors
Li, Zhiheng [1, 2]
Gu, Tongcheng [2, 3]
Li, Bing [2, 3]
Xu, Wubin [2, 3]
He, Xin [2, 3]
Hui, Xiangyu [2, 3]
Affiliations
[1] Guangxi Liugong Machinery Co Ltd, Liuzhou 545006, Peoples R China
[2] Guangxi Sci & Technol Univ, Guangxi Earthmoving Machinery Collaborat Innovat, Liuzhou 545006, Peoples R China
[3] Guangxi Sci & Technol Univ, Coll Mech & Automot Engn, Liuzhou 545006, Peoples R China
Source
APPLIED SCIENCES-BASEL | 2022 / Vol. 12 / Issue 18
Keywords
deep learning; convolutional neural network; image classification; fine grained; attention mechanism
DOI
10.3390/app12189016
Chinese Library Classification number
O6 [Chemistry]
Discipline classification code
0703
Abstract
Featured Application: This paper studies attention-related optimizations and innovations for the ConvNeXt network proposed in January 2022, providing a reference for subsequent researchers who wish to optimize this network.
Thus far, few studies have applied the recent convolutional neural network ConvNeXt to fine-grained classification tasks, and no effective optimization method has been made available. To achieve more accurate fine-grained classification, this paper proposes two attention embedding methods based on the ConvNeXt network and designs a new bilinear CBAM; in addition, a multiscale, multi-perspective, all-around attention framework is proposed and applied to ConvNeXt. Experimental verification shows that the accuracy of the improved ConvNeXt for fine-grained image classification reaches 87.8%, 91.2%, and 93.2% on the fine-grained classification datasets CUB-200-2011, Stanford Cars, and FGVC Aircraft, respectively. These figures are increases of 2.7%, 0.3%, and 0.4%, respectively, over the original network without optimization, and increases of 3.7%, 8.0%, and 2.0%, respectively, over the traditional BCNN. Ablation experiments are also set up to verify the effectiveness of the proposed attention framework.
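The abstract gives only the improved accuracies and the size of each gain, not the baseline accuracies themselves. The short sketch below backs out the baselines those figures imply. It assumes the reported increases are absolute percentage points rather than relative gains, which the abstract does not state explicitly; the function name `implied_baseline` is hypothetical, and the derived values are arithmetic consequences of the abstract, not results quoted from the paper.

```python
# Accuracies (%) reported in the abstract for the improved ConvNeXt.
improved = {"CUB-200-2011": 87.8, "Stanford Cars": 91.2, "FGVC Aircraft": 93.2}

# Reported increases, ASSUMED here to be absolute percentage points.
gain_over_convnext = {"CUB-200-2011": 2.7, "Stanford Cars": 0.3, "FGVC Aircraft": 0.4}
gain_over_bcnn = {"CUB-200-2011": 3.7, "Stanford Cars": 8.0, "FGVC Aircraft": 2.0}


def implied_baseline(improved_acc, gains):
    """Subtract each gain from the improved accuracy (hypothetical helper)."""
    # Round to one decimal place to match the precision used in the abstract.
    return {name: round(improved_acc[name] - gains[name], 1) for name in improved_acc}


convnext_implied = implied_baseline(improved, gain_over_convnext)
bcnn_implied = implied_baseline(improved, gain_over_bcnn)

for name in improved:
    print(f"{name}: ConvNeXt {convnext_implied[name]}%, BCNN {bcnn_implied[name]}%, "
          f"improved {improved[name]}%")
```

If the increases were instead meant as relative improvements, the implied baselines would differ, so the full paper's result tables remain the authoritative source.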
Pages: 18
Related papers
50 items in total
  • [41] Fine-grained bird image classification based on counterfactual method of vision transformer model
    Chen, Tianhua
    Li, Yanyue
    Qiao, Qinghua
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (05) : 6221 - 6239
  • [42] GA-SRN: graph attention based text-image semantic reasoning network for fine-grained image classification and retrieval
    Wenhao Li
    Hongqing Zhu
    Suyi Yang
    Pengyu Wang
    Han Zhang
    Neural Computing and Applications, 2022, 34 : 21387 - 21401
  • [43] Rethinking Attention Mechanism: Channel Re-attention and Spatial Multi-region Attention for Fine-grained Visual Classification
    XiaoHui Wang
    Yulin Sun
    Xin Liu
    Zhipeng Zou
    Li Wang
    Kun Wang
    Xiaoyang Liang
    Wei Liu
    Neural Processing Letters, 57 (3)
  • [44] A survey of recent work on fine-grained image classification techniques
    Wang, Yafei
    Wang, Zepeng
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 59 : 210 - 214
  • [45] A biomedical event extraction method based on fine-grained and attention mechanism
    He, Xinyu
    Tai, Ping
    Lu, Hongbin
    Huang, Xin
    Ren, Yonggong
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [46] GA-SRN: graph attention based text-image semantic reasoning network for fine-grained image classification and retrieval
    Li, Wenhao
    Zhu, Hongqing
    Yang, Suyi
    Wang, Pengyu
    Zhang, Han
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (23) : 21387 - 21401
  • [47] A biomedical event extraction method based on fine-grained and attention mechanism
    Xinyu He
    Ping Tai
    Hongbin Lu
    Xin Huang
    Yonggong Ren
    BMC Bioinformatics, 23
  • [48] Self-Attention-Based BiLSTM Model for Short Text Fine-Grained Sentiment Classification
    Xie, Jun
    Chen, Bo
    Gu, Xinglong
    Liang, Fengmei
    Xu, Xinying
    IEEE ACCESS, 2019, 7 : 180558 - 180570
  • [49] Dual attention guided multi-scale CNN for fine-grained image classification
    Liu, Xiaozhang
    Zhang, Lifeng
    Li, Tao
    Wang, Dejian
    Wang, Zhaojie
    INFORMATION SCIENCES, 2021, 573 : 37 - 45
  • [50] Fine-grained attention mechanism for neural machine translation
    Choi, Heeyoul
    Cho, Kyunghyun
    Bengio, Yoshua
    NEUROCOMPUTING, 2018, 284 : 171 - 176