Local Reversible Transformer for semantic segmentation of grape leaf diseases

被引:8
|
作者
Zhang, Xinxin [1 ,2 ]
Li, Fei [1 ]
Jin, Haibin [1 ,2 ]
Mu, Weisong [1 ,2 ,3 ]
机构
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[2] Minist Agr, Key Lab Viticulture & Enol, Beijing 100083, Peoples R China
[3] China Agr Univ, POB 121,17 Tsinghua East Rd, Beijing 100083, Peoples R China
关键词
Local learning bottleneck; Reversible downsampling; Grape leaf diseases; Semantic segmentation;
D O I
10.1016/j.asoc.2023.110392
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Grape leaf diseases segmentation is an essential basis for achieving precise diagnosis and identification of diseases. However, the complex background renders it difficult for small disease areas to be precisely segmented. The existing Transformer mainly focuses on utilizing key and value downsampling to improve model performance while neglecting that downsampling is irreversible with the loss of contextual information. To this end, this paper proposed a novel Locally Reversible Transformer (LRT) segmentation model for grape leaf diseases in natural scene images, whose representation is learned in a reversible downsampling manner. Specifically, a Local Learning Bottleneck (LLB) is developed to enhance local perception and extract richer semantic information of grape leaf diseases via inverted residual convolution. Furthermore, motivated by the wavelet theory, the Reversible Attention (RA) is designed to replace the original downsampling operation by introducing wavelet transform into the multi-headed attention and solving the problem of difficult detection and segmentation of small disease targets with complex backgrounds. Extensive experiments demonstrate that the segmentation performance of LRT outperforms state-of-the-art models with comparable GFLOPs and parameters. Moreover, LRT can retain more multi-grain information and can increase the receptive field to focus on small disease regions with complex backgrounds.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Full-Scale Selective Transformer for Semantic Segmentation
    Lin, Fangjian
    Wu, Sitong
    Ma, Yizhe
    Tian, Shengwei
    COMPUTER VISION - ACCV 2022, PT VII, 2023, 13847 : 310 - 326
  • [22] Efficient and adaptive semantic segmentation network based on Transformer
    Zhang H.-B.
    Cai L.
    Ren J.-P.
    Wang R.-Y.
    Liu F.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (06): : 1205 - 1214
  • [23] TransKD: Transformer Knowledge Distillation for Efficient Semantic Segmentation
    Liu, Ruiping
    Yang, Kailun
    Roitberg, Alina
    Zhang, Jiaming
    Peng, Kunyu
    Liu, Huayao
    Wang, Yaonan
    Stiefelhagen, Rainer
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (12) : 20933 - 20949
  • [24] A Patch Diversity Transformer for Domain Generalized Semantic Segmentation
    He, Pei
    Jiao, Licheng
    Shang, Ronghua
    Liu, Xu
    Liu, Fang
    Yang, Shuyuan
    Zhang, Xiangrong
    Wang, Shuang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (10) : 14138 - 14150
  • [25] CMLFormer: CNN and Multiscale Local-Context Transformer Network for Remote Sensing Images Semantic Segmentation
    Wu, Honglin
    Zhang, Min
    Huang, Peng
    Tang, Wenlong
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 7233 - 7241
  • [26] Evaluating Transformer-based Semantic Segmentation Networks for Pathological Image Segmentation
    Cam Nguyen
    Asad, Zuhayr
    Deng, Ruining
    Huo, Yuankai
    MEDICAL IMAGING 2022: IMAGE PROCESSING, 2022, 12032
  • [27] Fourier Domain Adaptation for the Identification of Grape Leaf Diseases
    Wang, Jing
    Wu, Qiufeng
    Liu, Tianci
    Wang, Yuqi
    Li, Pengxian
    Yuan, Tianhao
    Ji, Ziyang
    APPLIED SCIENCES-BASEL, 2024, 14 (09):
  • [28] ETFT: Equiangular Tight Frame Transformer for Imbalanced Semantic Segmentation
    Jeong, Seonggyun
    Heo, Yong Seok
    SENSORS, 2024, 24 (21)
  • [29] TBFormer: three-branch efficient transformer for semantic segmentation
    Wei, Can
    Wei, Yan
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (04) : 3661 - 3672
  • [30] Global and edge enhanced transformer for semantic segmentation of remote sensing
    Wang, Hengyou
    Li, Xiao
    Huo, Lianzhi
    Hu, Changmiao
    APPLIED INTELLIGENCE, 2024, 54 (07) : 5658 - 5673