Spatial-channel transformer network based on mask-RCNN for efficient mushroom instance segmentation

被引:0
|
作者
Wang, Jiaoling [1 ,2 ,4 ]
Song, Weidong [2 ]
Zheng, Wengang [3 ]
Feng, Qingchun [3 ]
Wang, Mingfei [3 ]
Zhao, Chunjiang [1 ,3 ]
机构
[1] Northwest Agr & Forestry Univ, Xian 712199, Peoples R China
[2] Minist Agr & Rural Affairs, Nanjing Inst Agr Mechanizat, Nanjing 210014, Peoples R China
[3] Beijing Acad Agr & Forestry Sci, Intelligent Equipment Technol Res Ctr, Beijing 100097, Peoples R China
[4] Zhejiang Univ, Coll Biosyst Engn & Food Sci, Zhejiang Prov Key Lab Agr Intelligent Equipment &, Hangzhou 310058, Peoples R China
关键词
edible mushrooms; picking; instance segmentation; deep learning; algorithm; WHEAT FIELDS; RECOGNITION;
D O I
10.25165/j.ijabe.20241704.8987
中图分类号
S2 [农业工程];
学科分类号
0828 ;
摘要
Edible mushrooms are rich in nutrients; however, harvesting mainly relies on manual labor. Coarse localization of each mushroom is necessary to enable a robotic arm to accurately pick edible mushrooms. Previous studies used detection algorithms that did not consider mushroom pixel-level information. When these algorithms are combined with a depth map, the information is lost. Moreover, in instance segmentation algorithms, convolutional neural network (CNN)-based methods are lightweight, and the extracted features are not correlated. To guarantee real-time location detection and improve the accuracy of mushroom segmentation, this study proposed a new spatial-channel transformer network model based on Mask-CNN (SCTMask-RCNN). The fusion of Mask-RCNN with the self-attention mechanism extracts the global correlation outcomes of image features from the channel and spatial dimensions. Subsequently, Mask-RCNN was used to maintain a lightweight structure and extract local features using a spatial pooling pyramidal structure to achieve multiscale local feature fusion and improve detection accuracy. The results showed that the SCT-Mask-RCNN method achieved a segmentation accuracy of 0.750 on segm_Precision_mAP and detection accuracy of 0.638 on Bbox_Precision_mAP. Compared to existing methods, the proposed method improved the accuracy of the evaluation metrics Bbox_Precision_mAP and segm_Precision_mAP by over 2% and 5%, respectively.
引用
收藏
页码:227 / 235
页数:9
相关论文
共 27 条
  • [21] SCTRANS: A TRANSFORMER NETWORK BASED ON THE SPATIAL AND CHANNEL ATTENTION FOR CLOUD DETECTION
    Jiao, Wenke
    Zhang, Yongjun
    Zhang, Bin
    Wan, Yi
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 615 - 618
  • [22] DSCA-PSPNet: Dynamic spatial-channel attention pyramid scene parsing network for sugarcane field segmentation in satellite imagery
    Yuan, Yujian
    Yang, Lina
    Chang, Kan
    Huang, Youju
    Yang, Haoyan
    Wang, Jiale
    FRONTIERS IN PLANT SCIENCE, 2024, 14
  • [23] PIS-Net: Efficient weakly supervised instance segmentation network based on annotated points for rice field weed identification
    Chen, Hao
    Zhang, Youliu
    He, Caijie
    Chen, Chuting
    Zhang, Yaoxuan
    Chen, Zhiying
    Jiang, Yu
    Lin, Caixia
    Ma, Ruijun
    Qi, Long
    SMART AGRICULTURAL TECHNOLOGY, 2024, 9
  • [24] Ore image segmentation Based on Multiscale Parallel Efficient Channel Attention U-Network
    Wang, Xiaoli
    Feng, Mengguang
    Tang, Xiangxiang
    Peng, Tao
    Li, Zhongmei
    Yang, Chunhua
    IFAC PAPERSONLINE, 2024, 58 (22): : 101 - 106
  • [25] Research on Non-Destructive Testing of Curtain Wall Hangings Based on Efficient Channel Attention Vision Transformer Network
    Gao, Chongliang
    Gao, Jie
    Cao, Yajun
    IEEE ACCESS, 2024, 12 : 50689 - 50697
  • [26] D2T-Net: A dual-domain transformer network exploiting spatial and channel dimensions for semantic segmentation of urban mobile laser scanning point clouds
    Luo, Ziwei
    Zeng, Ziyin
    Wan, Jie
    Tang, Wei
    Jin, Zhongge
    Xie, Zhong
    Xu, Yongyang
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 132
  • [27] scSE-NL V-Net: A Brain Tumor Automatic Segmentation Method Based on Spatial and Channel "Squeeze-and-Excitation" Network With Non-local Block
    Zhou, Juhua
    Ye, Jianming
    Liang, Yu
    Zhao, Jialu
    Wu, Yan
    Luo, Siyuan
    Lai, Xiaobo
    Wang, Jianqing
    FRONTIERS IN NEUROSCIENCE, 2022, 16