Spatial-channel transformer network based on mask-RCNN for efficient mushroom instance segmentation

被引:0
|
作者
Wang, Jiaoling [1 ,2 ,4 ]
Song, Weidong [2 ]
Zheng, Wengang [3 ]
Feng, Qingchun [3 ]
Wang, Mingfei [3 ]
Zhao, Chunjiang [1 ,3 ]
机构
[1] Northwest Agr & Forestry Univ, Xian 712199, Peoples R China
[2] Minist Agr & Rural Affairs, Nanjing Inst Agr Mechanizat, Nanjing 210014, Peoples R China
[3] Beijing Acad Agr & Forestry Sci, Intelligent Equipment Technol Res Ctr, Beijing 100097, Peoples R China
[4] Zhejiang Univ, Coll Biosyst Engn & Food Sci, Zhejiang Prov Key Lab Agr Intelligent Equipment &, Hangzhou 310058, Peoples R China
关键词
edible mushrooms; picking; instance segmentation; deep learning; algorithm; WHEAT FIELDS; RECOGNITION;
D O I
10.25165/j.ijabe.20241704.8987
中图分类号
S2 [农业工程];
学科分类号
0828 ;
摘要
Edible mushrooms are rich in nutrients; however, harvesting mainly relies on manual labor. Coarse localization of each mushroom is necessary to enable a robotic arm to accurately pick edible mushrooms. Previous studies used detection algorithms that did not consider mushroom pixel-level information. When these algorithms are combined with a depth map, the information is lost. Moreover, in instance segmentation algorithms, convolutional neural network (CNN)-based methods are lightweight, and the extracted features are not correlated. To guarantee real-time location detection and improve the accuracy of mushroom segmentation, this study proposed a new spatial-channel transformer network model based on Mask-CNN (SCTMask-RCNN). The fusion of Mask-RCNN with the self-attention mechanism extracts the global correlation outcomes of image features from the channel and spatial dimensions. Subsequently, Mask-RCNN was used to maintain a lightweight structure and extract local features using a spatial pooling pyramidal structure to achieve multiscale local feature fusion and improve detection accuracy. The results showed that the SCT-Mask-RCNN method achieved a segmentation accuracy of 0.750 on segm_Precision_mAP and detection accuracy of 0.638 on Bbox_Precision_mAP. Compared to existing methods, the proposed method improved the accuracy of the evaluation metrics Bbox_Precision_mAP and segm_Precision_mAP by over 2% and 5%, respectively.
引用
收藏
页码:227 / 235
页数:9
相关论文
共 27 条
  • [1] Instance Segmentation of Concrete Defects Based on Improved Mask-RCNN
    Huang C.
    Xie X.
    Zhou Y.
    Li G.
    Bridge Construction, 2023, 53 (06) : 63 - 70
  • [2] Instance Segmentation for Large, Multi-Channel Remote Sensing Imagery Using Mask-RCNN and a Mosaicking Approach
    Carvalho, Osmar Luiz Ferreira de
    de Carvalho Junior, Osmar Abilio
    Albuquerque, Anesmar Olino de
    Bem, Pablo Pozzobon de
    Silva, Cristiano Rosa
    Ferreira, Pedro Henrique Guimaraes
    Moura, Rebeca dos Santos de
    Gomes, Roberto Arnaldo Trancoso
    Guimaraes, Renato Fontes
    Borges, Dibio Leandro
    REMOTE SENSING, 2021, 13 (01) : 1 - 24
  • [3] Instance-Based Segmentation for Boundary Detection of Neuropathic Ulcers Through Mask-RCNN
    Gamage, H. V. L. C.
    Wijesinghe, W. O. K. I. S.
    Perera, Indika
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: WORKSHOP AND SPECIAL SESSIONS, 2019, 11731 : 511 - 522
  • [4] Score-based mask edge improvement of Mask-RCNN for segmentation of fruit and vegetables
    Hameed, Khurram
    Chai, Douglas
    Rassau, Alexander
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 190
  • [5] Segmentation of the Germ from Corn Kernel CT Image Based on the Mask-RCNN
    Zheng, Zhaohui
    Fan, Ben
    Ren, Liuyang
    Fu, Hanyu
    Lv, Lanlan
    Yang, Deyong
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 94 - 97
  • [6] Optimized Mask-RCNN model for particle chain segmentation based on improved online ferrograph sensor
    Wang, Shuo
    Wan, Miao
    Wu, Tonghai
    Bai, Zichen
    Wang, Kunpeng
    FRICTION, 2024, 12 (06) : 1194 - 1213
  • [7] ISTR: Mask-Embedding-Based Instance Segmentation Transformer
    Hu, Jie
    Lu, Yao
    Zhang, Shengchuan
    Cao, Liujuan
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2895 - 2907
  • [8] Breast lesions segmentation and classification in a two-stage process based on Mask-RCNN and Transfer Learning
    Hama Soltani
    Mohamed Amroune
    Issam Bendib
    Mohamed-Yassine Haouam
    Elhadj Benkhelifa
    Muhammad Moazam Fraz
    Multimedia Tools and Applications, 2024, 83 : 35763 - 35780
  • [9] Breast lesions segmentation and classification in a two-stage process based on Mask-RCNN and Transfer Learning
    Soltani, Hama
    Amroune, Mohamed
    Bendib, Issam
    Haouam, Mohamed-Yassine
    Benkhelifa, Elhadj
    Fraz, Muhammad Moazam
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 35763 - 35780
  • [10] SCTransNet: Spatial-Channel Cross Transformer Network for Infrared Small Target Detection
    Yuan, Shuai
    Qin, Hanlin
    Yan, Xiang
    Akhtar, Naveed
    Mian, Ajmal
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15