Spatial-channel transformer network based on mask-RCNN for efficient mushroom instance segmentation

被引：0

作者：

Wang, Jiaoling ^{[1
,2
,4
]}

Song, Weidong ^{[2
]}

Zheng, Wengang ^{[3
]}

Feng, Qingchun ^{[3
]}

Wang, Mingfei ^{[3
]}

Zhao, Chunjiang ^{[1
,3
]}

机构：

[1] Northwest Agr & Forestry Univ, Xian 712199, Peoples R China

[2] Minist Agr & Rural Affairs, Nanjing Inst Agr Mechanizat, Nanjing 210014, Peoples R China

[3] Beijing Acad Agr & Forestry Sci, Intelligent Equipment Technol Res Ctr, Beijing 100097, Peoples R China

[4] Zhejiang Univ, Coll Biosyst Engn & Food Sci, Zhejiang Prov Key Lab Agr Intelligent Equipment &, Hangzhou 310058, Peoples R China

来源：

INTERNATIONAL JOURNAL OF AGRICULTURAL AND BIOLOGICAL ENGINEERING | 2024年 / 17卷 / 04期

关键词：

edible mushrooms; picking; instance segmentation; deep learning; algorithm; WHEAT FIELDS; RECOGNITION;

D O I：

10.25165/j.ijabe.20241704.8987

中图分类号：

S2 [农业工程];

学科分类号：

0828 ;

摘要：

Edible mushrooms are rich in nutrients; however, harvesting mainly relies on manual labor. Coarse localization of each mushroom is necessary to enable a robotic arm to accurately pick edible mushrooms. Previous studies used detection algorithms that did not consider mushroom pixel-level information. When these algorithms are combined with a depth map, the information is lost. Moreover, in instance segmentation algorithms, convolutional neural network (CNN)-based methods are lightweight, and the extracted features are not correlated. To guarantee real-time location detection and improve the accuracy of mushroom segmentation, this study proposed a new spatial-channel transformer network model based on Mask-CNN (SCTMask-RCNN). The fusion of Mask-RCNN with the self-attention mechanism extracts the global correlation outcomes of image features from the channel and spatial dimensions. Subsequently, Mask-RCNN was used to maintain a lightweight structure and extract local features using a spatial pooling pyramidal structure to achieve multiscale local feature fusion and improve detection accuracy. The results showed that the SCT-Mask-RCNN method achieved a segmentation accuracy of 0.750 on segm_Precision_mAP and detection accuracy of 0.638 on Bbox_Precision_mAP. Compared to existing methods, the proposed method improved the accuracy of the evaluation metrics Bbox_Precision_mAP and segm_Precision_mAP by over 2% and 5%, respectively.

引用

页码：227 / 235

页数：9

共 27 条

[21] SCTRANS: A TRANSFORMER NETWORK BASED ON THE SPATIAL AND CHANNEL ATTENTION FOR CLOUD DETECTION
Jiao, Wenke
Zhang, Yongjun
Zhang, Bin
Wan, Yi
2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 615 - 618
[22] DSCA-PSPNet: Dynamic spatial-channel attention pyramid scene parsing network for sugarcane field segmentation in satellite imagery
Yuan, Yujian
Yang, Lina
Chang, Kan
Huang, Youju
Yang, Haoyan
Wang, Jiale
FRONTIERS IN PLANT SCIENCE, 2024, 14
[23] PIS-Net: Efficient weakly supervised instance segmentation network based on annotated points for rice field weed identification
Chen, Hao
Zhang, Youliu
He, Caijie
Chen, Chuting
Zhang, Yaoxuan
Chen, Zhiying
Jiang, Yu
Lin, Caixia
Ma, Ruijun
Qi, Long
SMART AGRICULTURAL TECHNOLOGY, 2024, 9
[24] Ore image segmentation Based on Multiscale Parallel Efficient Channel Attention U-Network
Wang, Xiaoli
Feng, Mengguang
Tang, Xiangxiang
Peng, Tao
Li, Zhongmei
Yang, Chunhua
IFAC PAPERSONLINE, 2024, 58 (22): : 101 - 106
[25] Research on Non-Destructive Testing of Curtain Wall Hangings Based on Efficient Channel Attention Vision Transformer Network
Gao, Chongliang
Gao, Jie
Cao, Yajun
IEEE ACCESS, 2024, 12 : 50689 - 50697
[26] D2T-Net: A dual-domain transformer network exploiting spatial and channel dimensions for semantic segmentation of urban mobile laser scanning point clouds
Luo, Ziwei
Zeng, Ziyin
Wan, Jie
Tang, Wei
Jin, Zhongge
Xie, Zhong
Xu, Yongyang
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 132
[27] scSE-NL V-Net: A Brain Tumor Automatic Segmentation Method Based on Spatial and Channel "Squeeze-and-Excitation" Network With Non-local Block
Zhou, Juhua
Ye, Jianming
Liang, Yu
Zhao, Jialu
Wu, Yan
Luo, Siyuan
Lai, Xiaobo
Wang, Jianqing
FRONTIERS IN NEUROSCIENCE, 2022, 16

← 1 2 3 →