Tobacco Leaf Segmentation Based on Improved MASK RCNN Algorithm and SAM Model

被引:1
作者
Zhang, Weizheng [1 ]
Wang, Yuefeng [1 ]
Shen, Guangcai [2 ]
Li, Canlin [1 ]
Li, Meng [3 ]
Guo, Yingcheng [2 ]
机构
[1] Zhengzhou Univ Light Ind, Coll Comp Sci & Technol, Zhengzhou 450001, Peoples R China
[2] Yunnan Tobacco Co, Baoshan Branch, Baoshan 678000, Peoples R China
[3] Zhengzhou Univ Light Ind, Coll Tobacco Sci & Engn, Zhengzhou 450001, Peoples R China
关键词
Tobacco leaf; occlusion; mask region-based convolutional neural networks (MASK RCNN); SAM; image segmentation;
D O I
10.1109/ACCESS.2023.3316364
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High-precision segmentation of tobacco leaves is a prerequisite for analysis of phenotypic information. Challenges such as mutual occlusion and fuzzy edges make leaf segmentation difficult. This paper proposes an improved algorithm based on the Mask Region-based Convolutional Neural Networks (MASK RCNN) model and an instance segmentation method based on the SAM model to address these challenges. First, the MASK RCNN model is enhanced by incorporating a feature fusion layer and a hybrid attention mechanism, which improves the segmentation performance. The improved MASK RCNN model achieves an Avg.MIoU metric of approximately 85.10%, which is an improvement of 11.10% over the original algorithm. It also achieves an Avg.MPA metric of about 84.94%, indicating an improvement of 10.84%. Second, the Segment Anything Model (SAM) model is presented for the first time for tobacco leaf segmentation, providing empirical support for its application in the tobacco field. The SAM model demonstrates accurate segmentation of tobacco leaf images at different growth stages, demonstrating its good generality. In conclusion, the proposed methods effectively address the challenges in tobacco leaf segmentation, resulting in improved accuracy and performance. These techniques provide significant technical support for tobacco leaf phenotype research.
引用
收藏
页码:103102 / 103114
页数:13
相关论文
共 20 条
  • [1] Future Scenarios for Plant Phenotyping
    Fiorani, Fabio
    Schurr, Ulrich
    [J]. ANNUAL REVIEW OF PLANT BIOLOGY, VOL 64, 2013, 64 : 267 - 291
  • [2] Attributes based skin lesion detection and recognition: A mask RCNN and transfer learning-based deep learning framework
    Khan, Muhammad Attique
    Akram, Tallha
    Zhang, Yu-Dong
    Sharif, Muhammad
    [J]. PATTERN RECOGNITION LETTERS, 2021, 143 : 58 - 66
  • [3] Data Augmentation for Leaf Segmentation and Counting Tasks in Rosette Plants
    Kuznichov, Dmitry
    Zvirin, Alon
    Honen, Yaron
    Kimmel, Ron
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 2580 - 2589
  • [4] Measuring plant growth characteristics using smartphone based image analysis technique in controlled environment agriculture
    Li, Cheng
    Adhikari, Ranjeeta
    Yao, Yuan
    Miller, Alexander G.
    Kalbaugh, Kirby
    Li, Daoliang
    Nemali, Krishna
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 168
  • [5] Classification of Hyperspectral Image Based on Double-Branch Dual-Attention Mechanism Network
    Li, Rui
    Zheng, Shunyi
    Duan, Chenxi
    Yang, Yang
    Wang, Xiqi
    [J]. REMOTE SENSING, 2020, 12 (03)
  • [6] Finely-grained annotated datasets for image-based plant phenotyping
    Minervini, Massimo
    Fischbach, Andreas
    Scharr, Hanno
    Tsaftaris, Sotirios A.
    [J]. PATTERN RECOGNITION LETTERS, 2016, 81 : 80 - 89
  • [7] A novel approach for unsupervised image segmentation fusion of plant leaves based on G-mutual information
    Nikbakhsh, Navid
    Baleghi, Yasser
    Agahi, Hamzeh
    [J]. MACHINE VISION AND APPLICATIONS, 2020, 32 (01)
  • [8] Pape J., 2015, Proceedings of the Computer Vision Problems in Plant Phenotyping, p3.1, DOI [10.5244/C.29.CVPPP.3, DOI 10.5244/C.29.CVPPP.3]
  • [9] End-to-End Instance Segmentation with Recurrent Attention
    Ren, Mengye
    Zemel, Richard S.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 293 - 301
  • [10] Recurrent Instance Segmentation
    Romera-Paredes, Bernardino
    Torr, Philip Hilaire Sean
    [J]. COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910 : 312 - 329