Generating segmentation masks of herbarium specimens and a data set for training segmentation models using deep learning

被引:22
|
作者
White, Alexander E. [1 ,2 ]
Dikow, Rebecca B. [1 ]
Baugh, Makinnon [3 ]
Jenkins, Abigail [3 ]
Frandsen, Paul B. [1 ,3 ]
机构
[1] Smithsonian Inst, Data Sci Lab, Off Chief Informat Officer, Washington, DC 20560 USA
[2] Smithsonian Inst, Dept Bot, Natl Museum Nat Hist, Washington, DC 20560 USA
[3] Brigham Young Univ, Dept Plant & Wildlife Sci, Provo, UT 84602 USA
来源
APPLICATIONS IN PLANT SCIENCES | 2020年 / 8卷 / 06期
关键词
deep learning; digitized herbarium specimens; ferns; machine learning; semantic segmentation; U-Net; PLANTS;
D O I
10.1002/aps3.11352
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Premise Digitized images of herbarium specimens are highly diverse with many potential sources of visual noise and bias. The systematic removal of noise and minimization of bias must be achieved in order to generate biological insights based on the plants rather than the digitization and mounting practices involved. Here, we develop a workflow and data set of high-resolution image masks to segment plant tissues in herbarium specimen images and remove background pixels using deep learning. Methods and Results We generated 400 curated, high-resolution masks of ferns using a combination of automatic and manual tools for image manipulation. We used those images to train a U-Net-style deep learning model for image segmentation, achieving a final Sorensen-Dice coefficient of 0.96. The resulting model can automatically, efficiently, and accurately segment massive data sets of digitized herbarium specimens, particularly for ferns. Conclusions The application of deep learning in herbarium sciences requires transparent and systematic protocols for generating training data so that these labor-intensive resources can be generalized to other deep learning applications. Segmentation ground-truth masks are hard-won data, and we share these data and the model openly in the hopes of furthering model training and transfer learning opportunities for broader herbarium applications.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Review of Semantic Segmentation by Using Deep learning methods
    Rajeswari, B.
    Ram, J. Mani
    Kumar, D. V. T. Praveen
    Harshith, K. L. V. V.
    2024 INTERNATIONAL CONFERENCE ON SOCIAL AND SUSTAINABLE INNOVATIONS IN TECHNOLOGY AND ENGINEERING, SASI-ITE 2024, 2024, : 272 - 277
  • [42] Performance Analysis on Deep Learning Semantic Segmentation with multivariate Training Procedures
    Lourenco, Bernardo
    Santos, Vitor
    Oliveira, Miguel
    Almeida, Tiago
    2020 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2020), 2020, : 89 - 95
  • [43] FWNet: Semantic Segmentation for Full-Waveform LiDAR Data Using Deep Learning
    Shinohara, Takayuki
    Xiu, Haoyi
    Matsuoka, Masashi
    SENSORS, 2020, 20 (12) : 1 - 20
  • [44] Fast body part segmentation and tracking of neonatal video data using deep learning
    Christoph Hoog Antink
    Joana Carlos Mesquita Ferreira
    Michael Paul
    Simon Lyra
    Konrad Heimann
    Srinivasa Karthik
    Jayaraj Joseph
    Kumutha Jayaraman
    Thorsten Orlikowsky
    Mohanasankar Sivaprakasam
    Steffen Leonhardt
    Medical & Biological Engineering & Computing, 2020, 58 : 3049 - 3061
  • [45] Blood Cell Images Segmentation using Deep Learning Semantic Segmentation
    Thanh Tran
    Kwon, Oh-Heum
    Kwon, Ki-Ryong
    Lee, Suk-Hwan
    Kang, Kyung-Won
    2018 IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION ENGINEERING (ICECE 2018), 2018, : 13 - 16
  • [46] Deep learning models and traditional automated techniques for brain tumor segmentation in MRI: a review
    Parvathy Jyothi
    A. Robert Singh
    Artificial Intelligence Review, 2023, 56 : 2923 - 2969
  • [47] Generation of Istanbul road data set using Google Map API for deep learning-based segmentation
    Ozturk, Ozan
    Isik, Mustafa Serkan
    Sariturk, Batuhan
    Seker, Dursun Zafer
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (08) : 2793 - 2812
  • [48] Automated segmentation of RGB-D images into a comprehensive set of building components using deep learning
    Czerniawski, Thomas
    Leite, Fernanda
    ADVANCED ENGINEERING INFORMATICS, 2020, 45 (45)
  • [49] Deep learning models and traditional automated techniques for brain tumor segmentation in MRI: a review
    Jyothi, Parvathy
    Singh, A. Robert
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (04) : 2923 - 2969
  • [50] Fast body part segmentation and tracking of neonatal video data using deep learning
    Antink, Christoph Hoog
    Ferreira, Joana Carlos Mesquita
    Paul, Michael
    Lyra, Simon
    Heimann, Konrad
    Karthik, Srinivasa
    Joseph, Jayaraj
    Jayaraman, Kumutha
    Orlikowsky, Thorsten
    Sivaprakasam, Mohanasankar
    Leonhardt, Steffen
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2020, 58 (12) : 3049 - 3061