Generating segmentation masks of herbarium specimens and a data set for training segmentation models using deep learning

被引:22
|
作者
White, Alexander E. [1 ,2 ]
Dikow, Rebecca B. [1 ]
Baugh, Makinnon [3 ]
Jenkins, Abigail [3 ]
Frandsen, Paul B. [1 ,3 ]
机构
[1] Smithsonian Inst, Data Sci Lab, Off Chief Informat Officer, Washington, DC 20560 USA
[2] Smithsonian Inst, Dept Bot, Natl Museum Nat Hist, Washington, DC 20560 USA
[3] Brigham Young Univ, Dept Plant & Wildlife Sci, Provo, UT 84602 USA
来源
APPLICATIONS IN PLANT SCIENCES | 2020年 / 8卷 / 06期
关键词
deep learning; digitized herbarium specimens; ferns; machine learning; semantic segmentation; U-Net; PLANTS;
D O I
10.1002/aps3.11352
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Premise Digitized images of herbarium specimens are highly diverse with many potential sources of visual noise and bias. The systematic removal of noise and minimization of bias must be achieved in order to generate biological insights based on the plants rather than the digitization and mounting practices involved. Here, we develop a workflow and data set of high-resolution image masks to segment plant tissues in herbarium specimen images and remove background pixels using deep learning. Methods and Results We generated 400 curated, high-resolution masks of ferns using a combination of automatic and manual tools for image manipulation. We used those images to train a U-Net-style deep learning model for image segmentation, achieving a final Sorensen-Dice coefficient of 0.96. The resulting model can automatically, efficiently, and accurately segment massive data sets of digitized herbarium specimens, particularly for ferns. Conclusions The application of deep learning in herbarium sciences requires transparent and systematic protocols for generating training data so that these labor-intensive resources can be generalized to other deep learning applications. Segmentation ground-truth masks are hard-won data, and we share these data and the model openly in the hopes of furthering model training and transfer learning opportunities for broader herbarium applications.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Portrait Segmentation Using Ensemble of Heterogeneous Deep-Learning Models
    Kim, Yong-Woon
    Byun, Yung-Cheol
    Krishna, Addapalli V. N.
    ENTROPY, 2021, 23 (02) : 1 - 20
  • [32] Enhancing plant morphological trait identification in herbarium collections through deep learning-based segmentation
    Ariouat, Hanane
    Sklab, Youcef
    Prifti, Edi
    Zucker, Jean-Daniel
    Chenin, Eric
    APPLICATIONS IN PLANT SCIENCES, 2025,
  • [33] Hippocampus segmentation on noncontrast CT using deep learning
    Porter, Evan
    Fuentes, Patricia
    Siddiqui, Zaid
    Thompson, Andrew
    Levitin, Ronald
    Solis, David
    Myziuk, Nick
    Guerrero, Thomas
    MEDICAL PHYSICS, 2020, 47 (07) : 2950 - 2961
  • [34] A Systematic Analysis of CMR Segmentation Using Deep Learning
    Parikh, Yogita
    Koringa, Hasmukh
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2022, 13 (03): : 519 - 529
  • [35] Automatic segmentation of leukocytes images using deep learning
    Backes, Andre Ricardo
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (05) : 4259 - 4266
  • [36] Wildland fires detection and segmentation using deep learning
    Akhloufi, Moulay A.
    Tokime, Roger Booto
    Elassady, Hassan
    PATTERN RECOGNITION AND TRACKING XXIX, 2018, 10649
  • [37] Segmentation of seagrass blade images using deep learning
    Mehrubeoglu, Mehrube
    Vargas, Isaac
    Huang, Chi
    Cammarata, Kirk
    REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2021, 2021, 11736
  • [38] Survey on semantic segmentation using deep learning techniques
    Lateef, Fahad
    Ruichek, Yassine
    NEUROCOMPUTING, 2019, 338 : 321 - 348
  • [39] Cardiac MRI Semantic Segmentation for Ventricles and Myocardium Using Deep Learning
    Mukisa, Racheal
    Bansal, Arvind K.
    INTELLIGENT COMPUTING, VOL 3, 2024, 2024, 1018 : 169 - 188
  • [40] Myocardium Segmentation using Two-Step Deep Learning with Smoothed Masks by Gaussian Blur
    Slobodzian, Vitalii
    Radiuk, Pavlo
    Zingailo, Anastasiia
    Barmak, Olexander
    Krak, Iurii
    6TH INTERNATIONAL CONFERENCE ON INFORMATICS & DATA-DRIVEN MEDICINE, IDDM 2023, 2023, 3609