Generating segmentation masks of herbarium specimens and a data set for training segmentation models using deep learning

被引:22
|
作者
White, Alexander E. [1 ,2 ]
Dikow, Rebecca B. [1 ]
Baugh, Makinnon [3 ]
Jenkins, Abigail [3 ]
Frandsen, Paul B. [1 ,3 ]
机构
[1] Smithsonian Inst, Data Sci Lab, Off Chief Informat Officer, Washington, DC 20560 USA
[2] Smithsonian Inst, Dept Bot, Natl Museum Nat Hist, Washington, DC 20560 USA
[3] Brigham Young Univ, Dept Plant & Wildlife Sci, Provo, UT 84602 USA
来源
APPLICATIONS IN PLANT SCIENCES | 2020年 / 8卷 / 06期
关键词
deep learning; digitized herbarium specimens; ferns; machine learning; semantic segmentation; U-Net; PLANTS;
D O I
10.1002/aps3.11352
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Premise Digitized images of herbarium specimens are highly diverse with many potential sources of visual noise and bias. The systematic removal of noise and minimization of bias must be achieved in order to generate biological insights based on the plants rather than the digitization and mounting practices involved. Here, we develop a workflow and data set of high-resolution image masks to segment plant tissues in herbarium specimen images and remove background pixels using deep learning. Methods and Results We generated 400 curated, high-resolution masks of ferns using a combination of automatic and manual tools for image manipulation. We used those images to train a U-Net-style deep learning model for image segmentation, achieving a final Sorensen-Dice coefficient of 0.96. The resulting model can automatically, efficiently, and accurately segment massive data sets of digitized herbarium specimens, particularly for ferns. Conclusions The application of deep learning in herbarium sciences requires transparent and systematic protocols for generating training data so that these labor-intensive resources can be generalized to other deep learning applications. Segmentation ground-truth masks are hard-won data, and we share these data and the model openly in the hopes of furthering model training and transfer learning opportunities for broader herbarium applications.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Semantic Segmentation of Herbarium Specimens Using Deep Learning Techniques
    Hussein, Burhan Rashid
    Malik, Owais Ahmed
    Ong, Wee-Hong
    Slik, Johan Willem Frederik
    COMPUTATIONAL SCIENCE AND TECHNOLOGY (ICCST 2019), 2020, 603 : 321 - 330
  • [2] Training deep-learning segmentation models from severely limited data
    Zhao, Yao
    Rhee, Dong Joo
    Cardenas, Carlos
    Court, Laurence E.
    Yang, Jinzhong
    MEDICAL PHYSICS, 2021, 48 (04) : 1697 - 1706
  • [3] RePolyp: A Framework for Generating Realistic Colon Polyps with Corresponding Segmentation Masks using Diffusion Models
    Pishva, Alexander K.
    Thambawita, Vajira
    Torreseni, Jim
    Hicks, Steven A.
    2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 47 - 52
  • [4] An overview of industrial image segmentation using deep learning models
    Wang, Guina
    Li, Zhen
    Weng, Guirong
    Chen, Yiyang
    INTELLIGENCE & ROBOTICS, 2025, 5 (01): : 143 - 180
  • [5] Segmentation of retinal arteries and veins using deep learning with limited training data
    Salah, Ahmed Tamer
    Khoriba, Ghada
    Rashed, Essam A.
    9TH INTERNATIONAL CONFERENCE ON MULTIMEDIA AND IMAGE PROCESSING, ICMIP 2024, 2024, : 146 - 151
  • [6] Deep learning based approach for digitized herbarium specimen segmentation
    Triki, Abdelaziz
    Bouaziz, Bassem
    Mahdi, Walid
    Hamed, Hamdi
    Gaikwad, Jitendra
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (20) : 28689 - 28707
  • [7] Deep learning based approach for digitized herbarium specimen segmentation
    Abdelaziz Triki
    Bassem Bouaziz
    Walid Mahdi
    Hamdi Hamed
    Jitendra Gaikwad
    Multimedia Tools and Applications, 2022, 81 : 28689 - 28707
  • [8] Segmentation of Masses on Mammograms Using Data Augmentation and Deep Learning
    Zeiser, Felipe Andre
    da Costa, Cristiano Andre
    Zonta, Tiago
    Marques, Nuno M. C.
    Roehe, Adriana Vial
    Moreno, Marcelo
    Righi, Rodrigo da Rosa
    JOURNAL OF DIGITAL IMAGING, 2020, 33 (04) : 858 - 868
  • [9] Segmentation of Masses on Mammograms Using Data Augmentation and Deep Learning
    Felipe André Zeiser
    Cristiano André da Costa
    Tiago Zonta
    Nuno M. C. Marques
    Adriana Vial Roehe
    Marcelo Moreno
    Rodrigo da Rosa Righi
    Journal of Digital Imaging, 2020, 33 : 858 - 868
  • [10] Deep Open-Set Segmentation in Visual Learning
    Nunes, Ian M.
    Poggi, Marcus
    Oliveira, Hugo
    Pereira, Matheus B.
    dos Santos, Jefersson A.
    2022 35TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2022), 2022, : 314 - 319