Synthetic bootstrapping of convolutional neural networks for semantic plant part segmentation

被引:42
|
作者
Barth, R. [1 ,3 ]
IJsselmuiden, J. [2 ]
Hemming, J. [1 ]
Van Henten, E. J. [2 ]
机构
[1] Wageningen Univ & Res, Greenhouse Hort, POB 644, NL-6700 AP Wageningen, Netherlands
[2] Wageningen Univ & Res, Farm Technol Grp, POB 16, NL-6700 AA Wageningen, Netherlands
[3] Harvard Univ, Biorobot Lab, 60 Oxford St, Cambridge, MA 02138 USA
基金
欧盟地平线“2020”;
关键词
Computer vision; Semantic segmentation; Synthetic dataset; Bootstrapping; Big data; PEPPER PLANTS; IMAGE; LOCALIZATION;
D O I
10.1016/j.compag.2017.11.040
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
A current bottleneck of state-of-the-art machine learning methods for image segmentation in agriculture, e.g. convolutional neural networks (CNNs), is the requirement of large manually annotated datasets on a per-pixel level. In this paper, we investigated how related synthetic images can be used to bootstrap CNNs for successful learning as compared to other learning strategies. We hypothesise that a small manually annotated empirical dataset is sufficient for fine-tuning a synthetically bootstrapped CNN. Furthermore we investigated (i) multiple deep learning architectures, (ii) the correlation between synthetic and empirical dataset size on part segmentation performance, (iii) the effect of post-processing using conditional random fields (CRF) and (iv) the generalisation performance on other related datasets. For this we have performed 7 experiments using the Capsicum mutton (bell or sweet pepper) dataset containing 50 empirical and 10,500 synthetic images with 7 pixel-level annotated part classes. Results confirmed our hypothesis that only 30 empirical images were required to obtain the highest performance on all 7 classes (mean IOU = 0.40) when a CNN was bootstrapped on related synthetic data. Furthermore we found optimal empirical performance when a VGG-16 network was modified to include a trous spatial pyramid pooling. Adding CRF only improved performance on the synthetic data. Training binary classifiers did not improve results. We have found a positive correlation between dataset size and performance. For the synthetic dataset, learning stabilises around 3000 images. Generalisation to other related datasets proved possible.
引用
收藏
页码:291 / 304
页数:14
相关论文
共 50 条
  • [1] Crack segmentation in the wild using convolutional neural networks and bootstrapping
    Ahmad, Tasweer
    Gharehbaghi, Vahidreza
    Li, Jian
    Bennett, Caroline
    Lequesne, Remy
    EARTHQUAKE ENGINEERING AND RESILIENCE, 2023, 2 (03): : 348 - 363
  • [2] Semantic Segmentation of Sheep Organs by Convolutional Neural Networks
    Hassan, Syed Ibrahim
    Stommel, Martin
    Lowe, Andrew
    Zhang, Qi
    Xu, Weiliang
    2019 INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ), 2019,
  • [3] Deep Context Convolutional Neural Networks for Semantic Segmentation
    Yang, Wenbin
    Zhou, Quan
    Fan, Yawen
    Gao, Guangwei
    Wu, Songsong
    Ou, Weihua
    Lu, Huimin
    Cheng, Jie
    Latecki, Longin Jan
    COMPUTER VISION, PT I, 2017, 771 : 696 - 704
  • [4] Semantic Segmentation of Bioimages Using Convolutional Neural Networks
    Wiehman, Stiaan
    de Villiers, Hendrik
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 624 - 631
  • [5] Deep convolutional neural networks for semantic segmentation of cracks
    Wang, Jia-Ji
    Liu, Yu-Fei
    Nie, Xin
    Mo, Y. L.
    STRUCTURAL CONTROL & HEALTH MONITORING, 2022, 29 (01):
  • [6] Efficient Yet Deep Convolutional Neural Networks for Semantic Segmentation
    Kamran, Sharif Amit
    Sabbir, Ali Shihab
    2018 INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT INFORMATICS (SAIN), 2018, : 123 - 130
  • [7] Semantic Segmentation of Hyperspectral Imaging Using Convolutional Neural Networks
    A. Mukhin
    G. Danil
    R. Paringer
    Optical Memory and Neural Networks, 2022, 31 : 38 - 47
  • [8] Convolutional neural networks rarely learn shape for semantic segmentation
    Zhang, Yixin
    Mazurowski, Maciej A.
    PATTERN RECOGNITION, 2024, 146
  • [9] DOMAIN ADAPTATION FOR SEMANTIC SEGMENTATION USING CONVOLUTIONAL NEURAL NETWORKS
    Schenkel, Fabian
    Middelmann, Wolfgang
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 728 - 731
  • [10] Semantic Segmentation With Light Field Imaging and Convolutional Neural Networks
    Jia, Chen
    Shi, Fan
    Zhao, Meng
    Zhang, Yao
    Cheng, Xu
    Wang, Mianzhao
    Chen, Shengyong
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70