Synthetic bootstrapping of convolutional neural networks for semantic plant part segmentation

被引：42

作者：

Barth, R. ^{[1
,3
]}

IJsselmuiden, J. ^{[2
]}

Hemming, J. ^{[1
]}

Van Henten, E. J. ^{[2
]}

机构：

[1] Wageningen Univ & Res, Greenhouse Hort, POB 644, NL-6700 AP Wageningen, Netherlands

[2] Wageningen Univ & Res, Farm Technol Grp, POB 16, NL-6700 AA Wageningen, Netherlands

[3] Harvard Univ, Biorobot Lab, 60 Oxford St, Cambridge, MA 02138 USA

来源：

COMPUTERS AND ELECTRONICS IN AGRICULTURE | 2019年 / 161卷

基金：

欧盟地平线“2020”;

关键词：

Computer vision; Semantic segmentation; Synthetic dataset; Bootstrapping; Big data; PEPPER PLANTS; IMAGE; LOCALIZATION;

D O I：

10.1016/j.compag.2017.11.040

中图分类号：

S [农业科学];

学科分类号：

09 ;

摘要：

A current bottleneck of state-of-the-art machine learning methods for image segmentation in agriculture, e.g. convolutional neural networks (CNNs), is the requirement of large manually annotated datasets on a per-pixel level. In this paper, we investigated how related synthetic images can be used to bootstrap CNNs for successful learning as compared to other learning strategies. We hypothesise that a small manually annotated empirical dataset is sufficient for fine-tuning a synthetically bootstrapped CNN. Furthermore we investigated (i) multiple deep learning architectures, (ii) the correlation between synthetic and empirical dataset size on part segmentation performance, (iii) the effect of post-processing using conditional random fields (CRF) and (iv) the generalisation performance on other related datasets. For this we have performed 7 experiments using the Capsicum mutton (bell or sweet pepper) dataset containing 50 empirical and 10,500 synthetic images with 7 pixel-level annotated part classes. Results confirmed our hypothesis that only 30 empirical images were required to obtain the highest performance on all 7 classes (mean IOU = 0.40) when a CNN was bootstrapped on related synthetic data. Furthermore we found optimal empirical performance when a VGG-16 network was modified to include a trous spatial pyramid pooling. Adding CRF only improved performance on the synthetic data. Training binary classifiers did not improve results. We have found a positive correlation between dataset size and performance. For the synthetic dataset, learning stabilises around 3000 images. Generalisation to other related datasets proved possible.

引用

页码：291 / 304

页数：14

共 50 条

[1] Crack segmentation in the wild using convolutional neural networks and bootstrapping
Ahmad, Tasweer
Gharehbaghi, Vahidreza
Li, Jian
Bennett, Caroline
Lequesne, Remy
EARTHQUAKE ENGINEERING AND RESILIENCE, 2023, 2 (03): : 348 - 363
[2] Semantic Segmentation of Sheep Organs by Convolutional Neural Networks
Hassan, Syed Ibrahim
Stommel, Martin
Lowe, Andrew
Zhang, Qi
Xu, Weiliang
2019 INTERNATIONAL CONFERENCE ON IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ), 2019,
[3] Deep Context Convolutional Neural Networks for Semantic Segmentation
Yang, Wenbin
Zhou, Quan
Fan, Yawen
Gao, Guangwei
Wu, Songsong
Ou, Weihua
Lu, Huimin
Cheng, Jie
Latecki, Longin Jan
COMPUTER VISION, PT I, 2017, 771 : 696 - 704
[4] Semantic Segmentation of Bioimages Using Convolutional Neural Networks
Wiehman, Stiaan
de Villiers, Hendrik
2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 624 - 631
[5] Deep convolutional neural networks for semantic segmentation of cracks
Wang, Jia-Ji
Liu, Yu-Fei
Nie, Xin
Mo, Y. L.
STRUCTURAL CONTROL & HEALTH MONITORING, 2022, 29 (01):
[6] Efficient Yet Deep Convolutional Neural Networks for Semantic Segmentation
Kamran, Sharif Amit
Sabbir, Ali Shihab
2018 INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT INFORMATICS (SAIN), 2018, : 123 - 130
[7] Semantic Segmentation of Hyperspectral Imaging Using Convolutional Neural Networks
A. Mukhin
G. Danil
R. Paringer
Optical Memory and Neural Networks, 2022, 31 : 38 - 47
[8] Convolutional neural networks rarely learn shape for semantic segmentation
Zhang, Yixin
Mazurowski, Maciej A.
PATTERN RECOGNITION, 2024, 146
[9] DOMAIN ADAPTATION FOR SEMANTIC SEGMENTATION USING CONVOLUTIONAL NEURAL NETWORKS
Schenkel, Fabian
Middelmann, Wolfgang
2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 728 - 731
[10] Semantic Segmentation With Light Field Imaging and Convolutional Neural Networks
Jia, Chen
Shi, Fan
Zhao, Meng
Zhang, Yao
Cheng, Xu
Wang, Mianzhao
Chen, Shengyong
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70

← 1 2 3 4 5 →