Detection of Invasive Species in Wetlands: Practical DL with Heavily Imbalanced Data

被引:19
作者
Cabezas, Mariano [1 ]
Kentsch, Sarah [2 ]
Tomhave, Luca [2 ]
Gross, Jens [3 ]
Caceres, Maximo Larry Lopez [2 ]
Diez, Yago [4 ]
机构
[1] Univ Sydney, Brain & Mind Ctr, Sydney, NSW 2006, Australia
[2] Yamagata Univ, Fac Agr, Tsuruoka, Yamagata 9978555, Japan
[3] Leibniz Univ Hannover, Inst Phys Geog & Landscape Ecol, D-30167 Hannover, Germany
[4] Yamagata Univ, Fac Sci, Yamagata 9909585, Japan
关键词
unmanned aerial vehicles (UAV)-acquired images; unbalanced data; transfer learning; deep learning; data analysis; PERFORMANCE;
D O I
10.3390/rs12203431
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Deep Learning (DL) has become popular due to its ease of use and accuracy, with Transfer Learning (TL) effectively reducing the number of images needed to solve environmental problems. However, this approach has some limitations which we set out to explore: Our goal is to detect the presence of an invasive blueberry species in aerial images of wetlands. This is a key problem in ecosystem protection which is also challenging in terms of DL due to the severe imbalance present in the data. Results for the ResNet50 network show a high classification accuracy while largely ignoring the blueberry class, rendering these results of limited practical interest to detect that specific class. Moreover, by using loss function weighting and data augmentation results more akin to our practical application, our goals can be obtained. Our experiments regarding TL show that ImageNet weights do not produce satisfactory results when only the final layer of the network is trained. Furthermore, only minor gains are obtained compared with random weights when the whole network is retrained. Finally, in a study of state-of-the-art DL architectures best results were obtained by the ResNeXt architecture with 93.75 True Positive Rate and 98.11 accuracy for the Blueberry class with ResNet50, Densenet, and wideResNet obtaining close results.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 38 条
  • [1] Deng LM, 2015, 2015 12TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), P202
  • [2] Bootstrap re-sampling for unbalanced data in supervised learning
    Dupret, G
    Koda, M
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2001, 134 (01) : 141 - 156
  • [3] GAN-based synthetic medical image augmentation for increased CNN performance in liver lesion classification
    Frid-Adar, Maayan
    Diamant, Idit
    Klang, Eyal
    Amitai, Michal
    Goldberger, Jacob
    Greenspan, Hayit
    [J]. NEUROCOMPUTING, 2018, 321 : 321 - 331
  • [4] Gambella F., 2016, Contemporary Engineering Sciences, V9, P1359, DOI DOI 10.12988/CES.2016.68130
  • [5] Grenzdu┬rffer GJ., 2008, INT ARCH PHOTOGRAM R, V37, P1207, DOI DOI 10.2747/1548-1603.41.4.287
  • [6] Learning from Imbalanced Data
    He, Haibo
    Garcia, Edwardo A.
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (09) : 1263 - 1284
  • [7] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [8] Densely Connected Convolutional Networks
    Huang, Gao
    Liu, Zhuang
    van der Maaten, Laurens
    Weinberger, Kilian Q.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269
  • [9] Iandola F. N., 2016, ARXIV160207360
  • [10] Jung A. B., 2020, imgaug