Impact of Training Set Configurations for Differentiating Plantation Forest Genera with Sentinel-2 Imagery and Machine Learning

被引:3
作者
Higgs, Caley [1 ]
van Niekerk, Adriaan [1 ]
机构
[1] Stellenbosch Univ, Dept Geog & Environm Studies, 82 Ryneveld St, ZA-7600 Stellenbosch, South Africa
关键词
plantation genera classification; uneven training samples; area-proportionate training samples; even training samples; random forests; WORLDVIEW-2; IMAGERY; HYPERSPECTRAL DATA; CLASSIFICATION; DISCRIMINATION; CONSERVATION; LIDAR;
D O I
10.3390/rs14163992
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Forest plantations in South Africa impose genus-specific demands on limited soil moisture. Hence, plantation composition and distribution mapping is critical for water conservation planning. Genus maps are used to quantify the impact of post-harvest genus-exchange activities in the forestry sector. Collecting genus data using in situ methods is costly and time-consuming, especially when performed at regional or national scales. Although remotely sensed data and machine learning show potential for mapping genera at regional scales, the efficacy of such methods is highly dependent on the size and quality of the training data used to build the models. However, it is not known what sampling scheme (e.g., sample size, proportion per genus, and spatial distribution) is most effective to map forest genera over large and complex areas. Using Sentinel-2 imagery as inputs, this study evaluated the effects of different sampling strategies (e.g., even, uneven, and area-proportionate) for training the random forests machine learning classifier to differentiate between Acacia, Eucalyptus, and Pinus trees in South Africa. Sample size (s) was related to the number of input features (n) to better understand the potential impact of sample sparseness. The results show that an even sample with maximum size (100%, s similar to 91n) produced the highest overall accuracy (76.3%). Although larger training set sizes (s > n) resulted in higher OAs, a saturation point was reached at s similar to 64n.
引用
收藏
页数:19
相关论文
共 65 条
  • [1] [Anonymous], 2014, PAP PULP SECT FP M S
  • [2] [Anonymous], 2002, INT J APPL EARTH OBS, DOI DOI 10.1016/S0303-2434(02)00025-9
  • [3] GlobeLand30 as an alternative fine-scale global land cover map: Challenges, possibilities, and implications for developing countries
    Arsanjani, Jamal Jokar
    Tayyebi, Amin
    Vaz, Eric
    [J]. HABITAT INTERNATIONAL, 2016, 55 : 25 - 31
  • [4] Random forest in remote sensing: A review of applications and future directions
    Belgiu, Mariana
    Dragut, Lucian
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 114 : 24 - 31
  • [5] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [6] Classification of coniferous tree species and age classes using hyperspectral data and geostatistical methods
    Buddenbaum, H
    Schlerf, M
    Hill, J
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2005, 26 (24) : 5453 - 5465
  • [7] Budei BC, 2018, REMOTE SENS ENVIRON, V204, P632, DOI [10.1016, 10.1016/j.rse.2017.09.037]
  • [8] Bujang MA., 2017, BIOSTAT PUBLIC HLTH, V14, pe12267
  • [9] Assessing the utility WorldView-2 imagery for tree species mapping in South African subtropical humid forest and the conservation implications: Dukuduku forest patch as case study
    Cho, Moses Azong
    Malahlela, Oupa
    Ramoelo, Abel
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2015, 38 : 349 - 357
  • [10] Clulow A.D., 2011, TT50511 WRC PRET