AutoML for estimating grass height from ETM plus /OLI data from field measurements at a nature reserve

被引:9
作者
de Sa, Nuno Cesar [1 ]
Baratchi, Mitra [2 ]
Buitenhuis, Vincent [2 ]
Cornelissen, Perry [3 ]
van Bodegom, Peter M. [1 ]
机构
[1] Leiden Univ, Inst Environm Sci CML, Leiden, Netherlands
[2] Leiden Univ, Leiden Inst Adv Comp Sci LIACS, Leiden, Netherlands
[3] State Forestry Serv, Amersfoort, Netherlands
关键词
AutoML; machine learning; remote sensing; grassland; Landsat; ABOVEGROUND BIOMASS; RANDOM FOREST; LAND-COVER; VEGETATION; CLASSIFICATION; INDEX; REPRODUCIBILITY; REPLICABILITY; PASTURES; SYSTEM;
D O I
10.1080/15481603.2022.2152304
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Remote sensing (RS) is now a standard tool used for grassland monitoring thanks to the availability of data at an unprecedented spatial and temporal resolution. The approaches to monitor grasslands often rely on the use of vegetation indices (e.g. NDVI) and empirical models trained on field data collected in tandem with the RS data. The best combination of models and features is often found by ad-hoc experimentation by the expert. This "classic" approach does not necessarily result in the best possible model. Automatic machine learning (AutoML) allows to automate this procedure by identifying the best possible pipeline in a data-driven manner. This study assesses the applicability of two distinct AutoML algorithms - AutoSklearn and AutoGluon - to monitor grass height from RS data and to systematically compare them to "classic" RS approaches. Grass height was estimated from Landsat ETM+ and OLI for a well-known conservation area as a case study. The "classic" RS approach emulated all possible ad hoc decisions by comparing all combinations of bands and vegetation indices against a naive use of the AutoML systems. While model selection and optimization are automated within AutoML models, for the "classic" RS approach, we used Bayesian optimization for hyperparameter tuning. We found that AutoML methods outperformed "classic" methods with the test error varying between similar to 1.73 cm +/- 0.02 and similar to 1.78 cm +/- 0.03 while for the "classic" methods it varied between similar to 1.84 cm +/- 0.03 and similar to 2.81 cm +/- 0.02. In the case of the "classic" methods, our exhaustive exploration of the possible feature combinations showed that while vegetation indices were always selected for the best models, which index got selected depended on the algorithm. The performance of AutoML compared to "classic" RS approaches clearly demonstrates the ability of these methods to quickly and effectively identify high-performing models. However, as this work focused on a single case-study, the results cannot be directly generalized to other study areas. Nevertheless, it provided a number of insights into future research opportunities to improve the use of AutoML in RS.
引用
收藏
页码:2164 / 2183
页数:20
相关论文
共 73 条
[1]   Review of Machine Learning Approaches for Biomass and Soil Moisture Retrievals from Remote Sensing Data [J].
Ali, Iftikhar ;
Greifeneder, Felix ;
Stamenkovic, Jelena ;
Neumann, Maxim ;
Notarnicola, Claudia .
REMOTE SENSING, 2015, 7 (12) :16398-16421
[2]   Reproducibility and Replicability in SAR Remote Sensing [J].
Balz, Timo ;
Rocca, Fabio .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 :3834-3843
[3]   Estimating above-ground biomass on mountain meadows and pastures through remote sensing [J].
Barrachina, M. ;
Cristobal, J. ;
Tulla, A. F. .
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2015, 38 :184-192
[4]   Random forest in remote sensing: A review of applications and future directions [J].
Belgiu, Mariana ;
Dragut, Lucian .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 114 :24-31
[5]  
Caruana Rich, 2004, P 21 INT C MACH LEAR, P18
[6]   Optimal Combination of Predictors and Algorithms for Forest Above-Ground Biomass Mapping from Sentinel and SRTM Data [J].
Chen, Lin ;
Wang, Yeqiao ;
Ren, Chunying ;
Zhang, Bai ;
Wang, Zongming .
REMOTE SENSING, 2019, 11 (04)
[7]   Exploring Bamboo Forest Aboveground Biomass Estimation Using Sentinel-2 Data [J].
Chen, Yuyun ;
Li, Longwei ;
Lu, Dengsheng ;
Li, Dengqiu .
REMOTE SENSING, 2019, 11 (01)
[8]   Using the IPOL Journal for Online Reproducible Research in Remote Sensing [J].
Colom, Miguel ;
Dagobert, Tristan ;
de Franchis, Carlo ;
von Gioi, Rafael Grompone ;
Hessel, Charles ;
Morel, Jean-Michel .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 :6384-6390
[9]   Transition of a Sambucus nigra L. dominated woody vegetation into grassland by a multi-species herbivore assemblage [J].
Cornelissen, Perry ;
Gresnigt, Marca C. ;
Vermeulen, Roeland A. ;
Bokdam, Jan ;
Smit, Ruben .
JOURNAL FOR NATURE CONSERVATION, 2014, 22 (01) :84-92
[10]  
Craglia M, 2017, BIG EARTH DATA, V1, P21, DOI 10.1080/20964471.2017.1401284