A Brief Review of Random Forests for Water Scientists and Practitioners and Their Recent History in Water Resources

被引:447
作者
Tyralis, Hristos [1 ]
Papacharalampous, Georgia [2 ]
Langousis, Andreas [3 ]
机构
[1] Elefsina Air Base, Air Force Support Command, Hellen Air Force, Elefsina 19200, Greece
[2] Natl Tech Univ Athens, Sch Civil Engn, Dept Water Resources & Environm Engn, Iroon Polytech 5, Zografos 15780, Greece
[3] Univ Patras, Sch Engn, Dept Civil Engn, Univ Campus, Patras 26504, Greece
关键词
classification; data-driven; hydrological modeling; hydrology; machine learning; prediction; quantile regression forests; supervised learning; variable importance metrics; MACHINE LEARNING-METHODS; VARIABLE IMPORTANCE MEASURES; SUPPORT VECTOR MACHINE; NEURAL-NETWORK MODELS; ARTIFICIAL-INTELLIGENCE; BIG DATA; OVERLAND-FLOW; SOIL-MOISTURE; LAND-COVER; UPSCALING EVAPOTRANSPIRATION;
D O I
10.3390/w11050910
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Random forests (RF) is a supervised machine learning algorithm, which has recently started to gain prominence in water resources applications. However, existing applications are generally restricted to the implementation of Breiman's original algorithm for regression and classification problems, while numerous developments could be also useful in solving diverse practical problems in the water sector. Here we popularize RF and their variants for the practicing water scientist, and discuss related concepts and techniques, which have received less attention from the water science and hydrologic communities. In doing so, we review RF applications in water resources, highlight the potential of the original algorithm and its variants, and assess the degree of RF exploitation in a diverse range of applications. Relevant implementations of random forests, as well as related concepts and techniques in the R programming language, are also covered.
引用
收藏
页数:37
相关论文
共 336 条
[71]   BART: BAYESIAN ADDITIVE REGRESSION TREES [J].
Chipman, Hugh A. ;
George, Edward I. ;
McCulloch, Robert E. .
ANNALS OF APPLIED STATISTICS, 2010, 4 (01) :266-298
[72]   State-of-the-Art for Modelling Reservoir Inflows and Management Optimization [J].
Choong, Shi-Mei ;
El-Shafie, A. .
WATER RESOURCES MANAGEMENT, 2015, 29 (04) :1267-1282
[73]  
Clémençon S, 2013, J MACH LEARN RES, V14, P39
[74]   Tree-Based Ranking Methods [J].
Clemencon, Stephan ;
Vayatis, Nicolas .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2009, 55 (09) :4316-4336
[75]   Big data: Some statistical issues [J].
Cox, D. R. ;
Kartsonaki, Christiana ;
Keogh, Ruth H. .
STATISTICS & PROBABILITY LETTERS, 2018, 136 :111-115
[76]   Statistical thinking for 21st century scientists [J].
Cox, D. R. ;
Efron, Bradley .
SCIENCE ADVANCES, 2017, 3 (06)
[77]  
Criminisil A, 2011, FOUND TRENDS COMPUT, V7, P81, DOI [10.1561/0600000035, 10.1501/0000000035]
[78]   Random forests for classification in ecology [J].
Cutler, D. Richard ;
Edwards, Thomas C., Jr. ;
Beard, Karen H. ;
Cutler, Adele ;
Hess, Kyle T. .
ECOLOGY, 2007, 88 (11) :2783-2792
[79]   Observations of net soil exchange of CO2 in a dryland show experimental warming increases carbon losses in biocrust soils [J].
Darrouzet-Nardi, Anthony ;
Reed, Sasha C. ;
Grote, Edmund E. ;
Belnap, Jayne .
BIOGEOCHEMISTRY, 2015, 126 (03) :363-378
[80]   HydroTest: A web-based toolbox of evaluation metrics for the standardised assessment of hydrological forecasts [J].
Dawson, C. W. ;
Abrahart, R. J. ;
See, L. M. .
ENVIRONMENTAL MODELLING & SOFTWARE, 2007, 22 (07) :1034-1052