Comparison of random forests and other statistical methods for the prediction of lake water level: a case study of the Poyang Lake in China

被引:125
作者
Li, Bing [1 ,2 ]
Yang, Guishan [1 ]
Wan, Rongrong [1 ]
Dai, Xue [1 ,2 ]
Zhang, Yanhui [1 ]
机构
[1] Chinese Acad Sci, Nanjing Inst Geog & Limnol, Key Lab Watershed Geog Sci, Nanjing 210008, Jiangsu, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
来源
HYDROLOGY RESEARCH | 2016年 / 47卷
关键词
artificial neural networks; lake water level; Poyang Lake; random forests; support vector regression; variable importance analysis; ARTIFICIAL NEURAL-NETWORKS; SUPPORT VECTOR MACHINES; RIVER-BASIN; FLUCTUATIONS; REGRESSION; CLIMATE; MODEL; PERFORMANCE; RESPONSES; FLOW;
D O I
10.2166/nh.2016.264
中图分类号
TV21 [水资源调查与水利规划];
学科分类号
081501 ;
摘要
Modeling of hydrological time series is essential for sustainable development and management of lake water resources. This study aims to develop an efficient model for forecasting lake water level variations, exemplified by the Poyang Lake (China) case study. A random forests (RF) model was first applied and compared with artificial neural networks, support vector regression, and a linear model. Three scenarios were adopted to investigate the effect of time lag and previous water levels as model inputs for real-time forecasting. Variable importance was then analyzed to evaluate the influence of each predictor for water level variations. Results indicated that the RF model exhibits the best performance for daily forecasting in terms of root mean square error (RMSE) and coefficient of determination (R-2). Moreover, the highest accuracy was achieved using discharge series at 4-day-ahead and the average water level over the previous week as model inputs, with an average RMSE of 0.25 m for five stations within the lake. In addition, the previous water level was the most efficient predictor for water level forecasting, followed by discharge from the Yangtze River. Based on the performance of the soft computing methods, RF can be calibrated to provide information or simulation scenarios for water management and decision-making.
引用
收藏
页码:69 / 83
页数:15
相关论文
共 55 条
[1]   Forecasting surface water level fluctuations of lake van by artificial neural networks [J].
Altunkaynak, Adduesselam .
WATER RESOURCES MANAGEMENT, 2007, 21 (02) :399-408
[2]   Water level forecasting through fuzzy logic and artificial neural network approaches [J].
Alvisi, S ;
Mascellani, G ;
Franchini, M ;
Bárdossy, A .
HYDROLOGY AND EARTH SYSTEM SCIENCES, 2006, 10 (01) :1-17
[3]  
[Anonymous], 2012, R LANG ENV STAT COMP
[4]   Analysis and prediction of flow from local source in a river basin using a Neuro-fuzzy modeling tool [J].
Aqil, Muhammad ;
Kita, Ichiro ;
Yano, Akira ;
Nishlyama, Soichi .
JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2007, 85 (01) :215-223
[5]   Multi-step-ahead time series prediction using multiple-output support vector regression [J].
Bao, Yukun ;
Xiong, Tao ;
Hu, Zhongyi .
NEUROCOMPUTING, 2014, 129 :482-493
[6]   SIMULATION-MODEL OF THE WATER-BALANCE OF A CROPPED SOIL - SWATRE [J].
BELMANS, C ;
WESSELING, JG ;
FEDDES, RA .
JOURNAL OF HYDROLOGY, 1983, 63 (3-4) :271-286
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]   Seasonal River Discharge Forecasting Using Support Vector Regression: A Case Study in the Italian Alps [J].
Callegari, Mattia ;
Mazzoli, Paolo ;
de Gregorio, Ludovica ;
Notarnicola, Claudia ;
Pasolli, Luca ;
Petitta, Marcello ;
Pistocchi, Alberto .
WATER, 2015, 7 (05) :2494-2515
[9]   A hybrid model coupled with singular spectrum analysis for daily rainfall prediction [J].
Chau, K. W. ;
Wu, C. L. .
JOURNAL OF HYDROINFORMATICS, 2010, 12 (04) :458-473
[10]   A comparative study of population-based optimization algorithms for downstream river flow forecasting by a hybrid neural network model [J].
Chen, X. Y. ;
Chau, K. W. ;
Busari, A. O. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 46 :258-268