A RAINFALL FORECASTING METHOD USING MACHINE LEARNING MODELS AND ITS APPLICATION TO THE FUKUOKA CITY CASE

被引:49
作者
Sumi, S. Monira [1 ]
Zaman, M. Faisal [1 ,2 ]
Hirose, Hideo [1 ]
机构
[1] Kyushu Inst Technol, Dept Syst Design & Informat, Iizuka, Fukuoka, Japan
[2] Dublin City Univ, Sch Elect Engn, Dublin 9, Ireland
关键词
rainfall forecasting; machine learning; multi-model method; pre-processing; model ranking; TIME-SERIES; PREDICTION; REGRESSION; SELECTION;
D O I
10.2478/v10006-012-0062-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the present article, an attempt is made to derive optimal data-driven machine learning methods for forecasting an average daily and monthly rainfall of the Fukuoka city in Japan. This comparative study is conducted concentrating on three aspects: modelling inputs, modelling methods and pre-processing techniques. A comparison between linear correlation analysis and average mutual information is made to find an optimal input technique. For the modelling of the rainfall, a novel hybrid multi-model method is proposed and compared with its constituent models. The models include the artificial neural network, multivariate adaptive regression splines, the k-nearest neighbour, and radial basis support vector regression. Each of these methods is applied to model the daily and monthly rainfall, coupled with a pre-processing technique including moving average and principal component analysis. In the first stage of the hybrid method, sub-models from each of the above methods are constructed with different parameter settings. In the second stage, the sub-models are ranked with a variable selection technique and the higher ranked models are selected based on the leave-one-out cross-validation error. The forecasting of the hybrid model is performed by the weighted combination of the finally selected models.
引用
收藏
页码:841 / 854
页数:14
相关论文
共 42 条
[1]   Multi-model data fusion for river flow forecasting: an evaluation of six alternative methods based on two contrasting catchments [J].
Abrahart, RJ ;
See, L .
HYDROLOGY AND EARTH SYSTEM SCIENCES, 2002, 6 (04) :655-670
[2]  
[Anonymous], 1990, Classical and modern regression with applications
[3]   Hybrid Classification Ensemble Using Topology-preserving Clustering [J].
Baruque, Bruno ;
Porras, Santiago ;
Corchado, Emilio .
NEW GENERATION COMPUTING, 2011, 29 (03) :329-344
[4]   Experimentally optimal ν in support vector regression for different noise models and parameter settings [J].
Chalimourda, A ;
Schölkopf, B ;
Smola, AJ .
NEURAL NETWORKS, 2004, 17 (01) :127-141
[5]   Practical selection of SVM parameters and noise estimation for SVM regression [J].
Cherkassky, V ;
Ma, YQ .
NEURAL NETWORKS, 2004, 17 (01) :113-126
[6]   Improving daily reservoir inflow forecasts with model combination [J].
Coulibaly, P ;
Haché, M ;
Fortin, V ;
Bobée, B .
JOURNAL OF HYDROLOGIC ENGINEERING, 2005, 10 (02) :91-99
[7]   Hydrological modelling using artificial neural networks [J].
Dawson, CW ;
Wilby, RL .
PROGRESS IN PHYSICAL GEOGRAPHY-EARTH AND ENVIRONMENT, 2001, 25 (01) :80-108
[8]   Constraints of artificial neural networks for rainfall-runoff modelling: trade-offs in hydrological state representation and model evaluation [J].
de Vos, NJ ;
Rientjes, THM .
HYDROLOGY AND EARTH SYSTEM SCIENCES, 2005, 9 (1-2) :111-126
[9]  
Deng Y.-F., 2005, P INT C MACH LEARN C, V2, P734
[10]   COMPARING PREDICTIVE ACCURACY [J].
DIEBOLD, FX ;
MARIANO, RS .
JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 1995, 13 (03) :253-263