Feature selection and hyper parameters optimization for short-term wind power forecast

被引:47
作者
Huang, Hui [1 ,2 ]
Jia, Rong [1 ,3 ]
Shi, Xiaoyu [1 ,3 ]
Liang, Jun [4 ]
Dang, Jian [1 ,3 ]
机构
[1] Xian Univ Technol, Inst Water Resources & Hydroelect Engn, Xian 710048, Peoples R China
[2] North China Univ Water Resources & Elect Power, Sch Elect Power, Zhengzhou 450011, Henan, Peoples R China
[3] Xian Key Lab Intelligent Energy, Xian 710048, Peoples R China
[4] Cardiff Univ, Sch Engn, Cardiff CF24 3AA, Wales
基金
英国工程与自然科学研究理事会;
关键词
Wind power forecasting; Bayesian hyperparameters optimization; Feature selection; Gradient boosted regression trees; ENSEMBLE; PREDICTION; UNCERTAINTY; NETWORK; SUPPORT; TREES;
D O I
10.1007/s10489-021-02191-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate wind power forecasting plays an increasingly significant role in power grid normal operation with large-scale wind energy. The precise and stable forecasting of wind power with short computational time is still a challenge owing to various uncertainty factors. This study proposes a hybrid model based on a data prepossessing strategy, a modified Bayesian optimization algorithm, and the gradient boosted regression trees approach. More specifically, the powerful information mining ability of maximum information coefficient is used to select the important input features, and the modified Bayesian optimization algorithm is introduced to optimize the hyperparameters of the gradient boosted regression trees to acquire more satisfactory forecasting precision and computation cost. Datasets from a Chinese wind farm are used in case studies to analyze the prediction accuracy, stability, and computation efficiency of the proposed model. The point forecasting and multi-step forecasting results reveal that the performance of the hybrid forecasting model positively exceeds all the contrasted models. The developed model is extremely useful for enhancing prediction precision and is a reasonable and valid tool for online prediction with increasing data.
引用
收藏
页码:6752 / 6770
页数:19
相关论文
共 42 条
[1]   Ensemble of relevance vector machines and boosted trees for electricity price forecasting [J].
Agrawal, Rahul Kumar ;
Muchahary, Frankle ;
Tripathi, Madan Mohan .
APPLIED ENERGY, 2019, 250 :540-548
[2]   Tree-based ensemble methods for predicting PV power generation and their comparison with support vector regression [J].
Ahmad, Muhammad Waseem ;
Mourshed, Monjur ;
Rezgui, Yacine .
ENERGY, 2018, 164 :465-474
[3]  
Calandra R, 2014, IEEE INT CONF ROBOT, P1951, DOI 10.1109/ICRA.2014.6907117
[4]  
Callens A, 2020, APPL OCEAN RES, V104
[5]   Multi-model ensemble for short-term traffic flow prediction under normal and abnormal conditions [J].
Chen, Xiqun ;
Zhang, Shuaichao ;
Li, Li .
IET INTELLIGENT TRANSPORT SYSTEMS, 2019, 13 (02) :260-268
[6]   Basic Enhancement Strategies When Using Bayesian Optimization for Hyperparameter Tuning of Deep Neural Networks [J].
Cho, Hyunghun ;
Kim, Yongjin ;
Lee, Eunjung ;
Choi, Daeyoung ;
Lee, Yongjae ;
Rhee, Wonjong .
IEEE ACCESS, 2020, 8 :52588-52608
[7]   Using CNN with Bayesian optimization to identify cerebral micro-bleeds [J].
Doke, Piyush ;
Shrivastava, Dhiraj ;
Pan, Chichun ;
Zhou, Qinghua ;
Zhang, Yu-Dong .
MACHINE VISION AND APPLICATIONS, 2020, 31 (05)
[8]   Correlation research of phase angle variation and coating performance by means of Pearson's correlation coefficient [J].
Fu, Ting ;
Tang, Xiaobo ;
Cai, Zekai ;
Zuo, Yu ;
Tang, Yuming ;
Zhao, Xuhui .
PROGRESS IN ORGANIC COATINGS, 2020, 139
[9]   Pressure drop modelling in sand filters in micro-irrigation using gradient boosted regression trees [J].
Garcia Nieto, Paulino J. ;
Garcia-Gonzalo, Esperanza ;
Arbat, Gerard ;
Duran-Ros, Miquel ;
Ramirez de Cartagena, Francisco ;
Puig-Bargues, Jaume .
BIOSYSTEMS ENGINEERING, 2018, 171 :41-51
[10]   Google Vizier: A Service for Black-Box Optimization [J].
Golovin, Daniel ;
Solnik, Benjamin ;
Moitra, Subhodeep ;
Kochanski, Greg ;
Karro, John ;
Sculley, D. .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :1487-1496