Analysis and Impact Evaluation of Missing Data Imputation in Day-ahead PV Generation Forecasting

被引:3
作者
Kim, Taeyoung [1 ]
Ko, Woong [2 ]
Kim, Jinho [1 ]
机构
[1] Gwangju Inst Sci & Technol, Sch Integrated Technol, Gwangju 61005, South Korea
[2] Gwangju Inst Sci & Technol, Res Inst Solar & Sustainable Energies, Gwangju 61005, South Korea
来源
APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 01期
关键词
PV forecasting; missing data imputation; support vector regression; linear interpolation; mode imputation; k-nearest neighbors; multivariate imputation by chained equations; CHAINED EQUATIONS; TIME-SERIES;
D O I
10.3390/app9010204
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Over the past decade, PV power plants have increasingly contributed to power generation. However, PV power generation widely varies due to environmental factors; thus, the accurate forecasting of PV generation becomes essential. Meanwhile, weather data for environmental factors include many missing values; for example, when we estimated the missing values in the precipitation data of the Korea Meteorological Agency, they amounted to similar to 16% from 2015-2016, and further, 19% of the weather data were missing for 2017. Such missing values deteriorate the PV power generation prediction performance, and they need to be eliminated by filling in other values. Here, we explore the impact of missing data imputation methods that can be used to replace these missing values. We apply four missing data imputation methods to the training data and test data of the prediction model based on support vector regression. When the k-nearest neighbors method is applied to the test data, the prediction performance yields results closest to those for the original data with no missing values, and the prediction model's performance is stable even when the missing data rate increases. Therefore, we conclude that the most appropriate missing data imputation for application to PV forecasting is the KNN method.
引用
收藏
页数:18
相关论文
共 31 条
  • [1] [Anonymous], 2018, HIGHLIGHTS REN21 REN
  • [2] Multiple imputation by chained equations: what is it and how does it work?
    Azur, Melissa J.
    Stuart, Elizabeth A.
    Frangakis, Constantine
    Leaf, Philip J.
    [J]. INTERNATIONAL JOURNAL OF METHODS IN PSYCHIATRIC RESEARCH, 2011, 20 (01) : 40 - 49
  • [3] Batista GE., 2002, HIS, V87, P48
  • [4] Batista GEAPA, 2003, APPL ARTIF INTELL, V17, P519, DOI 10.1080/08839510390219309
  • [5] Campozano L., 2014, Maskana, V5, P99, DOI DOI 10.18537/MSKN.05.01.07
  • [6] A New Missing Data Imputation Algorithm Applied to Electrical Data Loggers
    Crespo Turrado, Concepcion
    Sanchez Lasheras, Fernando
    Luis Calvo-Rolle, Jose
    Jose Pinon-Pazos, Andres
    de Cos Juez, Francisco Javier
    [J]. SENSORS, 2015, 15 (12) : 31069 - 31082
  • [7] Missing Data Imputation of Solar Radiation Data under Different Atmospheric Conditions
    Crespo Turrado, Concepcion
    Meizoso Lopez, Maria del Carmen
    Sanchez Lasheras, Fernando
    Rodriguez Gomez, Benigno Antonio
    Calvo Rolle, Jose Luis
    de Cos Juez, Francisco Javier
    [J]. SENSORS, 2014, 14 (11) : 20382 - 20399
  • [8] SVR-Based Model to Forecast PV Power Generation under Different Weather Conditions
    Das, Utpal Kumar
    Tey, Kok Soon
    Seyedmahmoudian, Mehdi
    Idris, Mohd Yamani Idna
    Mekhilef, Saad
    Horan, Ben
    Stojcevski, Alex
    [J]. ENERGIES, 2017, 10 (07)
  • [9] K nearest neighbours with mutual information for simultaneous classification and missing data imputation
    Garcia-Laencina, Pedro J.
    Sancho-Gomez, Jose-Luis
    Figueiras-Vidal, Anibal R.
    Verleysen, Michel
    [J]. NEUROCOMPUTING, 2009, 72 (7-9) : 1483 - 1493
  • [10] Gilani SyedIhtsham-ul-Haq., 2011, International Journal of Environmental Science and Development, V2, P188, DOI [10.7763/IJESD.2011.V2.122, DOI 10.7763/IJESD.2011.V2.122]