Comparison of missing data imputation methods using weather data

被引:2
作者
Nida, Hafiza [1 ]
Kashif, Muhammad [1 ]
Khan, Muhammad Imran [1 ]
Ghamkhar, Madiha [1 ]
机构
[1] Univ Agr Faisalabad, Fac Sci, Dept Math & Stat, Faisalabad, Pakistan
来源
PAKISTAN JOURNAL OF AGRICULTURAL SCIENCES | 2023年 / 60卷 / 02期
关键词
Rainfall; temperature; missing data; imputation methods; root mean square error; TEMPERATURE; PAKISTAN; CLIMATE; CROP;
D O I
10.21162/PAKJAS/23.228
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Researchers and data analysts commonly experience challenges while dealing with missing data for analyzing large data sets in their respective field of studies. It is necessary to handle missing data properly to obtain better and more reliable outcomes about any research. The objective of this research is to evaluate different imputation techniques for handling missing observations occurred in the weather data. For this purpose, weather data of the variables: daily rainfall, maximum temperature (Tmax) and minimum temperature (Tmin) of 23 stations of Pakistan have been taken from Pakistan Metrological department for the years 1981 to 2020. There are about 14610 total observations of each variable while each variable has different number of missing observations, called as size of missingness, at different stations. The techniques: mean imputation, k nearest neighbors (KNN) imputation, predictive mean matching (PMM) imputation and sample imputation have been considered for the estimation of missing observations found while analyzing data of each station. The minimal value of root mean square error (RMSE) is considered to decide about station-wise imputation technique because the size of missingness varied from station to station. The KNN technique is the most appropriate to estimate the missing observations of the rainfall variables for all the stations while mean imputation technique is recommended for Tmax and Tmin data; as compared to other imputation methods.
引用
收藏
页码:327 / 336
页数:10
相关论文
共 50 条
  • [31] From Missing Data Imputation to Data Generation
    Neves, Diogo Telmo
    Alves, Joao
    Naik, Marcel Ganesh
    Proenca, Alberto Jose
    Prasser, Fabian
    [J]. JOURNAL OF COMPUTATIONAL SCIENCE, 2022, 61
  • [32] Improved imputation methods for missing data in two-occasion successive sampling
    Singh, Garib Nath
    Jaiswal, Ashok Kumar
    Pandey, Awadhesh K.
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2023, 52 (06) : 2010 - 2029
  • [33] Multiple imputation for missing data
    Patrician, PA
    [J]. RESEARCH IN NURSING & HEALTH, 2002, 25 (01) : 76 - 84
  • [34] Methods for imputation of missing values in air quality data sets
    Junninen, H
    Niska, H
    Tuppurainen, K
    Ruuskanen, J
    Kolehmainen, M
    [J]. ATMOSPHERIC ENVIRONMENT, 2004, 38 (18) : 2895 - 2907
  • [35] Missing data incremental imputation through tree based methods
    Conversano, C
    Cappelli, C
    [J]. COMPSTAT 2002: PROCEEDINGS IN COMPUTATIONAL STATISTICS, 2002, : 455 - 460
  • [36] Influence of Data Distribution in Missing Data Imputation
    Santos, Miriam Seoane
    Soares, Jastin Pompeu
    Abreu, Pedro Henriques
    Araujo, Helder
    Santos, Joao
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, AIME 2017, 2017, 10259 : 285 - 294
  • [37] Evaluation of Missing Data Imputation Methods for an Enhanced Distributed PV Generation Prediction
    Sundararajan, Aditya
    Sarwat, Arif I.
    [J]. PROCEEDINGS OF THE FUTURE TECHNOLOGIES CONFERENCE (FTC) 2019, VOL 1, 2020, 1069 : 590 - 609
  • [38] Data variability in the imputation quality of missing data
    Stochero, Elisandra Lucia Moro
    Lucio, Alessandro Dal'Col
    Jacobi, Luciane Flores
    [J]. ACTA SCIENTIARUM-AGRONOMY, 2024, 46
  • [39] Efficient Imputation Methods to Handle Missing Data in Sample Surveys
    Singh, G. N.
    Jaiswal, Ashok K.
    [J]. JOURNAL OF STATISTICAL THEORY AND PRACTICE, 2022, 16 (03)
  • [40] Investigation of Reliability Coefficients According to Missing Data Imputation Methods
    Akin Arikan, Cigdem
    Soysal, Sumeyra
    [J]. HACETTEPE UNIVERSITESI EGITIM FAKULTESI DERGISI-HACETTEPE UNIVERSITY JOURNAL OF EDUCATION, 2018, 33 (02): : 316 - 336