An Improved Imputation Method for Accurate Prediction of Imputed Dataset Based Radon Time Series

Times cited: 3
Authors
Mir, Adil Aslam [1, 2]
Celebi, Fatih Vehbi [1]
Rafique, Muhammad [3]
Hussain, Lal [2, 4]
Almasoud, Ahmed S. [5 ]
Alajmi, Masoud [6 ]
Al-Wesabi, Fahd N. [7 ]
Hilal, Anwer Mustafa [8 ]
Affiliations
[1] Ankara Yildirim Beyazit Univ, Dept Comp Engn, TR-06010 Ankara, Turkey
[2] Univ Azad Jammu & Kashmir, Dept Comp Sci & Informat Technol, King Abdullah Campus, Muzaffarabad 13100, Azad Kashmir, Pakistan
[3] Univ Azad Jammu & Kashmir, Dept Phys, King Abdullah Campus, Muzaffarabad 13100, Azad Kashmir, Pakistan
[4] Univ Azad Jammu & Kashmir, Dept Comp Sci & IT, Neelum Campus, Athmuqam 13230, Azad Kashmir, Pakistan
[5] Prince Sultan Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 12435, Saudi Arabia
[6] Taif Univ, Coll Comp & Informat Technol, Dept Comp Engn, At Taif 21944, Saudi Arabia
[7] King Khalid Univ, Coll Sci & Art Mahayil, Dept Comp Sci, Abha 62529, Saudi Arabia
[8] Prince Sattam Bin Abdulaziz Univ, Dept Comp & Self Dev, Al Kharj 16278, Saudi Arabia
Keywords
Radon; Time series analysis; Soil; Earthquakes; Temperature measurement; Support vector machines; Predictive models; Predictive mean matching; missingness; radon concentration; support vector machine; imputation; IBFI; EARTHQUAKE PRECURSOR; MULTIPLE IMPUTATION; MISSING DATA; GAS; GROUNDWATER; REGRESSION; ANOMALIES; AIR;
DOI
10.1109/ACCESS.2022.3151892
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This article evaluates a new methodology, imputation by feature importance (IBFI), in terms of how well its imputed datasets serve subsequent regression on soil radon gas concentration (SRGC) time-series data. The time series used for experimentation were collected over a fourteen-month period that included four seismic events. In these experiments, IBFI imputed the missing patterns in the investigated time series more efficiently than the traditionally used imputation methods, viz. mean, median, mode, predictive mean matching (PMM), and hot-deck imputation. IBFI was applied under three missingness settings, missing not at random (MNAR), missing completely at random (MCAR), and missing at random (MAR), with missingness percentages ranging from 10% to 30%. The imputed datasets, nine for each imputation method, were then used to predict the attribute of interest, radon concentration (RN), with the thoron, temperature, relative humidity, and pressure time series as independent attributes. A support vector machine (SVM) with a linear kernel was used as the learning algorithm, and its performance was taken as a measure of how efficiently and without bias the missing values had been imputed. Four statistical performance measures were calculated for this assessment: root mean squared log error (RMSLE), root mean square error (RMSE), mean squared error (MSE), and mean absolute percentage error (MAPE). The findings show that the IBFI-imputed dataset yields a better-fitted model: models built on IBFI-imputed time series give more accurate predictions than those built on mean, median, mode, PMM, and hot-deck imputed time series. The PMM and median imputed time series also perform close to the IBFI-imputed time series.
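The evaluation measures and two of the baseline imputations named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation and does not reproduce IBFI: it only covers the MCAR setting, mean or median filling, and the four error measures. All function names (`mask_mcar`, `impute_constant`, `mse`, `rmse`, `rmsle`, `mape`) are hypothetical, and MAPE is assumed here to be reported in percent.

```python
import math
import random
import statistics


def mask_mcar(values, fraction, seed=0):
    """Copy `values`, setting a uniformly random `fraction` of entries to None (MCAR)."""
    rng = random.Random(seed)
    n_missing = round(len(values) * fraction)
    missing_idx = set(rng.sample(range(len(values)), n_missing))
    return [None if i in missing_idx else v for i, v in enumerate(values)]


def impute_constant(values, strategy="mean"):
    """Fill None entries with the mean or median of the observed entries."""
    observed = [v for v in values if v is not None]
    if strategy == "mean":
        fill = statistics.mean(observed)
    else:
        fill = statistics.median(observed)
    return [fill if v is None else v for v in values]


def mse(actual, predicted):
    """Mean squared error."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


def rmse(actual, predicted):
    """Root mean square error."""
    return math.sqrt(mse(actual, predicted))


def rmsle(actual, predicted):
    """Root mean squared log error; every value must be greater than -1."""
    squared = [(math.log1p(p) - math.log1p(a)) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(actual))


def mape(actual, predicted):
    """Mean absolute percentage error in percent; undefined if `actual` contains 0."""
    return 100.0 * sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / len(actual)
```

A typical use would be to mask a complete series with `mask_mcar(series, 0.1)`, fill it with `impute_constant(masked, "median")`, and compare the filled series against the original with `rmse` or `mape`. The MAR and MNAR settings would need a masking rule that depends on other attributes or on the masked value itself, which this sketch does not attempt.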
Pages: 20590-20601
Number of pages: 12