A comparative analysis of machine learning approaches to gap filling meteorological datasets

被引:0
|
作者
Lalic, Branislava [1 ]
Stapleton, Adam [2 ]
Vergauwen, Thomas [3 ,4 ]
Caluwaerts, Steven [3 ,4 ]
Eichelmann, Elke [5 ]
Roantree, Mark [6 ]
机构
[1] Univ Novi Sad, Fac Agr, Novi Sad 21000, Serbia
[2] Dublin City Univ, Sch Comp, Dublin, Ireland
[3] Royal Meteorol Inst Belgium RMI, B-1066 Uccle, Belgium
[4] Univ Ghent, Phys & Astron, B-9000 Ghent, Belgium
[5] Univ Coll Dublin, Sch Biol & Environm Sci, Dublin, Ireland
[6] Dublin City Univ, Insight Ctr Data Analyt, Dublin, Ireland
关键词
Micrometeorology; Gap-filling; Machine learning; Feature Sets; TEMPERATURE; ALGORITHMS; FLUXES;
D O I
10.1007/s12665-024-11982-8
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Observational data of the Earth's weather and climate at the level of ground-based weather stations are prone to gaps due to a variety of causes. These gaps can inhibit scientific research as they impede the use of numerical models for agricultural, meteorological and climatological applications as well as introducing analytic biases. In this research, different machine learning techniques are evaluated together with traditional approaches to gap filling automated weather station data. When filling gaps for a specific data stream, data from neighbouring weather stations are used in addition to reanalysis data from the European Centre for Medium-Range Weather Forecasts atmospheric reanalyses of the global climate, ERA-5 Land. A novel gap creation method is introduced that provides 100% coverage in sampling the dataset while ensuring that the sampled data are randomly distributed. Gap filling across a range of different gap lengths and target variables are compared using a range of error functions. The variables selected for modelling are mean air temperature, dew point, mean relative humidity and leaf wetness. Our results show that models perform best on gap-filling temperature and dew point with worst performance on leaf wetness. As expected, model performance decreases with increasing gap length. Comparison between machine learning and reanalysis approaches show very promising results from a number of the machine learning models.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Machine learning for encrypted malicious traffic detection: Approaches, datasets and comparative study
    Wang, Zihao
    Fok, Kar Wai
    Thing, Vrizlynn L. L.
    COMPUTERS & SECURITY, 2022, 113
  • [2] Comparative Analysis of Machine Learning Classifiers on Bioinformatics and Clinical Datasets
    Ranadive, Falguni
    Surti, Akil
    Sharma, Priyanka
    PROCEEDINGS OF THE 2019 6TH INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2019, : 608 - 611
  • [3] A comparative evaluation of machine learning ensemble approaches for disease prediction using multiple datasets
    Mahajan, Palak
    Uddin, Shahadat
    Hajati, Farshid
    Moni, Mohammad Ali
    Gide, Ergun
    HEALTH AND TECHNOLOGY, 2024, 14 (03) : 597 - 613
  • [4] A comparative evaluation of machine learning ensemble approaches for disease prediction using multiple datasets
    Palak Mahajan
    Shahadat Uddin
    Farshid Hajati
    Mohammad Ali Moni
    Ergun Gide
    Health and Technology, 2024, 14 : 597 - 613
  • [5] A comparative study of Sentiment Analysis Machine Learning Approaches
    Maada, Loukmane
    Al Fararni, Khalid
    Aghoutane, Badraddine
    Fattah, Mohammed
    Farhaoui, Yousef
    2022 2ND INTERNATIONAL CONFERENCE ON INNOVATIVE RESEARCH IN APPLIED SCIENCE, ENGINEERING AND TECHNOLOGY (IRASET'2022), 2022, : 526 - 530
  • [6] Enhancing classification performance in imbalanced datasets: A comparative analysis of machine learning models
    Dube, Lindani
    Verster, Tanja
    DATA SCIENCE IN FINANCE AND ECONOMICS, 2023, 3 (04): : 354 - 379
  • [7] Comparative Analysis of Machine Learning Approaches for the Early Diagnosis of Keratoconus
    Subramanian, P.
    Ramesh, G. P.
    Parameshachari, B. D.
    DISTRIBUTED COMPUTING AND OPTIMIZATION TECHNIQUES, ICDCOT 2021, 2022, 903 : 241 - 250
  • [8] Machine Learning with Color and Multispectral Images: Comparative Analysis of Approaches
    Mishra, Awakash
    Sony, Anubhav
    Jamuna, K., V
    2024 2ND WORLD CONFERENCE ON COMMUNICATION & COMPUTING, WCONF 2024, 2024,
  • [9] A Comparative Review of Sentimental Analysis Using Machine Learning and Deep Learning Approaches
    Nagelli, Archana
    Saleena, B.
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2023, 22 (03)
  • [10] Machine Learning Approaches to Advanced Outlier Detection in Psychological Datasets
    Abri, Khoula Al.
    Sidhu, Manjit Singh
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2024, 15 (01) : 13 - 20