Oxygen-18 prediction using machine learning in the Baltic Artesian Basin groundwater

Cited by: 0
Authors
Samalavicius, Vytautas [1 ]
Gadeikiene, Sonata [1 ]
Zarzojus, Gintaras [1 ]
Gadeikis, Saulius [1 ]
Lekstutyte, Ieva [1 ]
Affiliations
[1] Vilnius Univ, Inst Geosci, Ciurlionio Str 21-27, LT-03101 Vilnius, Lithuania
Keywords
Machine learning; Isotopes; Oxygen-18; Baltic Artesian Basin; Hydrochemistry; CAMBRIAN-VENDIAN AQUIFER; GEOCHEMICAL EVOLUTION; GLACIAL ORIGIN; ENVIRONMENTAL ISOTOPES; STABLE ISOTOPES; NORTHERN PART; SYSTEM; WATER; PALAEOGROUNDWATER; OXYGEN
DOI
10.1007/s00477-024-02896-9
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Oxygen-18 is an important indicator of groundwater origin, recharge sources, and age. However, it is measured far less often than basic geochemical parameters such as major ion composition. This study aims to develop a machine learning (ML) model that predicts oxygen-18 values for the whole Baltic Artesian Basin (BAB), using ionic composition (Na⁺, K⁺, Mg²⁺, Ca²⁺, Cl⁻, SO₄²⁻, HCO₃⁻), coordinates, and well depth as input variables. A dataset of 567 distinct sample entries was compiled from previous research and from databases of Lithuania, Latvia, and Estonia. Twelve individual ML models were tested. The predictions of each model were evaluated using three performance metrics: R-squared (R²), mean absolute error (MAE), and root mean square error (RMSE). Overfitting was assessed by comparing the error metrics of the train and test sets and by inspecting correlation plots of predicted vs. actual oxygen-18 values. The best-performing models (the Gradient Boosting, Random Forest, and K-neighbors regressors) achieved R² values greater than 0.8. However, overfitting was observed in the Gradient Boosting and Random Forest models. Hyperparameter tuning increased the accuracy of the K-neighbors regressor without introducing overfitting. The results show that the tuned K-neighbors regressor is the best fit: R² 0.82-0.84, MAE 0.98-0.99‰, RMSE 1.67-1.74‰. This study demonstrates that machine learning can be successfully applied to predict oxygen-18 values in groundwater at the basin scale.
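The evaluation described in the abstract (R², MAE, and RMSE compared between train and test sets as an overfitting check) can be sketched as follows. This is a minimal illustration only, not the authors' code: the data are synthetic, the two input features and the target formula are invented for the example, the K-neighbors regressor is a plain unweighted pure-Python version, and the helper names (`r2_score`, `mae`, `rmse`, `knn_predict`) are hypothetical.

```python
import math
import random


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean square error."""
    return math.sqrt(
        sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    )


def knn_predict(X_train, y_train, x, k=5):
    """Unweighted mean of the targets of the k nearest training points
    (Euclidean distance). Real inputs such as ion concentrations, depth,
    and coordinates would need scaling first; that step is omitted here."""
    order = sorted(range(len(X_train)), key=lambda i: math.dist(X_train[i], x))
    nearest = order[:k]
    return sum(y_train[i] for i in nearest) / len(nearest)


# Synthetic stand-in data (assumption): two features in [0, 1] and a noisy
# linear target. These are NOT values from the study's dataset.
rng = random.Random(0)
X = [[rng.uniform(0, 1), rng.uniform(0, 1)] for _ in range(200)]
y = [-10.0 + 4.0 * a - 6.0 * b + rng.gauss(0, 0.3) for a, b in X]

# Simple 80/20 hold-out split.
X_train, y_train = X[:160], y[:160]
X_test, y_test = X[160:], y[160:]

pred_train = [knn_predict(X_train, y_train, x) for x in X_train]
pred_test = [knn_predict(X_train, y_train, x) for x in X_test]

# A large gap between train and test metrics would suggest overfitting.
for name, yt, yp in [("train", y_train, pred_train), ("test", y_test, pred_test)]:
    print(f"{name}: R2={r2_score(yt, yp):.3f} "
          f"MAE={mae(yt, yp):.3f} RMSE={rmse(yt, yp):.3f}")
```

Comparing the two printed rows mirrors the check named in the abstract: train and test metrics that stay close together indicate a model that generalizes, while a much better train score points to overfitting.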
Pages: 765-787
Number of pages: 23