Modelling spatiotemporal concentrations of PM2.5 over Nigerian cities using machine learning algorithms and open-source data

被引:2
作者
Abdulraheem, Khadijat Abdulkareem [1 ]
Aina, Yusuf A. [2 ]
Mustapha, Ismail B. [3 ]
Adekunle, Bello Saheed [4 ]
Jimoh, Haruna O. [5 ]
Adeniran, Jamiu Adetayo [6 ]
Olaleye, Abdul Ademola [7 ]
Hamid-Mosaku, Isa Adekunle [8 ]
Nasiru, Aliyu Ishola [9 ]
Abimbola, Ismaila [10 ]
Olatunji, Sunday Olusanya [11 ]
机构
[1] Univ Ilorin, Dept Water Resources & Environm Engn, Ilorin, Nigeria
[2] Yanbu Ind Coll, Dept Geomat Engn Technol, Yanbu PO 30436, Yanbu, Saudi Arabia
[3] Univ Teknol Malaysia, Sch Comp, Dept Comp Sci, Skudai 81310, Johor, Malaysia
[4] Natl Water Resources Inst, Kaduna, Nigeria
[5] Univ Lagos, Dept Urban & Reg Planning, Yaba, Lagos, Nigeria
[6] Univ Ilorin, Dept Chem Engn, Environm Engn Res Lab, Ilorin, Nigeria
[7] Fed Univ Oye Ekiti, Dept Chem, Oye Ekiti, Ekiti, Nigeria
[8] Univ Lagos, Fac Engn, Dept Surveying & Geoinformat, Yaba, Lagos, Nigeria
[9] Univ Ilorin, Dept Informat Technol, Ilorin, Nigeria
[10] ATU Sligo, Dept Environm Sci, Sligo, Ireland
[11] Adekunle Ajasin Univ, Akungba Akoko, Nigeria
关键词
Air pollution; Satellite-derived data; Nighttime light data; Machine learning; CatBoost model; NEURAL-NETWORK MODEL; PARTICULATE MATTER; AIR-POLLUTION; RESPIRATORY-DISEASES; PM10; ASSOCIATION; EXPOSURE; CHINA; TIME;
D O I
10.1007/s40808-024-02192-z
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Increasing attention has been drawn to the concentrations of particulate matter in cities because of the consequent health burden and environmental impacts. Due to its importance, particulate matter has been integrated into the UN SDG 11 as a target for monitoring and fostering sustainable communities. However, the paucity and irregularity of data pose a challenge to achieving the SDGs. This study aims to investigate the concentrations of particulate matter (PM2.5) in Nigerian cities and compare the predictions using machine learning models and open-source data, including satellite-derived data. The influence of meteorological factors, population growth, and human activities on PM2.5 emissions in eleven locations in Nigeria was investigated. The algorithms are linear regression (LR), K nearest neighbor (KNN), decision tree regression (DTR), support vector regression (SVR), artificial neural networks (ANN) and CatBoost (CBT). Hyperparameter optimization of the models was carried out by an exhaustive search of possible values and fivefold cross-validation. The results showed that the SVR, CBT, and ANN mostly predict PM2.5 to be more correlated to the actual targets than the KNN, DTR and LR during the training and test phases. The CatBoost is the best predictor with a root mean square error (RMSE) of 9.88 whereas decision tree regression had the highest error with an RMSE of 15.75. Precipitation made the highest contribution to the CatBoost prediction model, followed by temperature and nighttime light. The findings can be used in the management of particulate matter concentrations in the context of ground data paucity.
引用
收藏
页数:17
相关论文
共 56 条
[1]  
Abubakar I. R., 2018, URBANIZATION ITS IMP, P219, DOI DOI 10.4018/978-1-5225-2659-9.CH011
[2]  
Abulude FO., 2023, ASEAN J Sci Eng, V3, P39, DOI [10.17509/ajse.v3i1.43558, DOI 10.17509/AJSE.V3I1.43558]
[3]   Estimation of Ground PM2.5 Concentrations in Pakistan Using Convolutional Neural Network and Multi-Pollutant Satellite Images [J].
Ahmed, Maqsood ;
Xiao, Zemin ;
Shen, Yonglin .
REMOTE SENSING, 2022, 14 (07)
[4]   Modeling of diurnal pattern of air temperature in a tropical environment: Ile-Ife and Ibadan, Nigeria [J].
R.T A. ;
M.O A. .
Modeling Earth Systems and Environment, 2017, 3 (4) :1421-1439
[5]   Machine learning as a surrogate to building performance simulation: Predicting energy consumption under different operational settings [J].
Ali, Abdulrahim ;
Jayaraman, Raja ;
Mayyas, Ahmad ;
Alaifan, Bader ;
Azar, Elie .
ENERGY AND BUILDINGS, 2023, 286
[6]  
[Anonymous], 2010, Federal Republic of Nigeria, 2006 population and housing census. Priority table Volume III. Population distribution by state
[7]   An application of machine learning techniques to galaxy cluster mass estimation using the MACSIS simulations [J].
Armitage, Thomas J. ;
Kay, Scott T. ;
Barnes, David J. .
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2019, 484 (02) :1526-1537
[8]   An improved digital soil mapping approach to predict total N by combining machine learning algorithms and open environmental data [J].
Auzzas, Alessandro ;
Capra, Gian Franco ;
Jani, Arun Dilipkumar ;
Ganga, Antonio .
MODELING EARTH SYSTEMS AND ENVIRONMENT, 2024, 10 (05) :6519-6538
[9]   A review of the inter-correlation of climate change, air pollution and urban sustainability using novel machine learning algorithms and spatial information science [J].
Balogun, Abdul-Lateef ;
Tella, Abdulwaheed ;
Baloo, Lavania ;
Adebisi, Naheem .
URBAN CLIMATE, 2021, 40
[10]   The Megacity Lagos and Three Decades of Urban Heat Island Growth [J].
Bassett, R. ;
Young, P. J. ;
Blair, G. S. ;
Samreen, F. ;
Simm, W. .
JOURNAL OF APPLIED METEOROLOGY AND CLIMATOLOGY, 2020, 59 (12) :2041-2055