A Virtual Sensing Concept for Nitrogen and Phosphorus Monitoring Using Machine Learning Techniques

被引:10
作者
Paepae, Thulane [1 ]
Bokoro, Pitshou N. [1 ]
Kyamakya, Kyandoghere [2 ]
机构
[1] Univ Johannesburg, Dept Elect & Elect Engn Technol, ZA-2028 Doornfontein, South Africa
[2] Alpen Adria Univ Klagenfurt, Inst Smart Syst Technol, Transportat Informat, A-9020 Klagenfurt, Austria
关键词
water quality monitoring; specification book; baseline model; accuracy benchmark; data scaling; missing values handling; surrogate parameters; soft-sensor; machine learning; WATER-QUALITY; EUTROPHICATION; SENSOR; NETWORKS; OUTLIERS; NITRATE; MODELS;
D O I
10.3390/s22197338
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Harmful cyanobacterial bloom (HCB) is problematic for drinking water treatment, and some of its strains can produce toxins that significantly affect human health. To better control eutrophication and HCB, catchment managers need to continuously keep track of nitrogen (N) and phosphorus (P) in the water bodies. However, the high-frequency monitoring of these water quality indicators is not economical. In these cases, machine learning techniques may serve as viable alternatives since they can learn directly from the available surrogate data. In the present work, a random forest, extremely randomized trees (ET), extreme gradient boosting, k-nearest neighbors, a light gradient boosting machine, and bagging regressor-based virtual sensors were used to predict N and P in two catchments with contrasting land uses. The effect of data scaling and missing value imputation were also assessed, while the Shapley additive explanations were used to rank feature importance. A specification book, sensitivity analysis, and best practices for developing virtual sensors are discussed. Results show that ET, MinMax scaler, and a multivariate imputer were the best predictive model, scaler, and imputer, respectively. The highest predictive performance, reported in terms of R-2, was 97% in the rural catchment and 82% in an urban catchment.
引用
收藏
页数:20
相关论文
共 44 条
[1]   Effect of Data Scaling Methods on Machine Learning Algorithms and Model Performance [J].
Ahsan, Md Manjurul ;
Mahmud, M. A. Parvez ;
Saha, Pritom Kumar ;
Gupta, Kishor Datta ;
Siddique, Zahed .
TECHNOLOGIES, 2021, 9 (03)
[2]  
Badiru A.B, 2018, HDB MEASUREMENTS BEN, V53
[3]   Prediction of Nitrate and Phosphorus Concentrations Using Machine Learning Algorithms in Watersheds with Different Landuse [J].
Bhattarai, Aayush ;
Dhakal, Sandeep ;
Gautam, Yogesh ;
Bhattarai, Rabin .
WATER, 2021, 13 (21)
[4]   Health impacts from cyanobacteria harmful algae blooms: Implications for the North American Great Lakes [J].
Carmichael, Wayne W. ;
Boyer, Gregory L. .
HARMFUL ALGAE, 2016, 54 :194-212
[5]   Estimation of high frequency nutrient concentrations from water quality surrogates using machine learning methods [J].
Castrillo, Maria ;
Lopez Garcia, Alvaro .
WATER RESEARCH, 2020, 172
[6]   Chlorine Soft Sensor Based on Extreme Learning Machine for Water Quality Monitoring [J].
Djerioui, Mohamed ;
Bouamar, Mohamed ;
Ladjal, Mohamed ;
Zerguine, Azzedine .
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2019, 44 (03) :2033-2044
[7]   Eutrophication of US Freshwaters: Analysis of Potential Economic Damages [J].
Dodds, Walter K. ;
Bouska, Wes W. ;
Eitzmann, Jeffrey L. ;
Pilger, Tyler J. ;
Pitts, Kristen L. ;
Riley, Alyssa J. ;
Schloesser, Joshua T. ;
Thornbrugh, Darren J. .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2009, 43 (01) :12-19
[8]   Extremely randomized trees [J].
Geurts, P ;
Ernst, D ;
Wehenkel, L .
MACHINE LEARNING, 2006, 63 (01) :3-42
[9]   High-frequency water quality monitoring in an urban catchment: hydrochemical dynamics, primary production and implications for the Water Framework Directive [J].
Halliday, Sarah J. ;
Skeffington, Richard A. ;
Wade, Andrew J. ;
Bowes, Michael J. ;
Gozzard, Emma ;
Newman, Jonathan R. ;
Loewenthal, Matthew ;
Palmer-Felgate, Elizabeth J. ;
Jarvie, Helen P. .
HYDROLOGICAL PROCESSES, 2015, 29 (15) :3388-3407
[10]   The Water Quality of the River Enborne, UK: Observations from High-Frequency Monitoring in a Rural, Lowland River System [J].
Halliday, Sarah J. ;
Skeffington, Richard A. ;
Bowes, Michael J. ;
Gozzard, Emma ;
Newman, Jonathan R. ;
Loewenthal, Matthew ;
Palmer-Felgate, Elizabeth J. ;
Jarvie, Helen P. ;
Wade, Andrew J. .
WATER, 2014, 6 (01) :150-180