A novel robust principal component analysis-multivariate adaptive regression splines approach for BOD, COD, and NH3-N determination in wastewater

Times cited: 0
Authors
Rijab, Sanaa [1]
Khorrami, Mohammadreza Khanmohammadi [1]
Mohammadi, Mahsa [1]
Affiliations
[1] Imam Khomeini Int Univ, Fac Sci, Dept Chem, Qazvin, Iran
Keywords
Robust principal component analysis; Multivariate adaptive regression splines; BOD; COD; NH3-N; Wastewater;
DOI
10.1007/s13738-024-03170-z
Chinese Library Classification (CLC) number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Wastewater is one of the largest sources of environmental contamination and can impede global sustainable development. Visible-near infrared (Vis-NIR) spectroscopy can support the management, efficiency, and prudent use of water resources. However, noise and the high dimensionality of spectral data frequently limit the accuracy of spectral models for water quality parameters. This work applies an rPCA-MARS model to Vis-NIR spectral data as a novel analytical technique for estimating biological oxygen demand (BOD), chemical oxygen demand (COD), and NH3-N in wastewater. The spectral data are first subjected to the rPCA algorithm to obtain principal component (PC) scores, and the MARS model is then built with six PC scores as its input variables. Piecewise-linear and piecewise-cubic MARS models are used to establish a mathematical relationship between the BOD, COD, and NH3-N content (Y) and the data matrix (X). The duplex algorithm is used to select the calibration and prediction sets from the data matrix: the rPCA-MARS model is calibrated on a set of 42 samples, and an independent test set of 16 samples is then used to evaluate its performance. Before the rPCA-MARS model is run, the spectral data are preprocessed with moving average smoothing and the standard normal variate (SNV) transformation. The coefficient of determination (R²), adjusted R² (R²adj), R² estimated by generalized cross-validation (R²GCV), and mean square error (MSE) were used to assess the effectiveness of the rPCA-MARS model. Both the piecewise-linear and the piecewise-cubic rPCA-MARS models demonstrated excellent performance for BOD, COD, and NH3-N determination on the calibration and test sets. High R² values (> 0.93) in both datasets indicate a strong correlation between predicted and observed values. Additionally, the high adjusted R² (0.93) suggests that the model effectively avoids overfitting. Furthermore, the relatively high R²GCV (0.90) confirms both the model's accuracy and generalizability.
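The preprocessing steps and evaluation metrics named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The window size, the edge handling of the moving average, the use of the sample standard deviation in SNV, and all function names are assumptions made for illustration. The rPCA step and the full MARS fitting procedure are not shown. Only the hinge basis function that a piecewise-linear MARS model is built from is included.

```python
from statistics import mean, stdev


def moving_average(spectrum, window=5):
    """Centered moving-average smoothing (window size is an assumed value).

    Near the edges the window shrinks to the points that are available.
    """
    half = window // 2
    n = len(spectrum)
    smoothed = []
    for i in range(n):
        lo = max(0, i - half)
        hi = min(n, i + half + 1)
        smoothed.append(sum(spectrum[lo:hi]) / (hi - lo))
    return smoothed


def snv(spectrum):
    """Standard normal variate: center and scale one spectrum by its own
    mean and (sample) standard deviation."""
    m = mean(spectrum)
    s = stdev(spectrum)
    return [(x - m) / s for x in spectrum]


def hinge(x, knot, sign=1):
    """MARS hinge basis function: max(0, x - knot) for sign=+1,
    max(0, knot - x) for sign=-1."""
    return max(0.0, sign * (x - knot))


def mse(y_true, y_pred):
    """Mean square error between observed and predicted values."""
    return sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true)


def r2(y_true, y_pred):
    """Coefficient of determination, 1 - SS_res / SS_tot."""
    y_bar = mean(y_true)
    ss_res = sum((a - b) ** 2 for a, b in zip(y_true, y_pred))
    ss_tot = sum((a - y_bar) ** 2 for a in y_true)
    return 1.0 - ss_res / ss_tot


def adjusted_r2(r2_value, n_samples, n_terms):
    """Adjusted R-squared; n_terms is the number of model terms excluding
    the intercept (how terms are counted for MARS is an assumption here)."""
    return 1.0 - (1.0 - r2_value) * (n_samples - 1) / (n_samples - n_terms - 1)


if __name__ == "__main__":
    raw = [0.10, 0.14, 0.11, 0.18, 0.21, 0.19, 0.25]  # made-up absorbances
    processed = snv(moving_average(raw, window=3))
    print(processed)
```

In this sketch each spectrum is smoothed first and then SNV-transformed, following the order in which the abstract lists the two steps. The processed spectra would then go to the rPCA step to obtain PC scores.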
Pages: 575-587
Number of pages: 13