Prediction of Total Phosphorus Based on Distance Correlation and Machine Learning Methods-a Case Study of Dongjiang River, China

被引:4
作者
Huang, Yongkai [1 ]
Chen, Yiling [1 ]
机构
[1] Guangdong Univ Technol, Sch Ecol Environm & Resources, Guangzhou 511400, Guangdong, Peoples R China
关键词
Water quality prediction; Total phosphorus; Feature selection tools; Machine learning; WATER-QUALITY PARAMETERS; MODEL; LAKE;
D O I
10.1007/s11270-024-06913-z
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Elevated total phosphorus (TP) leads to water eutrophication, affecting aquatic life and ecosystems. Effective control of phosphorus concentration in rivers is vital. Detecting TP and other nutrients is essential for evaluating river health. Establishing real-time TP prediction is key for studying eutrophication and blooms. In this study, we compared the prediction models combined with two different feature selection tools, Pearson and distance correlation coefficient, and evaluated them with four evaluation indicators. It was determined that the assessment outcomes of the distance correlation coefficient yielded heightened precision. Notably, the random forest validation contributed to an impressive 8.3% enhancement in the performance R2 of the model, and the utilization of BP neural network validation resulted in an improvement of 2.7% in the model's performance. A real-time TP prediction model using distance correlation coefficient was eventually created, which reaches an accuracy up to 82.6%. The final model was applied to field data, revealing elevated TP levels downstream of Dongjiang River in China due to high population density and numerous factories. Additionally, the results indicated that the distance correlation coefficient outperformed the Pearson correlation coefficient in predicting the total phosphorus in Dongjiang River. Our findings suggest that combining distance correlation coefficients with machine learning holds promise for more accurate water quality prediction models. These findings establish a foundation for enhancing the efficacy of the water quality prediction model.
引用
收藏
页数:14
相关论文
共 45 条
[1]   Feature selection in machine learning: A new perspective [J].
Cai, Jie ;
Luo, Jiawei ;
Wang, Shulin ;
Yang, Sheng .
NEUROCOMPUTING, 2018, 300 :70-79
[2]   Prediction of Oxidant Exposures and Micropollutant Abatement during Ozonation Using a Machine Learning Method [J].
Cha, Dongwon ;
Park, Sanghun ;
Kim, Min Sik ;
Kim, Taewan ;
Hong, Seok Won ;
Cho, Kyung Hwa ;
Lee, Changha .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2021, 55 (01) :709-718
[3]  
Chen H. L., 2022, Sustainability, V14
[4]   Comprehensive assessment of water environmental carrying capacity for sustainable watershed development [J].
Chen, Shuying ;
He, Yanhu ;
Tan, Qian ;
Hu, Kejia ;
Zhang, Tianyuan ;
Zhang, Shan .
JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2022, 303
[5]   Water Quality Prediction Model of a Water Diversion Project Based on the Improved Artificial Bee Colony-Backpropagation Neural Network [J].
Chen, Siyu ;
Fang, Guohua ;
Huang, Xianfeng ;
Zhang, Yuhong .
WATER, 2018, 10 (06)
[6]   On relationships between the Pearson and the distance correlation coefficients [J].
Edelmann, Dominic ;
Mori, Tamas F. ;
Szekely, Gabor J. .
STATISTICS & PROBABILITY LETTERS, 2021, 169
[8]   Application of machine learning methods for the prediction of organic solid waste treatment and recycling processes: A review [J].
Guo, Hao-nan ;
Wu, Shu-biao ;
Tian, Ying-jie ;
Zhang, Jun ;
Liu, Hong-tao .
BIORESOURCE TECHNOLOGY, 2021, 319
[9]   Monthly precipitation prediction based on the EMD-VMD-LSTM coupled model [J].
Guo, Shaolei ;
Sun, Shifeng ;
Zhang, Xianqi ;
Chen, Haiyang ;
Li, Haiyang .
WATER SUPPLY, 2023, 23 (11) :4742-4758
[10]   Detecting and explaining long-term changes in river water quality in south-eastern Australia [J].
He, Ziming ;
Yao, Jiayu ;
Lu, Yancen ;
Guo, Danlu .
HYDROLOGICAL PROCESSES, 2022, 36 (11)