Machine Learning-Based Water Quality Classification Assessment

被引:1
|
作者
Chen, Wenliang [1 ]
Xu, Duo [2 ]
Pan, Bowen [3 ]
Zhao, Yuan [4 ]
Song, Yan [1 ]
机构
[1] Liaoning Univ, Coll Phys, Measuring Technol & Instruments, Chongshan Campus, Shenyang 110036, Peoples R China
[2] Shenyang Ceprei Technol Serv Co Ltd, Shenyang 110015, Peoples R China
[3] Liaoshen Ind Grp Co Ltd, Shenyang 110801, Peoples R China
[4] Shipbuilding Equipment & Mat Northeast China Co Lt, Shenyang 100013, Peoples R China
关键词
water quality classification; GBDT; MLP; feature-weighted attention mechanism; PERFORMANCE; INDEX; MODEL;
D O I
10.3390/w16202951
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Water is a vital resource, and its quality has a direct impact on human health. Groundwater, as one of the primary water sources, requires careful monitoring to ensure its safety. Although manual methods for testing water quality are accurate, they are often time-consuming, costly, and inefficient when dealing with large and complex data sets. In recent years, machine learning has become an effective alternative for water quality assessment. However, current approaches still face challenges, such as the limited performance of individual models, minimal improvements from optimization algorithms, lack of dynamic feature weighting mechanisms, and potential information loss when simplifying model inputs. To address these challenges, this paper proposes a hybrid model, BS-MLP, which combines GBDT (gradient-boosted decision tree) and MLP (multilayer perceptron). The model leverages GBDT's strength in feature selection and MLP's capability to manage nonlinear relationships, enabling it to capture complex interactions between water quality parameters. We employ Bayesian optimization to fine-tune the model's parameters and introduce a feature-weighting attention mechanism to develop the BS-FAMLP model, which dynamically adjusts feature weights, enhancing generalization and classification accuracy. In addition, a comprehensive parameter selection strategy is employed to maintain data integrity. These innovations significantly improve the model's classification performance and efficiency in handling complex water quality environments and imbalanced datasets. This model was evaluated using a publicly available groundwater quality dataset consisting of 188,623 samples, each with 15 water quality parameters and corresponding labels. The BS-FAMLP model shows strong classification performance, with optimized hyperparameters and an adjusted feature-weighting attention mechanism. Specifically, it achieved an accuracy of 0.9616, precision of 0.9524, recall of 0.9655, F1 Score of 0.9589, and an AUC score of 0.9834 on the test set. Compared to single models, classification accuracy improved by approximately 10%, and when compared to other hybrid models with additional attention mechanisms, BS-FAMLP achieved an optimal balance between classification performance and computational efficiency. The core objective of this study is to utilize the acquired water quality parameter data for efficient classification and assessment of water samples, with the aim of streamlining traditional laboratory-based water quality analysis processes. By developing a reliable water quality classification model, this research provides robust technical support for water safety management.
引用
收藏
页数:29
相关论文
共 50 条
  • [1] AQuA-P: A machine learning-based tool for water quality assessment
    Diaz-Gonzalez, L.
    Aguilar-Rodriguez, R. A.
    Perez-Sansalvador, J. C.
    Lakouari, N.
    JOURNAL OF CONTAMINANT HYDROLOGY, 2025, 269
  • [2] A machine learning-based assessment of subjective quality of life
    Rodriguez, Sebastian
    Cabrera-Barona, Pablo
    JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2024, 7 (01): : 451 - 467
  • [3] Banana and Guava dataset for machine learning and deep learning-based quality classification
    Kumari, Abiban
    Singh, Jaswinder
    DATA IN BRIEF, 2024, 57
  • [4] Machine learning-based classification of quality grades for concrete vibration behaviour
    Fan, Shuai
    He, Tao
    Li, Weihao
    Zeng, Chuang
    Chen, Peng
    Chen, Lufeng
    Shu, Jiangpeng
    AUTOMATION IN CONSTRUCTION, 2024, 167
  • [5] Smart IoT and Machine Learning-based Framework for Water Quality Assessment and Device Component Monitoring
    Akashdeep Bhardwaj
    Vishal Dagar
    Muhammad Owais Khan
    Akarsh Aggarwal
    Rafael Alvarado
    Manoj Kumar
    Muhammad Irfan
    Ram Proshad
    Environmental Science and Pollution Research, 2022, 29 : 46018 - 46036
  • [6] Smart IoT and Machine Learning-based Framework for Water Quality Assessment and Device Component Monitoring
    Bhardwaj, Akashdeep
    Dagar, Vishal
    Khan, Muhammad Owais
    Aggarwal, Akarsh
    Alvarado, Rafael
    Kumar, Manoj
    Irfan, Muhammad
    Proshad, Ram
    ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2022, 29 (30) : 46018 - 46036
  • [7] Machine learning for water quality classification
    Abuzir, Saleh Y.
    Abuzir, Yousef S.
    WATER QUALITY RESEARCH JOURNAL, 2022, 57 (03) : 152 - 164
  • [8] A machine learning-based method for protein global model quality assessment
    Dong, Qiwen
    Chen, Yufei
    Zhou, Shuigeng
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2011, 40 (04) : 417 - 425
  • [9] Machine learning-based soft-sensor development for road quality classification
    Nagy, Roland
    Kummer, Alex
    Abonyi, Janos
    Szalai, Istvan
    JOURNAL OF VIBRATION AND CONTROL, 2024, 30 (11-12) : 2672 - 2684
  • [10] Machine Learning-Based Network Attack Classification
    Liang, Tianhong
    Ma, Li
    Wang, Zhichuang
    Hou, Fangyuan
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 2392 - 2397