Performance prediction of an internal-circulation membrane bioreactor based on models comparison and data features analysis

被引:7
作者
Zhuang, Lipeng [1 ]
Tang, Bing [1 ,2 ]
Bin, Liying [1 ]
Li, Ping [1 ]
Huang, Shaosong [1 ]
Fu, Fenglian [1 ]
机构
[1] Guangdong Univ Technol, Sch Environm Sci & Engn, Guangzhou Key Lab Environm Catalysis & Pollut Con, Guangdong Key Lab Environm Catalysis & Hlth Risk, Guangzhou 510006, Peoples R China
[2] Guangzhou Higher Educ Mega Ctr, 100 Waihuan Xi Rd, Guangzhou 510006, Peoples R China
基金
中国国家自然科学基金;
关键词
Data-driven model; Performance prediction; Membrane bioreactor; Model comparison; Gradient boosting regression; Data feature analysis; WASTE-WATER; TREATMENT-PLANT; OPTIMIZATION; REMOVAL; DESIGN; SYSTEM;
D O I
10.1016/j.bej.2020.107850
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Complex hydrodynamic conditions and the dynamic process of multi-habitats have brought challenges for the real-time management of novel wastewater treatment techniques. Thus, based on a well-performing membrane bioreactor with internal circulation, this paper presented an intelligent data mining work of performance prediction and the analysis of data influence for efficient management. The experimental data with 440 samples and nine internal environment parameters were integrated, and the concentrations of COD, NH4+-N and total nitrogen (TN) in the effluent were used as multiple performance indicators. Then, a comprehensive testing was utilized for recognizing the most robust model from the deep neural network, regression tree, random forest, adaptive boosting and gradient boosting regression (GBR), which included the evaluations of model fitting and generalization ability. Furthermore, the feature relative importance (Imp) and principal component analysis (PCA) was applied to analyze the influence of data. The results indicated that the GBR model had a higher performance, in which, the R squared scores of 5-fold cross-validation were 0.847, 0.792 and 0.851 in predicting the concentration of COD, NH4+-N and TN, respectively. Besides, the quantification of Imp was accorded with the independent result of PCA, which indicated that the GBR model had well captured the dynamic information.
引用
收藏
页数:10
相关论文
共 29 条
  • [21] Data-driven surrogate modeling for performance prediction and sensitivity analysis of transport properties in proton exchange membrane water electrolyzers
    Raman, K. Ashoke
    Hammacher, Linus
    Kungl, Hans
    Karl, Andre
    Jodat, Eva
    Eichel, Ruediger-A.
    Karyofylli, Violeta
    APPLIED ENERGY, 2025, 386
  • [22] Study on student performance estimation, student progress analysis, and student potential prediction based on data mining
    Yang, Fan
    Li, Frederick W. B.
    COMPUTERS & EDUCATION, 2018, 123 : 97 - 108
  • [23] Performance comparison of physical process-based and data-driven models: a case study on the Edwards Aquifer, USA
    Zhang, Andi
    Winterle, James
    Yang, Changbing
    HYDROGEOLOGY JOURNAL, 2020, 28 (06) : 2025 - 2037
  • [24] Performance prediction and operating parameters optimization for proton exchange membrane fuel cell based on data-driven surrogate model and particle swarm optimization
    Zhang, Ning
    Wang, Hui
    Chen, Wenshang
    Zhou, Haoran
    Meng, Kai
    Chen, Ben
    INTERNATIONAL JOURNAL OF HYDROGEN ENERGY, 2024, 69 : 493 - 503
  • [25] Data-Driven Urban Traffic Accident Analysis and Prediction Using Logit and Machine Learning-Based Pattern Recognition Models
    Najafi Moghaddam Gilani, Vahid
    Hosseinian, Seyed Mohsen
    Ghasedi, Meisam
    Nikookar, Mohammad
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [26] Comparative analysis of thermal preference prediction performance in different conditions using ensemble learning models based on ASHRAE Comfort Database II
    Bai, Yan
    Liu, Kai
    Wang, Yuying
    BUILDING AND ENVIRONMENT, 2022, 223
  • [27] Comparison of Latch-Based and Switch-Based, Sampled-Data, Three-Phase, PWM, Voltage- Source Inverter Models for Dynamic Analysis
    Kikuchi, Jun
    Wolf, Chris
    Degner, Michael
    2012 IEEE ENERGY CONVERSION CONGRESS AND EXPOSITION (ECCE), 2012, : 4619 - 4626
  • [28] How to improve the performance of China's energy-transport-economy-environment system: An analysis based on new strategy parallel-series input-output data envelopment analysis models
    Chen, Shanshan
    Zhang, Ruchuan
    Li, Peiwen
    Li, Aijun
    ENERGY, 2023, 281
  • [29] Development and validation of a set of six adaptable prognosis prediction (SAP) models based on time-series real-world big data analysis for patients with cancer receiving chemotherapy: A multicenter case crossover study
    Uneno, Yu
    Taneishi, Kei
    Kanai, Masashi
    Okamoto, Kazuya
    Yamamoto, Yosuke
    Yoshioka, Akira
    Hiramoto, Shuji
    Nozaki, Akira
    Nishikawa, Yoshitaka
    Yamaguchi, Daisuke
    Tomono, Teruko
    Nakatsui, Masahiko
    Baba, Mika
    Morita, Tatsuya
    Matsumoto, Shigemi
    Kuroda, Tomohiro
    Okuno, Yasushi
    Muto, Manabu
    PLOS ONE, 2017, 12 (08):