Universal machine-learning algorithm for predicting adsorption performance of organic molecules based on limited data set: Importance of feature description

被引:6
作者
Huang, Chaoyi [1 ]
Gao, Wenyang [2 ]
Zheng, Yingdie [1 ]
Wang, Wei [1 ]
Zhang, Yue [2 ]
Liu, Kai [1 ]
机构
[1] Westlake Univ, Coll Engn, Div Environm & Resources, Hangzhou 310024, Zhejiang, Peoples R China
[2] Westlake Univ, Coll Engn, Div Arti fi cial Intelligence & Data Sci, Hangzhou 310024, Zhejiang, Peoples R China
关键词
Water treatment; Organic contaminants; Artificial intelligence; Adsorption isotherm; ACTIVATED-CHARCOAL; AQUEOUS-SOLUTION; PHARMACEUTICALS; CARBON; METOPROLOL;
D O I
10.1016/j.scitotenv.2022.160228
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Adsorption of organic molecules from aqueous solution offers a simple and effective method for their removal. Re-cently, there have been several attempts to apply machine learning (ML) for this problem. To this end, polyparameter linear free energy relationships (pp-LFERs) were employed, and poor prediction results were observed outside model applicability domain of pp-LFERs. In this study, we improved the applicability of ML methods by adopting a chemical -structure (CS) based approach. We used the prediction of adsorption of organic molecules on carbon-based adsorbents as an example. Our results show that this approach can fully differentiate the structural differences between any or-ganic molecules, while providing significant information that is relevant to their interaction with the adsorbents. We compared two CS feature descriptors: 3D-coordination and simplified molecular-input line-entry system (SMILES). We then built CS-ML models based on neural networks (NN) and extreme gradient boosting (XGB). They all outperformed pp-LFERs based models and are capable to accurately predict adsorption isotherm of isomers with similar physiochemical properties such as chiral molecules, even though they are trained with achiral molecules and race -mates. We found for predicting adsorption isotherm, XGB shows better performance than NN, and 3D-coordinations allow effective differentiation between organic molecules.
引用
收藏
页数:11
相关论文
共 46 条
  • [1] Predictive Model Development for Adsorption of Aromatic Contaminants by Multi-Walled Carbon Nanotubes
    Apul, Onur G.
    Wang, Qiliang
    Shao, Ting
    Rieck, James R.
    Karanfil, Tanju
    [J]. ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2013, 47 (05) : 2295 - 2303
  • [2] Predicting Depression From Smartphone Behavioral Markers Using Machine Learning Methods, Hyperparameter Optimization, and Feature Importance Analysis: Exploratory Study
    Asare, Kennedy Opoku
    Terhorst, Yannik
    Vega, Julio
    Peltonen, Ella
    Lagerspetz, Eemil
    Ferreira, Denzil
    [J]. JMIR MHEALTH AND UHEALTH, 2021, 9 (07):
  • [3] Belhamdi Badreddine, 2016, J. appl. res. technol, V14, P354, DOI 10.1016/j.jart.2016.08.004
  • [4] Phenol removal from aqueous solution by adsorption and ion exchange mechanisms onto polymeric resins
    Caetano, Michelle
    Valderrama, Cesar
    Farran, Adriana
    Luis Cortina, Jose
    [J]. JOURNAL OF COLLOID AND INTERFACE SCIENCE, 2009, 338 (02) : 402 - 409
  • [5] L- and D-Proline Adsorption by Chiral Ordered Mesoporous Silica
    Casado, Clara
    Castan, Joaquin
    Gracia, Ismael
    Yus, Miriam
    Mayoral, Alvaro
    Sebastian, Victor
    Lopez-Ram-de-Viu, Pilar
    Uriel, Santiago
    Coronas, Joaquin
    [J]. LANGMUIR, 2012, 28 (16) : 6638 - 6644
  • [6] Applications of nanomaterials in enantioseparation and related techniques
    Chang, Cuilan
    Wang, Xin
    Bai, Yu
    Liu, Huwei
    [J]. TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2012, 39 : 195 - 206
  • [7] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [8] Development of a wide-range soft sensor for predicting wastewater BOD5 using an eXtreme gradient boosting (XGBoost) machine
    Ching, P. M. L.
    Zou, X.
    Wu, Di
    So, R. H. Y.
    Chen, G. H.
    [J]. ENVIRONMENTAL RESEARCH, 2022, 210
  • [9] Predicting formation of haloacetic acids by chlorination of organic compounds using machine-learning-assisted quantitative structure-activity relationships
    Cordero, Jose Andres
    He, Kai
    Janya, Kanjira
    Echigo, Shinya
    Itoh, Sadahiko
    [J]. JOURNAL OF HAZARDOUS MATERIALS, 2021, 408 (408)
  • [10] Quantitative structure property relationships for the adsorption of pharmaceuticals onto activated carbon
    Dickenson, E. R. V.
    Drewes, J. E.
    [J]. WATER SCIENCE AND TECHNOLOGY, 2010, 62 (10) : 2270 - 2276