Next generation pure component property estimation models: With and without machine learning techniques

被引:52
|
作者
Alshehri, Abdulelah S. [1 ,2 ]
Tula, Anjan K. [3 ]
You, Fengqi [1 ]
Gani, Rafiqul [4 ]
机构
[1] Cornell Univ, Robert Frederick Smith Sch Chem & Biomol Engn, Ithaca, NY USA
[2] King Saud Univ, Coll Engn, Dept Chem Engn, Riyadh, Saudi Arabia
[3] Zhejiang Univ, Coll Control Sci & Engn, Hangzhou, Peoples R China
[4] Korea Adv Inst Sci & Technol KAIST, Dept Chem & Biomol Engn, Daejeon, South Korea
关键词
data analysis; group-contribution; machine learning; pure component property prediction; GAUSSIAN-PROCESSES; ORGANIC-COMPOUNDS; DESIGN; PREDICTION; SELECTION; PRODUCT;
D O I
10.1002/aic.17469
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Physiochemical properties of pure components serve as the basis for the design and simulation of chemical products and processes. Models based on the molecular structural information of chemicals for the following 25 pure component properties are presented in this work: (critical-) temperature, pressure, volume, acentric factor; (normal-) boiling point, melting point, auto-ignition temperature; flash point; (standard-) enthalpy of formation, Gibbs energy of formation, enthalpy of fusion, enthalpy of vaporization, liquid molar volume; (environmental-) (lethal dose-) LC50 and LD50, photo-chemical oxidation potential, bioconcentration factor, permissible exposure limit; (physicochemical-) acid dissociation constant, water-solubility, octanol-water partition coefficient, Hildebrandt solubility parameter, Hansen solubility parameters. Utilizing functional groups for molecular representation, two parallel property estimation models where the group contributions for each property are regressed through traditional regression techniques and machine learning techniques are presented. Both techniques use an a priori data analysis before regression of model parameters. A dataset with more than 24,000 chemicals for the 25 pure component properties has been utilized for the development of the two sets of property models. The efficacy of the developed models and their use are highlighted together with a discussion on the overall performance, application range, and predictive capabilities with implications to product and/or process engineering problem solutions.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Review on Machine Learning Techniques for Developing Pavement Performance Prediction Models
    Justo-Silva, Rita
    Ferreira, Adelino
    Flintsch, Gerardo
    SUSTAINABILITY, 2021, 13 (09)
  • [32] Dengue models based on machine learning techniques: A systematic literature review
    Hoyos, William
    Aguilar, Jose
    Toro, Mauricio
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2021, 119
  • [33] Development of machine learning models based on air temperature for estimation of global solar radiation in India
    Husain, Shahid
    Khan, Uzair Ali
    ENVIRONMENTAL PROGRESS & SUSTAINABLE ENERGY, 2022, 41 (04)
  • [34] Multi-models machine learning methods for traffic flow estimation from Car Data
    Li, Jinjian
    Boonaert, Jacques
    Doniec, Arnaud
    Lozenguez, Guillaume
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2021, 132
  • [35] Large property models: a new generative machine-learning formulation for molecules
    Jin, Tianfan
    Singla, Veerupaksh
    Hsu, Hsuan-Hao
    Savoie, Brett M.
    FARADAY DISCUSSIONS, 2025, 256 (00) : 104 - 119
  • [36] Finding Transcripts Associated with Prostate Cancer Gleason Stages Using Next Generation Sequencing and Machine Learning Techniques
    Hamzeh, Osama
    Alkhateeb, Abedalrhman
    Rezaeian, Iman
    Karkar, Aram
    Rueda, Luis
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING, IWBBIO 2017, PT II, 2017, 10209 : 337 - 348
  • [37] Next Generation ECG: The Impact of Artificial Intelligence and Machine Learning
    Adasuriya, Gamith
    Haldar, Shouvik
    CURRENT CARDIOVASCULAR RISK REPORTS, 2023, 17 (08) : 143 - 154
  • [38] Next Generation ECG: The Impact of Artificial Intelligence and Machine Learning
    Gamith Adasuriya
    Shouvik Haldar
    Current Cardiovascular Risk Reports, 2023, 17 : 143 - 154
  • [39] Machine learning techniques in estimation of eggplant crop evapotranspiration
    Bilal Cemek
    Sevda Tasan
    Aslıhan Canturk
    Mehmet Tasan
    Halis Simsek
    Applied Water Science, 2023, 13
  • [40] Machine learning techniques in estimation of eggplant crop evapotranspiration
    Cemek, Bilal
    Tasan, Sevda
    Canturk, Aslihan
    Tasan, Mehmet
    Simsek, Halis
    APPLIED WATER SCIENCE, 2023, 13 (06)