Next generation pure component property estimation models: With and without machine learning techniques

被引:52
|
作者
Alshehri, Abdulelah S. [1 ,2 ]
Tula, Anjan K. [3 ]
You, Fengqi [1 ]
Gani, Rafiqul [4 ]
机构
[1] Cornell Univ, Robert Frederick Smith Sch Chem & Biomol Engn, Ithaca, NY USA
[2] King Saud Univ, Coll Engn, Dept Chem Engn, Riyadh, Saudi Arabia
[3] Zhejiang Univ, Coll Control Sci & Engn, Hangzhou, Peoples R China
[4] Korea Adv Inst Sci & Technol KAIST, Dept Chem & Biomol Engn, Daejeon, South Korea
关键词
data analysis; group-contribution; machine learning; pure component property prediction; GAUSSIAN-PROCESSES; ORGANIC-COMPOUNDS; DESIGN; PREDICTION; SELECTION; PRODUCT;
D O I
10.1002/aic.17469
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Physiochemical properties of pure components serve as the basis for the design and simulation of chemical products and processes. Models based on the molecular structural information of chemicals for the following 25 pure component properties are presented in this work: (critical-) temperature, pressure, volume, acentric factor; (normal-) boiling point, melting point, auto-ignition temperature; flash point; (standard-) enthalpy of formation, Gibbs energy of formation, enthalpy of fusion, enthalpy of vaporization, liquid molar volume; (environmental-) (lethal dose-) LC50 and LD50, photo-chemical oxidation potential, bioconcentration factor, permissible exposure limit; (physicochemical-) acid dissociation constant, water-solubility, octanol-water partition coefficient, Hildebrandt solubility parameter, Hansen solubility parameters. Utilizing functional groups for molecular representation, two parallel property estimation models where the group contributions for each property are regressed through traditional regression techniques and machine learning techniques are presented. Both techniques use an a priori data analysis before regression of model parameters. A dataset with more than 24,000 chemicals for the 25 pure component properties has been utilized for the development of the two sets of property models. The efficacy of the developed models and their use are highlighted together with a discussion on the overall performance, application range, and predictive capabilities with implications to product and/or process engineering problem solutions.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] An Improved Machine Learning Model for Pure Component Property Estimation
    Cao, Xinyu
    Gong, Ming
    Tula, Anjan
    Chen, Xi
    Gani, Rafiqul
    Venkatasubramanian, Venkat
    ENGINEERING, 2024, 39 : 61 - 73
  • [2] Applied Machine Learning for Developing Next-Generation Functional Materials
    Dinic, Filip
    Singh, Kamalpreet
    Dong, Tony
    Rezazadeh, Milad
    Wang, Zhibo
    Khosrozadeh, Ali
    Yuan, Tiange
    Voznyy, Oleksandr
    ADVANCED FUNCTIONAL MATERIALS, 2021, 31 (51)
  • [3] Applications of machine learning techniques in next-generation optical WDM networks
    Rai, Saloni
    Garg, Amit Kumar
    JOURNAL OF OPTICS-INDIA, 2022, 51 (03): : 772 - 781
  • [4] Applications of machine learning techniques in next-generation optical WDM networks
    Saloni Rai
    Amit Kumar Garg
    Journal of Optics, 2022, 51 : 772 - 781
  • [5] Next-Generation Machine Learning for Biological Networks
    Camacho, Diogo M.
    Collins, Katherine M.
    Powers, Rani K.
    Costello, James C.
    Collins, James J.
    CELL, 2018, 173 (07) : 1581 - 1592
  • [6] Machine learning driven models for microhardness estimation of composite materials
    Rezvanova, A. E.
    Kochergin, M. I.
    Luginin, N. A.
    Chebodaeva, V. V.
    RUSSIAN PHYSICS JOURNAL, 2025, : 113 - 121
  • [7] Designing the next generation of polymers with machine learning and physics-based models
    Chew, Alex K.
    Afzal, Mohammad Atif Faiz
    Chandrasekaran, Anand
    Kamps, Jan Henk
    Ramakrishnan, Vaidya
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2024, 5 (04):
  • [8] Machine Learning Models for Software Cost Estimation
    Al Asheeri, Mahmood Mohd
    Hammad, Mustafa
    2019 INTERNATIONAL CONFERENCE ON INNOVATION AND INTELLIGENCE FOR INFORMATICS, COMPUTING, AND TECHNOLOGIES (3ICT), 2019,
  • [9] A taxonomy of machine learning techniques for construction cost estimation
    Karadimos, Panagiotis
    Anthopoulos, Leonidas
    INNOVATIVE INFRASTRUCTURE SOLUTIONS, 2024, 9 (11)
  • [10] Estimation of Heavy Metal Content in Soil Based on Machine Learning Models
    Shi, Shuaiwei
    Hou, Meiyi
    Gu, Zifan
    Jiang, Ce
    Zhang, Weiqiang
    Hou, Mengyang
    Li, Chenxi
    Xi, Zenglei
    LAND, 2022, 11 (07)