Machine learning-based atom contribution method for the prediction of surface charge density profiles and solvent design

被引:38
作者
Liu, Qilei [1 ]
Zhang, Lei [1 ]
Tang, Kun [1 ]
Liu, Linlin [1 ]
Du, Jian [1 ]
Meng, Qingwei [2 ,3 ]
Gani, Rafiqul [4 ,5 ]
机构
[1] Dalian Univ Technol, Sch Chem Engn, Inst Chem Proc Syst Engn, Dalian 116024, Peoples R China
[2] Dalian Univ Technol, Sch Pharmaceut Sci & Technol, State Key Lab Fine Chem, Dalian, Peoples R China
[3] Dalian Univ Technol, Ningbo Inst, Ningbo, Peoples R China
[4] PSE SPEED, Skyttemosen 6, Allerod, Denmark
[5] Korea Adv Inst Sci & Technol KAIST, Dept Chem & Biomol Engn, Daejeon, South Korea
基金
中国国家自然科学基金;
关键词
atom contribution; computer-aided molecular design; decomposition-based algorithm; machine learning; surface charge density profiles (sigma-profiles); AIDED MOLECULAR DESIGN; FRAMEWORK; IBUPROFEN; CHEMISTRY; DATABASE; PRODUCT; CAMD; TOOL;
D O I
10.1002/aic.17110
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Solvents are widely used in chemical processes. The use of efficient model-based solvent selection techniques is an option worth considering for rapid identification of candidates with better economic, environment and human health properties. In this paper, an optimization-based MLAC-CAMD framework is established for solvent design, where a novel machine learning-based atom contribution method is developed to predict molecular surface charge density profiles (sigma-profiles). In this method, weighted atom-centered symmetry functions are associated with atomic sigma-profiles using a high-dimensional neural network model, successfully leading to a higher prediction accuracy in molecular sigma-profiles and better isomer identifications compared with group contribution methods. The new method is integrated with the computer-aided molecular design technique by formulating and solving a mixed-integer nonlinear programming model, where model complexities are managed with a decomposition-based strategy. Finally, two case studies involving crystallization and reaction are presented to highlight the wide applicability and effectiveness of the MLAC-CAMD framework.
引用
收藏
页数:13
相关论文
共 31 条
  • [21] Group Contribution Prediction of Surface Charge Density Distribution of Molecules for COSMO-SAC
    Mu, Tiancheng
    Rarey, Juergen
    Gmehling, Juergen
    [J]. AICHE JOURNAL, 2009, 55 (12) : 3298 - 3300
  • [22] Sigma-profile database for using COSMO-based thermodynamic methods
    Mullins, Eric
    Oldland, Richard
    Liu, Y. A.
    Wang, Shu
    Sandler, Stanley I.
    Chen, Chau-Chyun
    Zwolak, Michael
    Seavey, Kevin C.
    [J]. INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2006, 45 (12) : 4389 - 4415
  • [23] Open Babel: An open chemical toolbox
    O'Boyle, Noel M.
    Banck, Michael
    James, Craig A.
    Morley, Chris
    Vandermeersch, Tim
    Hutchison, Geoffrey R.
    [J]. JOURNAL OF CHEMINFORMATICS, 2011, 3
  • [24] ODELE O, 1993, FLUID PHASE EQUILIBR, V82, P47, DOI 10.1016/0378-3812(93)87127-M
  • [25] Reichardt C., 2010, Solvents and Solvent Effects in Organic Chemistry, P165
  • [26] COSMO-CAMD: A framework for optimization-based computer-aided molecular design using COSMO-RS
    Scheffczyk, Jan
    Fleitmann, Lorenz
    Schwarz, Annett
    Lampe, Matthias
    Bardow, Andre
    Leonhard, Kai
    [J]. CHEMICAL ENGINEERING SCIENCE, 2017, 159 : 84 - 92
  • [27] Struebing H, 2013, NAT CHEM, V5, P952, DOI [10.1038/NCHEM.1755, 10.1038/nchem.1755]
  • [28] Chemistry with ADF
    te Velde, G
    Bickelhaupt, FM
    Baerends, EJ
    Guerra, CF
    Van Gisbergen, SJA
    Snijders, JG
    Ziegler, T
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 2001, 22 (09) : 931 - 967
  • [29] Solubilities of Ibuprofen in Different Pure Solvents
    Wang, Shui
    Song, Zhiqian
    Wang, Jidong
    Dong, Yongli
    Wu, Ming
    [J]. JOURNAL OF CHEMICAL AND ENGINEERING DATA, 2010, 55 (11) : 5283 - 5285
  • [30] New Vistas in Chemical Product and Process Design
    Zhang, Lei
    Babi, Deenesh K.
    Gani, Rafiqul
    [J]. ANNUAL REVIEW OF CHEMICAL AND BIOMOLECULAR ENGINEERING, VOL 7, 2016, 7 : 557 - 582