Machine learning methods to predict the crystallization propensity of small organic molecules

Cited by: 11
Author
Pereira, Florbela [1 ,2 ]
Affiliations
[1] Univ Nova Lisboa, Fac Ciencias & Tecnol, Dept Quim, LAQV, Caparica, Portugal
[2] Univ Nova Lisboa, Fac Ciencias & Tecnol, Dept Quim, REQUIMTE, Caparica, Portugal
Keywords
CLASSIFICATION; STABILITY; TENDENCY
DOI
10.1039/d0ce00070a
Chinese Library Classification
O6 [Chemistry]
Discipline classification code
0703
Abstract
Machine learning (ML) algorithms were explored for predicting crystallization propensity from molecular descriptors and fingerprints generated from 2D chemical structures, together with 3D molecular descriptors computed from 3D chemical structures optimized with empirical methods. In total, 57 815 molecules were retrieved from the Reaxys® database, of which 53 998 are recorded as crystalline (class A), 3097 as polymorphic (class B), and 720 as amorphous (class C). A training set of 40 462 organic molecules was used to build the models, which were validated with an external test set of 17 353 organic molecules. Several ML algorithms were screened, including random forest (RF), support vector machines (SVM), and deep learning multilayer perceptron networks (MLP). The best performance was achieved with a consensus classification model combining the RF, SVM, and MLP models, which predicted the external test set with an overall predictive accuracy (Q) of up to 80%.
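The abstract does not state how the three models were combined into a consensus, so the following is only a minimal sketch assuming a simple majority vote over the class labels A, B, and C, with ties resolved in favor of the first model listed. The function names (`consensus_vote`, `overall_accuracy`) and the toy labels are hypothetical and are not taken from the paper; Q is computed here as the fraction of correctly classified molecules.

```python
from collections import Counter


def consensus_vote(predictions_per_model):
    """Combine per-model label lists by majority vote.

    Assumption (not from the paper): when no single label has the most
    votes, the label from the first model in the list is used.
    """
    consensus = []
    for labels in zip(*predictions_per_model):
        counts = Counter(labels)
        top_count = max(counts.values())
        tied = [label for label, count in counts.items() if count == top_count]
        consensus.append(tied[0] if len(tied) == 1 else labels[0])
    return consensus


def overall_accuracy(y_true, y_pred):
    """Overall predictive accuracy Q: correct predictions / total."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


# Hypothetical toy data; classes follow the abstract's naming
# (A = crystalline, B = polymorphic, C = amorphous).
y_true = ["A", "A", "B", "C", "A", "B"]
rf_pred = ["A", "A", "B", "A", "A", "A"]
svm_pred = ["A", "B", "B", "C", "A", "B"]
mlp_pred = ["A", "A", "A", "C", "B", "C"]

consensus = consensus_vote([rf_pred, svm_pred, mlp_pred])
q = overall_accuracy(y_true, consensus)
print(consensus, round(q, 3))
```

With class A making up 53 998 of the 57 815 molecules, overall accuracy alone can be dominated by the majority class, so per-class rates would normally be reported alongside Q.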
Pages: 2817-2826
Page count: 10