Evaluation of Machine Learning Models for Aqueous Solubility Prediction in Drug Discovery

Cited by: 0
Authors
Xue, Nian [1]
Zhang, Yuzhu [2]
Liu, Sensen [3]
Affiliations
[1] NYU, Dept Comp Sc & Engn, New York, NY USA
[2] Carnegie Mellon Univ, Sch Comp Sc, Pittsburgh, PA 15213 USA
[3] Washington Univ, Dept Elect & Syst Engn, St Louis, MO 63110 USA
Source
2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024 | 2024
Keywords
Machine Learning; Solubility Prediction; Drug Discovery; Feature Importance; DESCRIPTORS; QSAR;
DOI
10.1109/ICAIBD62003.2024.10604556
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Determining the aqueous solubility of a chemical compound is of great importance in in-silico drug discovery. However, predicting aqueous solubility correctly and rapidly remains a challenging task. This paper explores and evaluates how well multiple machine learning models predict the aqueous solubility of compounds. Specifically, we apply a series of machine learning algorithms, including Random Forest, XGBoost, LightGBM, and CatBoost, to a well-established aqueous solubility dataset (i.e., the Huuskonen dataset) of over 1200 compounds. Experimental results show that even traditional machine learning algorithms can achieve satisfactory performance with high accuracy. In addition, our investigation goes beyond prediction accuracy and examines model interpretability to identify key features and understand the molecular properties that influence the predicted outcomes. This study sheds light on the use of machine learning approaches to predict compound solubility, which can significantly shorten the time that researchers spend on new drug discovery.
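The workflow the abstract describes (fit a tree ensemble to molecular descriptors, score it on held-out compounds, then rank the features) can be sketched roughly as follows. This is only an illustration under stated assumptions: the data are synthetic rather than the Huuskonen dataset, the descriptor names are hypothetical, and impurity-based importances from scikit-learn's RandomForestRegressor stand in for whatever interpretability method the paper actually uses.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import r2_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in for a descriptor table; this is NOT the Huuskonen data.
rng = np.random.default_rng(0)
feature_names = ["logp", "mol_weight", "h_bond_donors", "noise"]  # hypothetical names
X = rng.normal(size=(600, len(feature_names)))

# Assumed toy relationship: the target depends strongly on the first column,
# weakly on the second, and not at all on the remaining two.
y = -1.5 * X[:, 0] - 0.5 * X[:, 1] + 0.1 * rng.normal(size=600)

# Hold out 20% of the compounds for evaluation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

model = RandomForestRegressor(n_estimators=200, random_state=0)
model.fit(X_train, y_train)

# Prediction quality on unseen compounds.
r2 = r2_score(y_test, model.predict(X_test))

# Impurity-based importances sum to 1; sort them from most to least important.
importances = model.feature_importances_
ranking = sorted(zip(feature_names, importances), key=lambda p: p[1], reverse=True)

print(f"held-out R^2: {r2:.3f}")
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Impurity-based importances are convenient but can be biased toward high-cardinality features, so a permutation-based check on the held-out set may be a useful complement.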
Pages: 26-33
Number of pages: 8