Evaluation of Machine Learning Models for Aqueous Solubility Prediction in Drug Discovery

被引:0
作者
Xue, Nian [1 ]
Zhang, Yuzhu [2 ]
Liu, Sensen [3 ]
机构
[1] NYU, Dept Comp Sc & Engn, New York, NY USA
[2] Carnegie Mellon Univ, Sch Comp Sc, Pittsburgh, PA 15213 USA
[3] Washington Univ, Dept Elect & Syst Engn, St Louis, MO 63110 USA
来源
2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024 | 2024年
关键词
Machine Learning; Solubility Prediction; Drug Discovery; Feature Importance; DESCRIPTORS; QSAR;
D O I
10.1109/ICAIBD62003.2024.10604556
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Determining the aqueous solubility of the chemical compound is of great importance in-silico drug discovery. However, correctly and rapidly predicting the aqueous solubility remains a challenging task. This paper explores and evaluates the predictability of multiple machine learning models in the aqueous solubility of compounds. Specifically, we apply a series of machine learning algorithms, including Random Forest, XG-Boost, LightGBM, and CatBoost, on a well-established aqueous solubility dataset (i.e., the Huuskonen dataset) of over 1200 compounds. Experimental results show that even traditional machine learning algorithms can achieve satisfactory performance with high accuracy. In addition, our investigation goes beyond mere prediction accuracy, delving into the interpretability of models to identify key features and understand the molecular properties that influence the predicted outcomes. This study sheds light on the ability to use machine learning approaches to predict compound solubility, significantly shortening the time that researchers spend on new drug discovery.
引用
收藏
页码:26 / 33
页数:8
相关论文
共 50 条
  • [41] An evaluation of thermodynamic models for the prediction of drug and drug-like molecule solubility in organic solvents
    Bouillot, Baptiste
    Teychene, Sebastien
    Biscans, Beatrice
    FLUID PHASE EQUILIBRIA, 2011, 309 (01) : 36 - 52
  • [42] Artificial Intelligence and Machine Learning Technology Driven Modern Drug Discovery and Development
    Sarkar, Chayna
    Das, Biswadeep
    Rawat, Vikram Singh
    Wahlang, Julie Birdie
    Nongpiur, Arvind
    Tiewsoh, Iadarilang
    Lyngdoh, Nari M.
    Das, Debasmita
    Bidarolli, Manjunath
    Sony, Hannah Theresa
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (03)
  • [43] Machine learning-based solubility prediction and methodology evaluation of active pharmaceutical ingredients in industrial crystallization
    Ma Yiming
    Gao Zhenguo
    Shi Peng
    Chen Mingyang
    Wu Songgu
    Yang Chao
    Wang JingKang
    Cheng Jingcai
    Gong Junbo
    Frontiers of Chemical Science and Engineering, 2022, 16 (04) : 523 - 535
  • [44] Machine learning-based solubility prediction and methodology evaluation of active pharmaceutical ingredients in industrial crystallization
    Yiming Ma
    Zhenguo Gao
    Peng Shi
    Mingyang Chen
    Songgu Wu
    Chao Yang
    Jingkang Wang
    Jingcai Cheng
    Junbo Gong
    Frontiers of Chemical Science and Engineering, 2022, 16 : 523 - 535
  • [45] Machine learning-based solubility prediction and methodology evaluation of active pharmaceutical ingredients in industrial crystallization
    Ma, Yiming
    Gao, Zhenguo
    Shi, Peng
    Chen, Mingyang
    Wu, Songgu
    Yang, Chao
    Wang, Jingkang
    Cheng, Jingcai
    Gong, Junbo
    FRONTIERS OF CHEMICAL SCIENCE AND ENGINEERING, 2022, 16 (04) : 523 - 535
  • [46] Fingerprinting Interactions between Proteins and Ligands for Facilitating Machine Learning in Drug Discovery
    Li, Zoe
    Huang, Ruili
    Xia, Menghang
    Patterson, Tucker A.
    Hong, Huixiao
    BIOMOLECULES, 2024, 14 (01)
  • [47] Advancing Anticancer Drug Discovery: Leveraging Metabolomics and Machine Learning for Mode of Action Prediction by Pattern Recognition
    Saoud, Mohamad
    Grau, Jan
    Rennert, Robert
    Mueller, Thomas
    Yousefi, Mohammad
    Davari, Mehdi D.
    Hause, Bettina
    Csuk, Rene
    Rashan, Luay
    Grosse, Ivo
    Tissier, Alain
    Wessjohann, Ludger A.
    Balcke, Gerd U.
    ADVANCED SCIENCE, 2024, 11 (47)
  • [48] Explainability of Machine Learning Models for Bankruptcy Prediction
    Park, Min Sue
    Son, Hwijae
    Hyun, Chongseok
    Hwang, Hyung Ju
    IEEE ACCESS, 2021, 9 : 124887 - 124899
  • [49] Machine learning models in the prediction of drug metabolism: challenges and future perspectives
    Litsa, Eleni E.
    Das, Payel
    Kavraki, Lydia E.
    EXPERT OPINION ON DRUG METABOLISM & TOXICOLOGY, 2021, 17 (11) : 1245 - 1247
  • [50] Prediction of CO2 solubility in aqueous amine solutions using machine learning method
    Liu, Bin
    Yu, Yanan
    Liu, Zijian
    Cui, Zhe
    Tian, Wende
    SEPARATION AND PURIFICATION TECHNOLOGY, 2025, 354