Cocrystal Prediction Using Machine Learning Models and Descriptors

被引:40
作者
Mswahili, Medard Edmund [1 ]
Lee, Min-Jeong [2 ]
Martin, Gati Lother [1 ]
Kim, Junghyun [3 ]
Kim, Paul [4 ]
Choi, Guang J. [2 ]
Jeong, Young-Seob [1 ]
机构
[1] Soonchunhyang Univ, Dept ICT Convergence, Asan 31538, South Korea
[2] Soonchunhyang Univ, Dept Pharmaceut Engn, Asan 31538, South Korea
[3] Soonchunhyang Univ, Dept Future Convergence Technol, Asan 31538, South Korea
[4] Soonchunhyang Univ, Dept Med Sci, Asan 31538, South Korea
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 03期
关键词
descriptor; machine learning; feature selection; cocrystal prediction; SOLID-STATE PROPERTIES; PHARMACEUTICAL COCRYSTALS; INTERMOLECULAR INTERACTIONS; CO-CRYSTALS; PACKAGE; SALTS; PATH;
D O I
10.3390/app11031323
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Cocrystals are of much interest in industrial application as well as academic research, and screening of suitable coformers for active pharmaceutical ingredients is the most crucial and challenging step in cocrystal development. Recently, machine learning techniques are attracting researchers in many fields including pharmaceutical research such as quantitative structure-activity/property relationship. In this paper, we develop machine learning models to predict cocrystal formation. We extract descriptor values from simplified molecular-input line-entry system (SMILES) of compounds and compare the machine learning models by experiments with our collected data of 1476 instances. As a result, we found that artificial neural network shows great potential as it has the best accuracy, sensitivity, and F1 score. We also found that the model achieved comparable performance with about half of the descriptors chosen by feature selection algorithms. We believe that this will contribute to faster and more accurate cocrystal development.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 67 条
[1]   Building co-crystals with molecular sense and supramolecular sensibility [J].
Aakeröy, CB ;
Salmon, DJ .
CRYSTENGCOMM, 2005, 7 :439-448
[2]   Polymorphs, Salts, and Cocrystals: What's in a Name? [J].
Aitipamula, Srinivasulu ;
Banerjee, Rahul ;
Bansal, Arvind K. ;
Biradha, Kumar ;
Cheney, Miranda L. ;
Choudhury, Angshuman Roy ;
Desiraju, Gautam R. ;
Dikundwar, Amol G. ;
Dubey, Ritesh ;
Duggirala, Nagakiran ;
Ghogale, Preetam P. ;
Ghosh, Soumyajit ;
Goswami, Pramod Kumar ;
Goud, N. Rajesh ;
Jetti, Ram R. K. R. ;
Karpinski, Piotr ;
Kaushik, Poonam ;
Kumar, Dinesh ;
Kumar, Vineet ;
Moulton, Brian ;
Mukherjee, Arijit ;
Mukherjee, Gargi ;
Myerson, Allan S. ;
Puri, Vibha ;
Ramanan, Arunachalam ;
Rajamannar, T. ;
Reddy, C. Malla ;
Rodriguez-Hornedo, Nair ;
Rogers, Robin D. ;
Row, T. N. Guru ;
Sanphui, Palash ;
Shan, Ning ;
Shete, Ganesh ;
Singh, Amit ;
Sun, Changquan C. ;
Swift, Jennifer A. ;
Thaimattam, Ram ;
Thakur, Tejender S. ;
Thaper, Rajesh Kumar ;
Thomas, Sajesh P. ;
Tothadi, Srinu ;
Vangala, Venu R. ;
Variankaval, Narayan ;
Vishweshwar, Peddy ;
Weyna, David R. ;
Zaworotko, Michael J. .
CRYSTAL GROWTH & DESIGN, 2012, 12 (05) :2147-2152
[3]   Crystal engineering of the composition of pharmaceutical phases.: Do pharmaceutical co-crystals represent a new path to improved medicines? [J].
Almarsson, Ö ;
Zaworotko, MJ .
CHEMICAL COMMUNICATIONS, 2004, (17) :1889-1896
[4]  
[Anonymous], 2010, ICML
[5]   Pharmaceutical cocrystals, salts and multicomponent systems; intermolecular interactions and property based design [J].
Berry, David J. ;
Steed, Jonathan W. .
ADVANCED DRUG DELIVERY REVIEWS, 2017, 117 :3-24
[6]   Pharmaceutical cocrystals: walking the talk [J].
Bolla, Geetha ;
Nangia, Ashwini .
CHEMICAL COMMUNICATIONS, 2016, 52 (54) :8342-8360
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]   Pharmaceutical cocrystals: The coming wave of new drug substances [J].
Brittain, Harry G. .
JOURNAL OF PHARMACEUTICAL SCIENCES, 2013, 102 (02) :311-317
[9]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[10]   Rcpi: R/Bioconductor package to generate various descriptors of proteins, compounds and their interactions [J].
Cao, Dong-Sheng ;
Xiao, Nan ;
Xu, Qing-Song ;
Chen, Alex F. .
BIOINFORMATICS, 2015, 31 (02) :279-281