A random forest model for predicting the crystallisability of organic molecules

Cited by: 25
Authors
Bhardwaj, Rajni M. [1]
Johnston, Andrea [1]
Johnston, Blair F. [1]
Florence, Alastair J. [1]
Affiliations
[1] Univ Strathclyde, Strathclyde Inst Pharm & Biomed Sci, Glasgow G4 0RE, Lanark, Scotland
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
PROTEIN CRYSTALLIZATION PROPENSITY; CLASSIFICATION; QSAR; PURIFICATION; SCALE;
DOI
10.1039/c4ce02403f
Chinese Library Classification (CLC) number
O6 [Chemistry]
Discipline classification code
0703
Abstract
For the first time, a random forest model has enabled prediction of the crystallisability (crystals vs. no crystals) of organic molecules with approximately 70% accuracy. The predictive model is based on calculated molecular descriptors and published experimental crystallisation propensities for a library of substituted acylanilides.
Pages: 4272-4275
Number of pages: 4