Prediction of melting points of organic compounds using extreme learning machines

Cited by: 37
Authors
Bhat, Akshay U. [1]
Merchant, Shamel S. [1]
Bhagwat, Sunil S. [1]
Affiliations
[1] Univ Bombay, Inst Chem Technol, Dept Chem Engn, Bombay 400019, Maharashtra, India
DOI
10.1021/ie0704647
Chinese Library Classification number
TQ [Chemical industry];
Discipline classification code
0817;
Abstract
Nonlinear regression methods such as artificial neural networks have been used extensively to predict the properties of compounds from their molecular structure. Recently, a fast new algorithm for training artificial neural networks, known as the extreme learning machine, was developed. In this paper, we apply a simple ensemble of extreme learning machines to a large data set of melting points of organic molecules. The results obtained by extreme learning machines (cross-validated test set root-mean-square error = 45.4 K) are slightly better than those obtained using k-nearest-neighbor regression with genetic parameter optimization (cross-validated test set error = 46.2 K) and significantly better than those obtained by artificial neural networks trained using gradient descent (test set error = 49.3 K). Training an extreme learning machine involves only linear regression, which makes training faster. Ensembling the extreme learning machines removes the dependence of the results on the initial random weights and improves the predictions. We also discuss the similarity between an ensemble of extreme learning machines and the random forest algorithm.
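The abstract's two central ideas, that an extreme learning machine is trained by linear regression alone and that averaging several of them reduces the dependence on the random initial weights, can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the tanh activation, the hidden-layer size, the number of ensemble members, plain averaging as the ensembling rule, the function names, and the synthetic data standing in for molecular descriptors and melting points.

```python
import numpy as np


def train_elm(X, y, n_hidden, rng):
    """Fit one extreme learning machine (hypothetical helper)."""
    # Hidden-layer weights and biases are drawn at random and never updated.
    W = rng.normal(size=(X.shape[1], n_hidden))
    b = rng.normal(size=n_hidden)
    H = np.tanh(X @ W + b)  # assumed activation
    # Append a constant column so the output layer has an intercept.
    H1 = np.hstack([H, np.ones((H.shape[0], 1))])
    # The only fitted parameters are the output weights (linear least squares).
    beta, *_ = np.linalg.lstsq(H1, y, rcond=None)
    return W, b, beta


def predict_elm(model, X):
    W, b, beta = model
    H = np.tanh(X @ W + b)
    return np.hstack([H, np.ones((H.shape[0], 1))]) @ beta


def train_ensemble(X, y, n_models=20, n_hidden=30, seed=0):
    """Train several ELMs that differ only in their random hidden layers."""
    rng = np.random.default_rng(seed)
    return [train_elm(X, y, n_hidden, rng) for _ in range(n_models)]


def predict_ensemble(models, X):
    # Assumed ensembling rule: plain average of the member predictions.
    return np.mean([predict_elm(m, X) for m in models], axis=0)


def rmse(y_true, y_pred):
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))


# Synthetic stand-in data (NOT melting-point data).
data_rng = np.random.default_rng(42)
X = data_rng.uniform(-1.0, 1.0, size=(300, 5))
y = np.sin(3.0 * X[:, 0]) + X[:, 1] ** 2 + 0.1 * data_rng.normal(size=300)
X_train, y_train = X[:200], y[:200]
X_test, y_test = X[200:], y[200:]

models = train_ensemble(X_train, y_train)
single_rmses = [rmse(y_test, predict_elm(m, X_test)) for m in models]
ensemble_rmse = rmse(y_test, predict_ensemble(models, X_test))

print("mean single-model RMSE:", np.mean(single_rmses))
print("ensemble RMSE:", ensemble_rmse)
```

Because the RMSE of an averaged prediction can never exceed the average RMSE of the members (triangle inequality), the ensemble is at least as good as a typical single machine on any data set; how much it helps in practice depends on how strongly the members disagree.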
Pages: 920-925
Number of pages: 6