Interpretation of Nonlinear QSAR Models Applied to Ames Mutagenicity Data

被引:47
作者
Carlsson, Lars [1 ]
Helgee, Ernst Ahlberg [1 ]
Boyer, Scott [1 ]
机构
[1] AstraZeneca Res & Dev, Safety Assessment, S-43183 Molndal, Sweden
关键词
SIGNATURE MOLECULAR DESCRIPTOR; REGRESSION;
D O I
10.1021/ci9002206
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
A method for local interpretation of QSAR models is presented and applied to an Ames mutagenicity data set. In the work presented, local interpretation of Support Vector Machine and Random Forest models is achieved by retrieving the variable corresponding to the largest component of the decision-function gradient at any point in the model. This contribution to the model is the variable that is regarded as having the most importance at that particular point in the model. The method described has been verified using two sets of simulated data and Ames mutagenicity data. This work indicates that it is possible to interpret nonlinear machine-learning methods. Comparison to an interpretable linear method is also presented.
引用
收藏
页码:2551 / 2558
页数:8
相关论文
共 20 条
[1]  
[Anonymous], DAYLIGHT THEORY SMAR
[2]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[3]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[4]  
Christianini N., 2004, INTRO SUPPORT VECTOR
[5]  
Dimitriadou E., 2006, Misc Functions of the Department of Statistics (e1071)
[6]   The signature molecular descriptor. 1. Using extended valence sequences in QSAR and QSPR studies [J].
Faulon, JL ;
Visco, DP ;
Pophale, RS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (03) :707-720
[7]   The signature molecular descriptor. 2. Enumerating molecules from their extended valence sequences [J].
Faulon, JL ;
Churchwell, CJ ;
Visco, DP .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (03) :721-734
[8]   Machine learning techniques for in silico modeling of drug metabolism [J].
Fox, Thomas ;
Krieg, Jan M. .
CURRENT TOPICS IN MEDICINAL CHEMISTRY, 2006, 6 (15) :1579-1591
[9]   Extraction and visualization of potential pharmacophore points using support vector machines: Application to ligand-based virtual screening for COX-2 inhibitors [J].
Franke, L ;
Byvatov, E ;
Werz, O ;
Steinhilber, D ;
Schneider, P ;
Schneider, G .
JOURNAL OF MEDICINAL CHEMISTRY, 2005, 48 (22) :6997-7004
[10]   Local lazy regression: Making use of the neighborhood to improve QSAR predictions [J].
Guha, Rajarshi ;
Dutta, Debojyoti ;
Jurs, Peter C. ;
Chen, Ting .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (04) :1836-1847