Interpretation of QSAR Models by Coloring Atoms According to Changes in Predicted Activity: How Robust Is It?

被引:48
作者
Sheridan, Robert P. [1 ]
机构
[1] Merck & Co Inc, Modeling & Informat, Kenilworth, NJ 07065 USA
关键词
COMPOUND CLASSIFICATION; REGRESSION; TOOL;
D O I
10.1021/acs.jcim.8b00825
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Most chemists would agree that the ability to interpret a quantitative structure activity relationship (QSAR) model is as important as the ability of the model to make accurate predictions. One type of interpretation is coloration of atoms in molecules according to the contribution of each atom to the predicted activity, as in "heat maps". The ability to determine which parts of a molecule increase the activity in question and which decrease it should be useful to chemists who want to modify the molecule. For that type of application, we would hope the coloration to not be particularly sensitive to the details of model building. In this Article, we examine a number of aspects of coloration against 20 combinations of descriptors and QSAR methods. We demonstrate that atom-level coloration is much less robust to descriptor/method combinations than cross-validated predictions. Even in ideal cases where the contribution of individual atoms is known, we cannot always recover the important atoms for some descriptor/method combinations. Thus, model interpretation by atom coloration may not be as simple as it first appeared.
引用
收藏
页码:1324 / 1337
页数:14
相关论文
共 23 条
[1]   Kernel-Based Partial Least Squares: Application to Fingerprint-Based QSAR with Model Visualization [J].
An, Yuling ;
Sherman, Woody ;
Dixon, Steven L. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2013, 53 (09) :2312-2321
[2]  
[Anonymous], PROCEEDINGS OF THE 2, DOI DOI 10.1109/ICPADS.2016.105
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]   ATOM PAIRS AS MOLECULAR-FEATURES IN STRUCTURE ACTIVITY STUDIES - DEFINITION AND APPLICATIONS [J].
CARHART, RE ;
SMITH, DH ;
VENKATARAGHAVAN, R .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1985, 25 (02) :64-73
[5]   Benchmarking the Predictive Power of Ligand Efficiency Indices in QSAR [J].
Cortes-Ciriano, Isidro .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2016, 56 (08) :1576-1587
[6]  
Fan RE, 2008, J MACH LEARN RES, V9, P1871
[7]   ChEMBL: a large-scale bioactivity database for drug discovery [J].
Gaulton, Anna ;
Bellis, Louisa J. ;
Bento, A. Patricia ;
Chambers, Jon ;
Davies, Mark ;
Hersey, Anne ;
Light, Yvonne ;
McGlinchey, Shaun ;
Michalovich, David ;
Al-Lazikani, Bissan ;
Overington, John P. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D1100-D1107
[8]   6 Deep Learning in Drug Discovery [J].
Gawehn, Erik ;
Hiss, Jan A. ;
Schneider, Gisbert .
MOLECULAR INFORMATICS, 2016, 35 (01) :3-14
[9]   Chemical similarity using physiochemical property descriptors [J].
Kearsley, SK ;
Sallamack, S ;
Fluder, EM ;
Andose, JD ;
Mosley, RT ;
Sheridan, RP .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (01) :118-127
[10]   Deep Neural Nets as a Method for Quantitative Structure-Activity Relationships [J].
Ma, Junshui ;
Sheridan, Robert P. ;
Liaw, Andy ;
Dahl, George E. ;
Svetnik, Vladimir .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (02) :263-274