Extending in Silico Protein Target Prediction Models to Include Functional Effects

被引:6
作者
Mervin, Lewis H. [1 ]
Afzal, Avid M. [1 ]
Brive, Lars [2 ]
Engkvist, Ola [3 ]
Bender, Andreas [1 ]
机构
[1] Univ Cambridge, Dept Chem, Ctr Mol Informat, Cambridge, England
[2] Cygnal Biosci, Pixbo, Sweden
[3] AstraZeneca, IMED Biotech Unit, Discovery Sci, Hit Discovery, Gothenburg, Sweden
基金
英国生物技术与生命科学研究理事会;
关键词
target prediction; activation; inhibition; cheminformatics; functional effects; mechanism-of-action; chemical space; AD-AUC; DECONVOLUTION; CYTOTOXICITY; DATABASES; DESIGN;
D O I
10.3389/fphar.2018.00613
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
In silico protein target deconvolution is frequently used for mechanism-of-action investigations; however existing protocols usually do not predict compound functional effects, such as activation or inhibition, upon binding to their protein counterparts. This study is hence concerned with including functional effects in target prediction. To this end, we assimilated a bioactivity training set for 332 targets, comprising 817,239 active data points with unknown functional effect (binding data) and 20,761,260 inactive compounds, along with 226,045 activating and 1,032,439 inhibiting data points from functional screens. Chemical space analysis of the data first showed some separation between compound sets (binding and inhibiting compounds were more similar to each other than both binding and activating or activating and inhibiting compounds), providing a rationale for implementing functional prediction models. We employed three different architectures to predict functional response, ranging from simplistic random forest models ('Arch1') to cascaded models which use separate binding and functional effect classification steps ('Arch2' and 'Arch3'), differing in the way training sets were generated. Fivefold stratified cross-validation outlined cascading predictions provides superior precision and recall based on an internal test set. We next prospectively validated the architectures using a temporal set of 153,467 of in-house data points (after a 4-month interim from initial data extraction). Results outlined Arch3 performed with the highest target class averaged precision and recall scores of 71% and 53%, which we attribute to the use of inactive background sets. Distance-based applicability domain (AD) analysis outlined that Arch3 provides superior extrapolation into novel areas of chemical space, and thus based on the results presented here, propose as the most suitable architecture for the functional effect prediction of small molecules. We finally conclude including functional effects could provide vital insight in future studies, to annotate cases of unanticipated functional changeover, as outlined by our CHRM1 case study.
引用
收藏
页数:13
相关论文
共 39 条
[21]  
Liggi S, 2014, FUTURE MED CHEM, V6, P2029, DOI [10.4155/fmc.14.137, 10.4155/FMC.14.137]
[22]   Large-scale prediction and testing of drug activity on side-effect targets [J].
Lounkine, Eugen ;
Keiser, Michael J. ;
Whitebread, Steven ;
Mikhailov, Dmitri ;
Hamon, Jacques ;
Jenkins, Jeremy L. ;
Lavan, Paul ;
Weber, Eckhard ;
Doak, Allison K. ;
Cote, Serge ;
Shoichet, Brian K. ;
Urban, Laszlo .
NATURE, 2012, 486 (7403) :361-+
[23]  
MDL Drug Data Report [MDDR], 2006, MDL DRUG DAT REP MDD
[24]   Understanding Cytotoxicity and Cytostaticity in a High-Throughput Screening Collection [J].
Mervin, Lewis H. ;
Cao, Qing ;
Barret, Ian P. ;
Firth, Mike A. ;
Murray, David ;
McWilliams, Lisa ;
Haddrick, Malcolm ;
Wigglesworth, Mark ;
Engkvist, Ola ;
Bender, Andreas .
ACS CHEMICAL BIOLOGY, 2016, 11 (11) :3007-3023
[25]   Target prediction utilising negative bioactivity data covering large chemical space [J].
Mervin, Lewis H. ;
Afzal, Avid M. ;
Drakakis, Georgios ;
Lewis, Richard ;
Engkvist, Ola ;
Bender, Andreas .
JOURNAL OF CHEMINFORMATICS, 2015, 7
[26]   GENERATION OF A UNIQUE MACHINE DESCRIPTION FOR CHEMICAL STRUCTURES-A TECHNIQUE DEVELOPED AT CHEMICAL ABSTRACTS SERVICE [J].
MORGAN, HL .
JOURNAL OF CHEMICAL DOCUMENTATION, 1965, 5 (02) :107-&
[27]   Making every SAR point count: the development of Chemistry Connect for the large-scale integration of structure and bioactivity data [J].
Muresan, Sorel ;
Petrov, Plamen ;
Southan, Christopher ;
Kjellberg, Magnus J. ;
Koger, Thierry ;
Tyrchan, Christian ;
Varkonyi, Peter ;
Xie, Paul Hongxing .
DRUG DISCOVERY TODAY, 2011, 16 (23-24) :1019-1030
[28]  
NCBI, 2007, PUBCHEM PUG HELP
[29]  
OEChem Toolkits, 2017, OECHEM TOOLK VERS 2
[30]   PHOSPHORYLATION AND INACTIVATION OF THE MITOTIC INHIBITOR WEE1 BY THE NIM1/CDR1 KINASE [J].
PARKER, LL ;
WALTER, SA ;
YOUNG, PG ;
PIWNICAWORMS, H .
NATURE, 1993, 363 (6431) :736-738