P2Rank: machine learning based tool for rapid and accurate prediction of ligand binding sites from protein structure

被引:297
作者
Krivak, Radoslav [1 ]
Hoksza, David [1 ]
机构
[1] Charles Univ Prague, Dept Software Engn, Prague, Czech Republic
关键词
Ligand binding sites; Protein pockets; Binding site prediction; Protein surface descriptors; Machine learning; Random forests; DRUG DESIGN; POCKETS; IDENTIFICATION; DRUGGABILITY; ALGORITHM; CAVITIES; CLASSIFICATION; VALIDATION; FINDSITE; DIVERSE;
D O I
10.1186/s13321-018-0285-8
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Background: Ligand binding site prediction from protein structure has many applications related to elucidation of protein function and structure based drug discovery. It often represents only one step of many in complex computational drug design efforts. Although many methods have been published to date, only few of them are suitable for use in automated pipelines or for processing large datasets. These use cases require stability and speed, which disqualifies many of the recently introduced tools that are either template based or available only as web servers. Results: We present P2Rank, a stand-alone template-free tool for prediction of ligand binding sites based on machine learning. It is based on prediction of ligandability of local chemical neighbourhoods that are centered on points placed on the solvent accessible surface of a protein. We show that P2Rank outperforms several existing tools, which include two widely used stand-alone tools (Fpocket, SiteHound), a comprehensive consensus based tool (MetaPocket 2.0), and a recent deep learning based method (DeepSite). P2Rank belongs to the fastest available tools (requires under 1 s for prediction on one protein), with additional advantage of multi-threaded implementation. Conclusions: P2Rank is a new open source software package for ligand binding site prediction from protein structure. It is available as a user-friendly stand-alone command line program and a Java library. P2Rank has a lightweight installation and does not depend on other bioinformatics tools or large structural or sequence databases. Thanks to its speed and ability to make fully automated predictions, it is particularly well suited for processing large datasets or as a component of scalable structural bioinformatics pipelines.
引用
收藏
页数:12
相关论文
共 81 条
[11]   Graph-Based Clustering of Predicted Ligand-Binding Pockets on Protein Surfaces [J].
Degac, Jennifer ;
Winter, Uwe ;
Helms, Volkhard .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (09) :1944-1952
[12]   sc-PDB: a 3D-database of ligandable binding sites-10 years on [J].
Desaphy, Jeremy ;
Bret, Guillaume ;
Rognan, Didier ;
Kellenberger, Esther .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D399-D404
[13]   Comparison and Druggability Prediction of Protein-Ligand Binding Sites from Pharmacophore-Annotated Cavity Shapes [J].
Desaphy, Jeremy ;
Azdimousa, Karima ;
Kellenberger, Esther ;
Rognan, Didier .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2012, 52 (08) :2287-2299
[14]  
Di Pietro O., 2017, PLOS ONE, V12, P1
[15]   THE DOUBLE CUBIC LATTICE METHOD - EFFICIENT APPROACHES TO NUMERICAL-INTEGRATION OF SURFACE-AREA AND VOLUME AND TO DOT SURFACE CONTOURING OF MOLECULAR ASSEMBLIES [J].
EISENHABER, F ;
LIJNZAAD, P ;
ARGOS, P ;
SANDER, C ;
SCHARF, M .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1995, 16 (03) :273-284
[16]   Structure-based druggability assessment - identifying suitable targets for small molecule therapeutics [J].
Fauman, Eric B. ;
Rai, Brajesh K. ;
Huang, Enoch S. .
CURRENT OPINION IN CHEMICAL BIOLOGY, 2011, 15 (04) :463-468
[17]   Calculating an optimal box size for ligand docking and virtual screening against experimental and predicted binding pockets [J].
Feinstein, Wei P. ;
Brylinski, Michal .
JOURNAL OF CHEMINFORMATICS, 2015, 7
[18]   bSiteFinder, an improved protein-binding sites prediction server based on structural alignment: more accurate and less time-consuming [J].
Gao, Jun ;
Zhang, Qingchen ;
Liu, Min ;
Zhu, Lixin ;
Wu, Dingfeng ;
Cao, Zhiwei ;
Zhu, Ruixin .
JOURNAL OF CHEMINFORMATICS, 2016, 8
[19]   EasyMIFs and SiteHound: a toolkit for the identification of ligand-binding sites in protein structures [J].
Ghersi, Dario ;
Sanchez, Roberto .
BIOINFORMATICS, 2009, 25 (23) :3185-3186
[20]  
Grove L. E., 2016, Fragment-based Drug Discovery: Lessons and Outlook, P197, DOI [10.1002/9783527683604.ch09, DOI 10.1002/9783527683604.CH09]