P2Rank: machine learning based tool for rapid and accurate prediction of ligand binding sites from protein structure

Cited by: 297
Authors
Krivak, Radoslav [1]
Hoksza, David [1]
Affiliations
[1] Charles Univ Prague, Dept Software Engn, Prague, Czech Republic
Keywords
Ligand binding sites; Protein pockets; Binding site prediction; Protein surface descriptors; Machine learning; Random forests; DRUG DESIGN; POCKETS; IDENTIFICATION; DRUGGABILITY; ALGORITHM; CAVITIES; CLASSIFICATION; VALIDATION; FINDSITE; DIVERSE;
DOI
10.1186/s13321-018-0285-8
Chinese Library Classification
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Background: Ligand binding site prediction from protein structure has many applications related to elucidation of protein function and structure-based drug discovery. It often represents only one step of many in complex computational drug design efforts. Although many methods have been published to date, only a few of them are suitable for use in automated pipelines or for processing large datasets. These use cases require stability and speed, which disqualifies many of the recently introduced tools that are either template-based or available only as web servers.
Results: We present P2Rank, a stand-alone template-free tool for prediction of ligand binding sites based on machine learning. It is based on prediction of ligandability of local chemical neighbourhoods that are centered on points placed on the solvent accessible surface of a protein. We show that P2Rank outperforms several existing tools, which include two widely used stand-alone tools (Fpocket, SiteHound), a comprehensive consensus-based tool (MetaPocket 2.0), and a recent deep-learning-based method (DeepSite). P2Rank is among the fastest available tools (it requires under 1 s for prediction on one protein), with the additional advantage of a multi-threaded implementation.
Conclusions: P2Rank is a new open source software package for ligand binding site prediction from protein structure. It is available as a user-friendly stand-alone command line program and a Java library. P2Rank has a lightweight installation and does not depend on other bioinformatics tools or large structural or sequence databases. Thanks to its speed and ability to make fully automated predictions, it is particularly well suited for processing large datasets or as a component of scalable structural bioinformatics pipelines.
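The Results part of the abstract describes scoring points on the solvent accessible surface by the ligandability of their local chemical neighbourhood. The following is a minimal Python sketch of that general idea only, not P2Rank's actual implementation. The function names, the two per-atom descriptors, the 6.0 radius, the linear distance weighting, and `toy_model` (a stand-in for a trained classifier such as the random forest named in the keywords) are all assumptions made for illustration.

```python
import math


def neighbourhood_features(point, atoms, radius=6.0):
    """Aggregate per-atom descriptors around one surface point.

    `atoms` is a list of (x, y, z, props) tuples; `props` is a list of
    hypothetical per-atom descriptors. Atoms within `radius` contribute
    with a weight that falls linearly from 1 (at the point) to 0 (at radius).
    """
    n_props = len(atoms[0][3])
    features = [0.0] * n_props
    for x, y, z, props in atoms:
        d = math.dist(point, (x, y, z))
        if d <= radius:
            weight = 1.0 - d / radius
            for i, value in enumerate(props):
                features[i] += weight * value
    return features


def toy_model(features):
    """Hypothetical stand-in for a trained classifier.

    Maps the first descriptor (here: hydrophobicity) to a score in (0, 1).
    """
    hydrophobic = features[0]
    return 1.0 / (1.0 + math.exp(-(hydrophobic - 1.0)))


def score_points(points, atoms, model, radius=6.0):
    """Return one ligandability-like score per surface point."""
    return [model(neighbourhood_features(p, atoms, radius)) for p in points]


# Made-up data: descriptors are [hydrophobicity, atom count].
atoms = [
    (0.0, 0.0, 0.0, [1.0, 1.0]),
    (3.0, 0.0, 0.0, [1.0, 1.0]),
    (20.0, 0.0, 0.0, [0.0, 1.0]),
]
points = [(1.5, 0.0, 0.0), (20.0, 3.0, 0.0)]
scores = score_points(points, atoms, toy_model)
```

In this toy data the first point sits between two hydrophobic atoms and so receives a higher score than the second point, which has only one non-hydrophobic neighbour.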
Pages: 12