Prediction of Druggable Proteins Using Machine Learning and Systems Biology: A Mini-Review

被引:42
作者
Kandoi, Gaurav [1 ]
Acencio, Marcio L. [2 ]
Lemke, Ney [2 ]
机构
[1] Iowa State Univ, Dept Elect & Comp Engn, Ames, IA 50011 USA
[2] UNESP Sao Paulo State Univ, Inst Biosci Botucatu, Dept Phys & Biophys, Botucatu, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
druggability; machine learning; systems biology; review; drug targets; sequence properties; structural properties; network topology; GENE-EXPRESSION; DRUG; TARGETS; DATABASE; IDENTIFICATION; DISCOVERY; RESOURCE;
D O I
10.3389/fphys.2015.00366
中图分类号
Q4 [生理学];
学科分类号
071003 ;
摘要
The emergence of -omics technologies has allowed the collection of vast amounts of data on biological systems. Although, the pace of such collection has been exponential, the impact of these data remains small on many critical biomedical applications such as drug development. Limited resources, high costs, and low hit-to-lead ratio have led researchers to search for more cost effective methodologies. A possible alternative is to incorporate computational methods of potential drug target prediction early during drug discovery workflow. Computational methods based on systems approaches have the advantage of taking into account the global properties of a molecule not limited to its sequence, structure or function. Machine learning techniques are powerful tools that can extract relevant information from massive and noisy data sets. In recent years the scientific community has explored the combined power of these fields to propose increasingly accurate and low cost methods to propose interesting drug targets. In this mini-review, we describe promising approaches based on the simultaneous use of systems biology and machine learning to access gene and protein druggability. Moreover, we discuss the state-of-the-art of this emerging and interdisciplinary field, discussing data sources, algorithms and the performance of the different methodologies. Finally, we indicate interesting avenues of research and some remaining open challenges.
引用
收藏
页数:7
相关论文
共 31 条
[1]   Pathguide: a Pathway Resource List [J].
Bader, Gary D. ;
Cary, Michael P. ;
Sander, Chris .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D504-D506
[2]   Properties and identification of human protein drug targets [J].
Bakheet, Tala M. ;
Doig, Andrew J. .
BIOINFORMATICS, 2009, 25 (04) :451-457
[3]   The ChEMBL bioactivity database: an update [J].
Bento, A. Patricia ;
Gaulton, Anna ;
Hersey, Anne ;
Bellis, Louisa J. ;
Chambers, Jon ;
Davies, Mark ;
Krueger, Felix A. ;
Light, Yvonne ;
Mak, Lora ;
McGlinchey, Shaun ;
Nowotka, Michal ;
Papadatos, George ;
Santos, Rita ;
Overington, John P. .
NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) :D1083-D1090
[4]  
Bolton EE, 2010, ANN REP COMP CHEM, V4, P217, DOI 10.1016/S1574-1400(08)00012-1
[5]   Tissue specificity and the human protein interaction network [J].
Bossi, Alice ;
Lehner, Ben .
MOLECULAR SYSTEMS BIOLOGY, 2009, 5
[6]   The BioGRID interaction database:: 2008 update [J].
Breitkreutz, Bobby-Joe ;
Stark, Chris ;
Reguly, Teresa ;
Boucher, Lorrie ;
Breitkreutz, Ashton ;
Livstone, Michael ;
Oughtred, Rose ;
Lackner, Daniel H. ;
Bahler, Jurg ;
Wood, Valerie ;
Dolinski, Kara ;
Tyers, Mike .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D637-D640
[7]  
Bureeva S, 2009, METHODS MOL BIOL, V563, P75, DOI 10.1007/978-1-60761-175-2_5
[8]   TTD: Therapeutic Target Database [J].
Chen, X ;
Ji, ZL ;
Chen, YZ .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :412-415
[9]   A machine learning approach for genome-wide prediction of morbid and druggable human genes based on systems-level data [J].
Costa, Pedro R. ;
Acencio, Marcio L. ;
Lemke, Ney .
BMC GENOMICS, 2010, 11
[10]   Structure and dynamics of molecular networks: A novel paradigm of drug discovery A comprehensive review [J].
Csermely, Peter ;
Korcsmaros, Tamas ;
Kiss, Huba J. M. ;
London, Gabor ;
Nussinov, Ruth .
PHARMACOLOGY & THERAPEUTICS, 2013, 138 (03) :333-408