A comprehensive review and comparison of different computational methods for protein remote homology detection

Cited by: 98
Authors
Chen, Junjie [1]
Guo, Mingyue [1]
Wang, Xiaolong [1]
Liu, Bin [1]
Affiliations
[1] Harbin Inst Technol, Shenzhen Grad Sch, Sch Comp Sci & Technol, Shenzhen, Peoples R China
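Funding
National Natural Science Foundation of China; National High Technology Research and Development Program of China (863 Program);
Keywords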
protein remote homology detection; protein structure and function; alignment methods; discriminative methods; ranking methods; AMINO-ACID-COMPOSITION; HIDDEN MARKOV-MODELS; SEQUENCE SIMILARITY; WEB SERVER; PSI-BLAST; STRUCTURAL CLASSIFICATION; EVOLUTIONARY INFORMATION; FOLD RECOGNITION; RANDOM FOREST; PROFILE;
DOI
10.1093/bib/bbw108
Chinese Library Classification number
Q5 [Biochemistry];
Subject classification codes
071010 ; 081704 ;
Abstract
Protein remote homology detection is one of the most fundamental problems in the study of protein structures and functions; it aims to detect distant evolutionary relationships among proteins via computational methods. During the past decades, many computational approaches have been proposed for this task, and they have contributed substantially to protein remote homology detection. A comprehensive review and comparison of these computational methods is therefore needed. In this article, we divide these computational approaches into three categories: alignment methods, discriminative methods and ranking methods. Their advantages and disadvantages are discussed from a comprehensive perspective, and their performance is compared on widely used benchmark data sets. Finally, some open questions in this field are explored and discussed.
Pages: 231-244
Page count: 14