BETAWRAP:: Successful prediction of parallel β-helices from primary sequence reveals an association with many microbial pathogens

被引:94
作者
Bradley, P
Cowen, L
Menke, M
King, J
Berger, B
机构
[1] MIT, Dept Math, Cambridge, MA 02139 USA
[2] MIT, Comp Sci Lab, Cambridge, MA 02139 USA
[3] MIT, Dept Biol, Cambridge, MA 02139 USA
[4] Tufts Univ, Dept Elect Engn & Comp Sci, Medford, MA 02155 USA
关键词
D O I
10.1073/pnas.251267298
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The amino acid sequence rules that specify beta -sheet structure in proteins remain obscure. A subclass of beta -sheet proteins, parallel beta -helices, represent a processive folding of the chain into an elongated topologically simpler fold than globular beta -sheets. In this paper, we present a computational approach that predicts the right-handed parallel beta -helix supersecondary structural motif in primary amino acid sequences by using beta -strand interactions learned from non-beta -helix structures. A program called BETAWRAP (http://theory.Ics.mit.edu/betawrap) implements this method and recognizes each of the seven known parallel beta -helix families, when trained on the known parallel beta -helices from outside that family. BETAWRAP identifies 2,448 sequences among 595,890 screened from the National Center for Biotechnology Information (NCBI; http://www.ncbi.nlm.nih.gov/) nonredundant protein database as likely parallel beta -helices. It identifies surprisingly many bacterial and fungal protein sequences that play a role in human infectious disease; these include toxins, virulence factors, adhesins, and surface proteins of Chlamydia, Helicobacteria, Bordetella, Leishmania, Borrelia, Rickettsia, Neisseria, and Bacillus anthracis. Also unexpected was the rarity of the parallel beta -helix fold and its predicted sequences among higher eukaryotes. The computational method introduced here can be called a three-dimensional dynamic profile method because it generates interstrand pairwise correlations from a processive sequence wrap. Such methods may be applicable to recognizing other beta structures for which strand topology and profiles of residue accessibility are well conserved.
引用
收藏
页码:14819 / 14824
页数:6
相关论文
共 29 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
[3]  
Berger B, 1995, J Comput Biol, V2, P125, DOI 10.1089/cmb.1995.2.125
[4]   PREDICTING COILED COILS BY USE SF PAIRWISE RESIDUE CORRELATIONS [J].
BERGER, B ;
WILSON, DB ;
WOLF, E ;
TONCHEV, T ;
MILLA, M ;
KIM, PS .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1995, 92 (18) :8259-8263
[5]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[6]   Profile hidden Markov models [J].
Eddy, SR .
BIOINFORMATICS, 1998, 14 (09) :755-763
[7]  
ENGELMAN D, 1999, ANNU REV BIOPHYS BIO, V15, P321
[8]   Knowledge-based protein secondary structure assignment [J].
Frishman, D ;
Argos, P .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 23 (04) :566-579
[9]  
Garnier J, 1996, METHOD ENZYMOL, V266, P540
[10]   β-helix structure and ice-binding properties of a hyperactive antifreeze protein from an insect [J].
Graether, SP ;
Kuiper, MJ ;
Gagné, SM ;
Walker, VK ;
Jia, ZC ;
Sykes, BD ;
Davies, PL .
NATURE, 2000, 406 (6793) :325-328