A new method for identification of protein (sub)families in a set of proteins based on hydropathy distribution in proteins

被引:50
作者
Pánek, J
Eidhammer, I
Aasland, R
机构
[1] Univ Queensland, Inst Mol Biosci, Brisbane, Qld, Australia
[2] Acad Sci Czech Republ, Inst Microbiol, Videnska, Czech Republic
[3] Univ Bergen, Dept Informat, N-5008 Bergen, Norway
[4] Univ Bergen, Dept Mol Biol, N-5008 Bergen, Norway
关键词
hydropathy; hydropathy distribution; protein feature; protein family; protein subfamily;
D O I
10.1002/prot.20356
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Structural similarity among proteins is reflected in the distribution of hydropathicity along the amino acids in the protein sequence. Similarities in the hydropathy distributions are obvious for homologous proteins within a protein family. They also were observed for proteins with related structures, even when sequence similarities were undetectable. Here we present a novel method that employs the hydropathy distribution in proteins for identification of (sub)families in a set of (homologous) proteins. We represent proteins as points in a generalized hydropathy space, represented by vectors of specifically defined features. The features are derived from hydropathy of the individual amino acids. Projection of this space onto principal axes reveals groups of proteins with related hydropathy distributions. The groups identified correspond well to families of structurally and functionally related proteins. We found that this method accurately identifies protein families in a set of proteins, or subfamilies in a set of homologous proteins. Our results show that protein families can be identified by the analysis of hydropathy distribution, without the need for sequence alignment. (C) 2005 Wiley-Liss, Inc.
引用
收藏
页码:923 / 934
页数:12
相关论文
共 27 条
[1]   The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 [J].
Boeckmann, B ;
Bairoch, A ;
Apweiler, R ;
Blatter, MC ;
Estreicher, A ;
Gasteiger, E ;
Martin, MJ ;
Michoud, K ;
O'Donovan, C ;
Phan, I ;
Pilbout, S ;
Schneider, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :365-370
[2]   Molecular modeling and substrate specificity of discrete cruzipain-like and cathepsin L-like cysteine proteinases of the human blood fluke Schistosoma mansoni [J].
Brady, CP ;
Brinkworth, RI ;
Dalton, JP ;
Dowd, AJ ;
Verity, CK ;
Brindley, PJ .
ARCHIVES OF BIOCHEMISTRY AND BIOPHYSICS, 2000, 380 (01) :46-55
[3]   Recombinant expression and localization of Schistosoma mansoni cathepsin L1 support its role in the degradation of host hemoglobin [J].
Brady, CP ;
Dowd, AJ ;
Brindley, PJ ;
Ryan, T ;
Day, SR ;
Dalton, JP .
INFECTION AND IMMUNITY, 1999, 67 (01) :368-374
[4]   Proteolysis of human hemoglobin by schistosome cathepsin D [J].
Brindley, PJ ;
Kalinna, BH ;
Wong, JYM ;
Bogitsh, BJ ;
King, LT ;
Smyth, DJ ;
Verity, CK ;
Abbenante, G ;
Brinkworth, RI ;
Fairlie, DP ;
Smythe, ML ;
Milburn, PJ ;
Bielefeldt-Ohmann, H ;
Zheng, Y ;
McManus, DP .
MOLECULAR AND BIOCHEMICAL PARASITOLOGY, 2001, 112 (01) :103-112
[5]   A structural explanation for the twilight zone of protein sequence homology [J].
Chung, SY ;
Subbiah, S .
STRUCTURE, 1996, 4 (10) :1123-1127
[6]   Identification of novel membrane proteins by searching for patterns in hydropathy profiles [J].
Clements, JD ;
Martin, RE .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 2002, 269 (08) :2101-2107
[7]  
Creighton T.E., 1993, PROTEINS, VSecond
[8]   THE HYDROPHOBIC MOMENT DETECTS PERIODICITY IN PROTEIN HYDROPHOBICITY [J].
EISENBERG, D ;
WEISS, RM ;
TERWILLIGER, TC .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCES, 1984, 81 (01) :140-144
[9]   NICOTINAMIDE ADENINE-DINUCLEOTIDE BIOSYNTHESIS AND PYRIDINE-NUCLEOTIDE CYCLE METABOLISM IN MICROBIAL SYSTEMS [J].
FOSTER, JW ;
MOAT, AG .
MICROBIOLOGICAL REVIEWS, 1980, 44 (01) :83-105
[10]   Finding the right fold [J].
Goldenberg, DP .
NATURE STRUCTURAL BIOLOGY, 1999, 6 (11) :987-990