Use of tetrapeptide signals for protein secondary-structure prediction

被引:40
作者
Feng, Yonge [1 ]
Luo, Liaofu [1 ]
机构
[1] Inner Mongolia Univ, Fac Sci & Technol, Lab Theoret Biophys, Hohhot 010021, Peoples R China
基金
美国国家科学基金会;
关键词
protein secondary-structure prediction; tetra-peptide structural words; increment of diversity; quadratic discriminant analysis; boundary correction; long-range interaction;
D O I
10.1007/s00726-008-0089-7
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
This paper develops a novel sequence-based method, tetra-peptide-based increment of diversity with quadratic discriminant analysis (TPIDQD for short), for protein secondary-structure prediction. The proposed TPIDQD method is based on tetra-peptide signals and is used to predict the structure of the central residue of a sequence fragment. The three-state overall per-residue accuracy (Q(3)) is about 80% in the threefold cross-validated test for 21-residue fragments in the CB513 dataset. The accuracy can be further improved by taking long-range sequence information (fragments of more than 21 residues) into account in prediction. The results show the tetra-peptide signals can indeed reflect some relationship between an amino acid's sequence and its secondary structure, indicating the importance of tetra-peptide signals as the protein folding code in the protein structure prediction.
引用
收藏
页码:607 / 614
页数:8
相关论文
共 85 条
[21]   PREDICTION OF PROTEIN CONFORMATION [J].
CHOU, PY ;
FASMAN, GD .
BIOCHEMISTRY, 1974, 13 (02) :222-245
[22]  
Cuff JA, 1999, PROTEINS, V34, P508, DOI 10.1002/(SICI)1097-0134(19990301)34:4<508::AID-PROT10>3.0.CO
[23]  
2-4
[24]   Using pseudo amino acid composition to predict transmembrane regions in protein: cellular automata and Lempel-Ziv complexity [J].
Diao, Y. ;
Ma, D. ;
Wen, Z. ;
Yin, J. ;
Xiang, J. ;
Li, M. .
AMINO ACIDS, 2008, 34 (01) :111-117
[25]   The community structure of human cellular signaling network [J].
Diao, Yuanbo ;
Li, Menglong ;
Fenga, Zinan ;
Yin, Jiajian ;
Pan, Yi .
JOURNAL OF THEORETICAL BIOLOGY, 2007, 247 (04) :608-615
[26]  
Ding YS, 2007, PROTEIN PEPTIDE LETT, V14, P811
[27]   Achieving 80% ten-fold cross-validated accuracy for secondary structure prediction by large-scale training [J].
Dor, Ofer ;
Zhou, Yaoqi .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2007, 66 (04) :838-845
[28]   Prediction of protein submitochondria locations by hybridizing pseudo-amino acid composition with various physicochemical features of segmented sequence [J].
Du, Pufeng ;
Li, Yanda .
BMC BIOINFORMATICS, 2006, 7 (1)
[29]   Predicting DNA-binding proteins: approached from Chou's pseudo amino acid composition and other specific sequence features [J].
Fang, Y. ;
Guo, Y. ;
Feng, Y. ;
Li, M. .
AMINO ACIDS, 2008, 34 (01) :103-109
[30]   Knowledge-based protein secondary structure assignment [J].
Frishman, D ;
Argos, P .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 23 (04) :566-579