Protein secondary structure prediction using sequence profile and conserved domain profile

Cited by: 0
Authors
Woo, SK [1]
Park, CB
Lee, SW
Affiliations
[1] Korea Univ, Dept Bioinformat, Seoul 136713, South Korea
[2] Korea Univ, Dept Comp Sci & Engn, Seoul 136713, South Korea
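Source
ADVANCES IN INTELLIGENT COMPUTING, PT 2, PROCEEDINGS | 2005 / Vol. 3645
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract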
In this paper, we propose a novel method for protein secondary structure prediction that uses both a sequence profile and a conserved domain profile. Sequence profiles generated by PSI-BLAST (position-specific iterated BLAST) have been widely used in protein secondary structure prediction because PSI-BLAST performs well at detecting remote homology. Conserved domains retain functional and structural information about related proteins, so remote homology information can also be drawn from conserved domains using RPS-BLAST (reverse position-specific BLAST). We combine the sequence profile and the conserved domain profile to capture more remote homology information, and we use the combined profile to predict protein secondary structure. To verify the effectiveness of the proposed method, we implemented a protein secondary structure prediction system. Overall prediction accuracy reached 75.9% on the RS126 data set. Incorporating conserved domain information improved accuracy by more than 3%, which shows that the proposed method can significantly improve the accuracy of protein secondary structure prediction.
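The abstract does not say how the two profiles are combined or which classifier consumes them. As a minimal sketch only, assuming each profile is a list of 20 scores per residue and that "combining" means concatenating the two rows for each residue, the per-residue input to a window-based predictor might be built as follows. The function names `combine_profiles` and `window_features`, the zero padding at the sequence ends, and the window size are all hypothetical choices for illustration, not details taken from the paper.

```python
from typing import List

# One row of scores per residue (e.g. 20 values, one per amino acid type).
Profile = List[List[float]]


def combine_profiles(seq_profile: Profile, domain_profile: Profile) -> Profile:
    """Concatenate the rows of two equal-length profiles, residue by residue.

    Assumption: concatenation is only one plausible way to combine a
    PSI-BLAST profile with an RPS-BLAST profile; the abstract does not
    specify the actual scheme.
    """
    if len(seq_profile) != len(domain_profile):
        raise ValueError("profiles must cover the same number of residues")
    return [s + d for s, d in zip(seq_profile, domain_profile)]


def window_features(profile: Profile, center: int, half_width: int) -> List[float]:
    """Flatten the rows in a window around `center` into one feature vector.

    Positions that fall outside the sequence are padded with zeros.
    """
    row_width = len(profile[0])
    features: List[float] = []
    for i in range(center - half_width, center + half_width + 1):
        if 0 <= i < len(profile):
            features.extend(profile[i])
        else:
            features.extend([0.0] * row_width)
    return features


# Toy example: a 3-residue protein with made-up profile values.
seq_profile = [[1.0] * 20, [2.0] * 20, [3.0] * 20]
domain_profile = [[0.5] * 20, [0.5] * 20, [0.5] * 20]
combined = combine_profiles(seq_profile, domain_profile)
first_residue_input = window_features(combined, center=0, half_width=1)
```

Under these assumptions each residue is described by 40 values, and a window of three residues yields a 120-value vector that could be passed to any standard classifier to predict helix, strand, or coil.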
Pages: 1 / 10
Page count: 10
Related papers
10 items in total
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
Bourne P.E., 2003, STRUCTURAL BIOINFORM
[3]  
Cuff JA, 1999, PROTEINS, V34, P508, DOI 10.1002/(SICI)1097-0134(19990301)34:4<508::AID-PROT10>3.0.CO;2-4
[5]   A novel method of protein secondary structure prediction with high segment overlap measure: Support vector machine approach [J].
Hua, SJ ;
Sun, ZR .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 308 (02) :397-407
[6]   Protein secondary structure prediction based on position-specific scoring matrices [J].
Jones, DT .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 292 (02) :195-202
[7]   CDD: a conserved domain database for protein classification [J].
Marchler-Bauer, A ;
Anderson, JB ;
Cherukuri, PF ;
DeWeese-Scott, C ;
Geer, LY ;
Gwadz, M ;
He, SQ ;
Hurwitz, DI ;
Jackson, JD ;
Ke, ZX ;
Lanczycki, CJ ;
Liebert, CA ;
Liu, CL ;
Lu, F ;
Marchler, GH ;
Mullokandov, M ;
Shoemaker, BA ;
Simonyan, V ;
Song, JS ;
Thiessen, PA ;
Yamashita, RA ;
Yin, JJ ;
Zhang, DC ;
Bryant, SH .
NUCLEIC ACIDS RESEARCH, 2005, 33 :D192-D196
[8]  
Nguyen Minh N, 2003, Genome Inform, V14, P218
[9]   Alignments grow, secondary structure prediction improves [J].
Przybylski, D ;
Rost, B .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 46 (02) :197-205
[10]   PREDICTION OF PROTEIN SECONDARY STRUCTURE AT BETTER THAN 70-PERCENT ACCURACY [J].
ROST, B ;
SANDER, C .
JOURNAL OF MOLECULAR BIOLOGY, 1993, 232 (02) :584-599