Identification of related proteins with weak sequence identity using secondary structure information

被引:46
|
作者
Geourjon, C [1 ]
Combet, C [1 ]
Blanchet, C [1 ]
Deléage, G [1 ]
机构
[1] Inst Biol & Chim Prot, CNRS, UMR 5086, Pole Bioinformat Lyonnais, F-69367 Lyon 07, France
关键词
protein; molecular modeling; sequence; databank; alignment; structure prediction; secondary structure; Web server;
D O I
10.1110/ps.30001
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Molecular modeling of proteins is confronted with the problem of finding homologous proteins, especially when few identities remain after the process of molecular evolution. Using even the most recent methods based on sequence identity detection, structural relationships are still difficult to establish with high reliability. As protein structures are more conserved than sequences, we investigated the possibility of using protein secondary structure comparison (observed or predicted structures) to discriminate between related and unrelated proteins sequences in the range of 10%-30% sequence identity. Pairwise comparison of secondary structures have been measured using the structural overlap (Sov) parameter. In this article, we show that if the secondary structures likeness is >50%, most of the pairs are structurally related. Taking into account the secondary structures of proteins that have been detected by BLAST, FASTA, or SSEARCH in the noisy region (with high E value), we show that distantly related protein sequences (even with <20% identity) can be still identified. This strategy can be used to identify three-dimensional templates in homology modeling by finding unexpected related proteins and to select proteins for experimental investigation in a structural genomic approach, as well as for genome annotation.
引用
收藏
页码:788 / 797
页数:12
相关论文
共 50 条
  • [1] Secondary structure assignment of proteins in the absence of sequence information
    Khalife, Sammy
    Malliavin, Therese
    Liberti, Leo
    BIOINFORMATICS ADVANCES, 2021, 1 (01):
  • [2] Improved identification of outer membrane beta barrel proteins using primary sequence, predicted secondary structure, and evolutionary information
    Mizianty, Marcin J.
    Kurgan, Lukasz
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2011, 79 (01) : 294 - 303
  • [3] Fold recognition using sequence and secondary structure information
    Koretke, KK
    Russell, RB
    Copley, RR
    Lupas, AN
    PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1999, : 141 - 148
  • [4] Identification of novel DNA repair proteins via primary sequence, secondary structure, and homology
    Brown, J. B.
    Akutsu, Tatsuya
    BMC BIOINFORMATICS, 2009, 10
  • [5] Identification of novel DNA repair proteins via primary sequence, secondary structure, and homology
    JB Brown
    Tatsuya Akutsu
    BMC Bioinformatics, 10
  • [6] IDENTIFICATION OF SHORT TURN MOTIFS IN PROTEINS USING SEQUENCE AND STRUCTURE FINGERPRINTS
    WINTJENS, RT
    ROOMAN, MJ
    WODAK, SJ
    ISRAEL JOURNAL OF CHEMISTRY, 1994, 34 (02) : 257 - 269
  • [7] A computational model to identify fertility-related proteins using sequence information
    Lin, Yan
    Wang, Jiashu
    Liu, Xiaowei
    Xie, Xueqin
    Wu, De
    Zhang, Junjie
    Ding, Hui
    FRONTIERS OF COMPUTER SCIENCE, 2024, 18 (01)
  • [8] A statistical analytical approach to predict the secondary structure of proteins from amino acid sequence information
    Tiwari, S
    Reddy, BVB
    THEORETICAL CHEMISTRY ACCOUNTS, 1999, 101 (1-3) : 41 - 45
  • [9] A statistical analytical approach to predict the secondary structure of proteins from amino acid sequence information
    Shrish Tiwari
    Boojala V. B. Reddy
    Theoretical Chemistry Accounts, 1999, 101 : 41 - 45
  • [10] A MULTIPLE SEQUENCE ALIGNMENT ALGORITHM FOR HOMOLOGOUS PROTEINS USING SECONDARY STRUCTURE INFORMATION AND OPTIONALLY KEYING ALIGNMENTS TO FUNCTIONALLY IMPORTANT SITES
    HENNEKE, CM
    COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1989, 5 (02): : 141 - 150