Context-specific amino acid substitution matrices and their use in the detection of protein homologs

被引:17
作者
Goonesekere, Nalin C. W. [1 ,2 ]
Lee, Byungkook [1 ]
机构
[1] NIH, Natl Canc Inst, Ctr Canc Res, Mol Biol Lab, Bethesda, MD 20892 USA
[2] Univ No Iowa, Dept Chem & Biochem, Cedar Falls, IA 50614 USA
关键词
CSSM; score matrix; homology detection; sequence alignment; environment polarity; fold recognition;
D O I
10.1002/prot.21775
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The sequence homology detection relies on score matrices, which reflect the frequency of amino acid substitutions observed in a dataset of homologous sequences. The substitution matrices in popular use today are usually constructed without consideration of the structural context in which the substitution takes place. Here, we present amino acid substitution matrices specific for particular polar-nonpolar environment of the amino acid. As expected, these matrices [context-specific substitution matrices (CSSMs)] show striking differences from the popular BLOSUM62 matrix, which does not include structural information. When incorporated into BLAST and PSI-BLAST, CSSM outperformed BLOSUM matrices as assessed by ROC curve analyses of the number of true and false hits and by the accuracy of the sequence alignments to the hit sequences. These findings are also of relevance to profile-profile-based methods of homology detection, since CSSMs may help build a better profile. Profiles generated for protein sequences in PDB using CSSM-PSI-BLAST will be made available for searching via RPSBLAST through our web site http://lmbbi.nci.nih.gov/.
引用
收藏
页码:910 / 919
页数:10
相关论文
共 14 条
  • [1] Discriminative modelling of context-specific amino acid substitution probabilities
    Angermueller, Christof
    Biegert, Andreas
    Soeding, Johannes
    BIOINFORMATICS, 2012, 28 (24) : 3240 - 3247
  • [2] Amino acid substitution scoring matrices specific to intrinsically disordered regions in proteins
    Trivedi, Rakesh
    Nagarajaram, Hampapathalu Adimurthy
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [3] Depth dependent amino acid substitution matrices and their use in predicting deleterious mutations
    Farheen, Nida
    Sen, Neeladri
    Nair, Sanjana
    Tan, Kuan Pern
    Madhusudhan, M. S.
    PROGRESS IN BIOPHYSICS & MOLECULAR BIOLOGY, 2017, 128 : 14 - 23
  • [4] Optimizing amino acid substitution matrices with a local alignment kernel
    Hiroto Saigo
    Jean-Philippe Vert
    Tatsuya Akutsu
    BMC Bioinformatics, 7
  • [5] Ideal amino acid exchange forms for approximating substitution matrices
    Pokarowski, Piotr
    Kloczkowski, Andrzej
    Nowakowski, Szymon
    Pokarowska, Maria
    Jernigan, Robert L.
    Kolinski, Andrzej
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2007, 69 (02) : 379 - 393
  • [6] Revisiting amino acid substitution matrices for identifying distantly related proteins
    Yamada, Kazunori
    Tomii, Kentaro
    BIOINFORMATICS, 2014, 30 (03) : 317 - 325
  • [7] A methodology for determining amino-acid substitution matrices from set covers
    Porto, Alexandre H. L.
    Barbosa, Valmir C.
    APPLICATIONS OF EVOLUTIONARY COMPUTING, PROCEEDINGS, 2006, 3907 : 138 - 148
  • [8] ENVIRONMENT-SPECIFIC AMINO-ACID SUBSTITUTION TABLES - TERTIARY TEMPLATES AND PREDICTION OF PROTEIN FOLDS
    OVERINGTON, J
    DONNELLY, D
    JOHNSON, MS
    SALI, A
    BLUNDELL, TL
    PROTEIN SCIENCE, 1992, 1 (02) : 216 - 226
  • [9] Amino acid substitution matrices based on 4-body Delaunay contact profiles
    Sacan, Ahmet
    Toroslu, I. Hakki
    PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, : 796 - 801
  • [10] Construction and Analysis of Amino Acid Substitution Matrices for Optimal Alignment of Microbial Rhodopsin Sequences
    Novoseletsky V.N.
    Armeev G.A.
    Shaitan K.V.
    Moscow University Biological Sciences Bulletin, 2019, 74 (1) : 21 - 25