Depth dependent amino acid substitution matrices and their use in predicting deleterious mutations

被引:8
|
作者
Farheen, Nida [1 ]
Sen, Neeladri [1 ]
Nair, Sanjana [1 ]
Tan, Kuan Pern [2 ,3 ]
Madhusudhan, M. S. [1 ,2 ]
机构
[1] Indian Inst Sci Educ & Res, Pune 411008, Maharashtra, India
[2] Bioinformat Inst, 30 Biopolis St,07-01, Singapore 138671, Singapore
[3] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
基金
英国惠康基金;
关键词
Substitution matrix; Alignment; Depth; Deleterious mutation; SECONDARY STRUCTURE; PROTEIN-STRUCTURE; FOLD RECOGNITION; SOLVENT ACCESSIBILITY; ALIGNED SEQUENCES; ATOM DEPTH; WEB SERVER; ALIGNMENT; REPLACEMENT; STABILITY;
D O I
10.1016/j.pbiomolbio.2017.02.004
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The 20 naturally occurring amino acids have different environmental preferences of where they are likely to occur in protein structures. Environments in a protein can be classified by their proximity to solvent by the residue depth measure. Since the frequencies of amino acids are different at various depth levels, the substitution frequencies should vary according to depth. To quantify these substitution frequencies, we built depth dependent substitution matrices. The dataset used for creation of the matrices consisted of 3696 high quality, non redundant pairwise protein structural alignments. One of the applications of these matrices is to predict the tolerance of mutations in different protein environments. Using these substitution scores the prediction of deleterious mutations was done on 3500 mutations in T4 lysozyme and CcdB. The accuracy of the technique in terms of the Matthews Correlation Coefficient (MCC) is 0.48 on the CcdB testing set, while the best of the other tested methods has an MCC of 0.40. Further developments in these substitution matrices could help in improving structure-sequence alignment for protein 3D structure modeling. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:14 / 23
页数:10
相关论文
共 22 条