Construction and Analysis of Amino Acid Substitution Matrices for Optimal Alignment of Microbial Rhodopsin Sequences

被引:0
作者
Novoseletsky V.N. [1 ]
Armeev G.A. [1 ]
Shaitan K.V. [1 ]
机构
[1] Department of Biology, Moscow State University, Moscow
基金
俄罗斯基础研究基金会;
关键词
entropy; heliorhodopsin; homology search; microbial rhodopsins; sequence alignment; substitution matrix;
D O I
10.3103/S009639251901005X
中图分类号
学科分类号
摘要
Pairwise alignment of amino acid sequences is the basic tool of bioinformatics, which is widely used both independently and within numerous more complex methods. The effectiveness of this tool critically depends on the scoring function used, which consists of a substitution matrix and gap penalties. In this work, amino acid substitution matrices for the superfamily of microbial rhodopsins (RHOD) were constructed and analyzed and then compared with a set of general-purpose matrices (BLOSUM, VTML, PFASUM). It was shown that all matrices allow constructing alignments of microbial rhodopsin sequences of almost the same quality, but only BLOSUM and VTML matrices and their linear combinations with RHOD matrices allow revealing homology between microbial rhodopsins and heliorhodopsin. © 2019, Allerton Press, Inc.
引用
收藏
页码:21 / 25
页数:4
相关论文
共 24 条
  • [1] Govorunova E.G., Sineshchekov O.A., Li H., Spudich J.L., Microbial rhodopsins: Diversity, mechanisms, and optogenetic applications, Annu. Rev. Bio-chem., 86, 1, pp. 845-872, (2017)
  • [2] Pushkarev A., Inoue K., Larom S., Et al., A distinct abundant group of microbial rhodopsins discovered using functional metagenomics, Nature, 558, 7711, pp. 595-599, (2018)
  • [3] Needleman S.B., Wunsch C.D., A general method applicable to the search for similarities in the amino acid sequence of two proteins, J. Mol. Biol., 48, 3, pp. 443-453, (1970)
  • [4] Smith T.F., Waterman M.S., Identification of common molecular subsequences, J. Mol. Biol., 147, 1, pp. 195-197, (1981)
  • [5] Loytynoja A., Alignment methods: Strategies, challenges, benchmarking, and comparative overview, Evolutionary Genomics. Methods in Molecular Biology (Methods and Protocols), 855, pp. 203-235, (2012)
  • [6] Khan F.I., Wei D.Q., Gu K.R., Hassan M.I., Tabrez S., Current updates on computer aided protein modeling and designing, Int. J. Biol. Macromol., 85, pp. 48-62, (2016)
  • [7] Thompson J.D., Higgins D.G., Gibson T.J., Clustal W., Improving the sensitivity of progressive multiple sequence aligment through sequence weighting, position specific gap penalties and weight matrix choice, Nucleic Acid Res., 22, 22, pp. 4673-4680, (1994)
  • [8] Kuznetsov I.B., Protein sequence alignment with family-specific amino acid similarity matrices, BMC Res. Notes, 4, (2011)
  • [9] Henikoff S., Henikoff J.G., Amino acid substitution matrices from protein blocks, Proc. Natl. Acad. Sci. U.S.A., 89, 22, pp. 10915-10919, (1992)
  • [10] Muller T., Spang R., Vingron M., Estimating amino acid substitution models: A comparison of Day-hoff’s estimator, the resolvent approach and a maximum likelihood method, Mol. Biol. Evol., 19, 1, pp. 8-13, (2002)