TM-Aligner: Multiple sequence alignment tool for transmembrane proteins with reduced time and improved accuracy

被引:18
作者
Bhat, Basharat [1 ]
Ganai, Nazir A. [2 ]
Andrabi, Syed Mudasir [2 ]
Shah, Riaz A. [2 ]
Singh, Ashutosh [1 ]
机构
[1] Shiv Nadar Univ, Dept Life Sci, Greater Noida 201314, UP, India
[2] Sher E Kashmir Univ Agr Sci & Technol, Dept Anim Biotechnol, Shuhama 190016, Jammu & Kashmir, India
关键词
DATABASE; STRATEGY;
D O I
10.1038/s41598-017-13083-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Membrane proteins plays significant role in living cells. Transmembrane proteins are estimated to constitute approximately 30% of proteins at genomic scale. It has been a difficult task to develop specific alignment tools for transmembrane proteins due to limited number of experimentally validated protein structures. Alignment tools based on homology modeling provide fairly good result by recapitulating 70-80% residues in reference alignment provided all input sequences should have known template structures. However, homology modeling tools took substantial amount of time, thus aligning large numbers of sequences becomes computationally demanding. Here we present TM-Aligner, a new tool for transmembrane protein sequence alignment. TM-Aligner is based on Wu-Manber and dynamic string matching algorithm which has significantly improved its accuracy and speed of multiple sequence alignment. We compared TM-Aligner with prevailing other popular tools and performed benchmarking using three separate reference sets, BaliBASE3.0 reference set7 of alpha-helical transmembrane proteins, structure based alignment of transmembrane proteins from Pfam database and structure alignment from GPCRDB. Benchmarking against reference datasets indicated that TM-Aligner is more advanced method having least turnaround time with significant improvements over the most accurate methods such as PROMALS, MAFFT, TM-Coffee, Kalign, ClustalW, Muscle and PRALINE. TM-Aligner is freely available through http://lms.snu.edu.in/TM-Aligner/.
引用
收藏
页数:8
相关论文
共 22 条
[1]  
[Anonymous], 2002, MOLECULAR BIOL CELL
[2]   Biophysical approaches to membrane protein structure determination [J].
Arora, A ;
Tamm, LK .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2001, 11 (05) :540-547
[3]   BAliBASE (Benchmark Alignment dataBASE): enhancements for repeats, transmembrane sequences and circular permutations [J].
Bahr, A ;
Thompson, JD ;
Thierry, JC ;
Poch, O .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :323-326
[4]   NEW ALIGNMENT STRATEGY FOR TRANSMEMBRANE PROTEINS [J].
CSERZO, M ;
BERNASSAU, JM ;
SIMON, I ;
MAIGRET, B .
JOURNAL OF MOLECULAR BIOLOGY, 1994, 243 (03) :388-396
[5]  
Durbin R., 1998, BIOL SEQUENCE ANAL P
[6]   MUSCLE: multiple sequence alignment with high accuracy and high throughput [J].
Edgar, RC .
NUCLEIC ACIDS RESEARCH, 2004, 32 (05) :1792-1797
[7]   The Pfam protein families database: towards a more sustainable future [J].
Finn, Robert D. ;
Coggill, Penelope ;
Eberhardt, Ruth Y. ;
Eddy, Sean R. ;
Mistry, Jaina ;
Mitchell, Alex L. ;
Potter, Simon C. ;
Punta, Marco ;
Qureshi, Matloob ;
Sangrador-Vegas, Amaia ;
Salazar, Gustavo A. ;
Tate, John ;
Bateman, Alex .
NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) :D279-D285
[8]  
Floden E. W, 2016, NUCLEIC ACIDS RES
[9]   GPCRdb: an information system for G protein-coupled receptors [J].
Isberg, Vignir ;
Mordalski, Stefan ;
Munk, Christian ;
Rataj, Krzysztof ;
Harpsoe, Kasper ;
Hauser, Alexander S. ;
Vroling, Bas ;
Bojarski, Andrzej J. ;
Vriend, Gert ;
Gloriam, David E. .
NUCLEIC ACIDS RESEARCH, 2016, 44 (D1) :D356-D364
[10]   Recent developments in the MAFFT multiple sequence alignment program [J].
Katoh, Kazutaka ;
Toh, Hiroyuki .
BRIEFINGS IN BIOINFORMATICS, 2008, 9 (04) :286-298