SAliBASE: A Database of Simulated Protein Alignments

被引:2
|
作者
Pervez, Muhammad Tariq [1 ]
Shah, Hayat Ali [2 ]
Babar, Masroor Ellahi [3 ]
Naveed, Nasir [2 ]
Shoaib, Muhammad [4 ]
机构
[1] Virtual Univ Pakistan, Dept Bioinformat & Computat Biol, 54 Lawrence Rd, Lahore 54000, Pakistan
[2] Virtual Univ Pakistan, Dept Comp Sci, Lahore, Pakistan
[3] Virtual Univ Pakistan, Dept Biotechnol, Lahore, Pakistan
[4] UET, Dept Comp Sci & Engn, Lahore, Pakistan
来源
关键词
SAliBASE; simulated alignment; true alignment; MULTIPLE SEQUENCE ALIGNMENT;
D O I
10.1177/1176934318821080
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Simulated alignments are alternatives to manually constructed multiple sequence alignments for evaluating performance of multiple sequence alignment tools. The importance of simulated sequences is recognized because their true evolutionary history is known, which is very helpful for reconstructing accurate phylogenetic trees and alignments. However, generating simulated alignments require expertise to use bioinformatics tools and consume several hours for reconstructing even a few hundreds of simulated sequences. It becomes a tedious job for an end user who needs a few datasets of variety of simulated sequences. Currently, there is no databank available which may help researchers to download simulated sequences/alignments for their study. Major focus of our study was to develop a database of simulated protein sequences (SAliBASE) based on different varying parameters such as insertion rate, deletion rate, sequence length, number of sequences, and indel size. Each dataset has corresponding alignment as well. This repository is very useful for evaluating multiple alignment methods.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] TOPOFIT-DB, a database of protein structural alignments based on the TOPOFIT method
    Leslin, Chesley M.
    Abyzov, Alexej
    Ilyin, Valentin A.
    NUCLEIC ACIDS RESEARCH, 2007, 35 : D317 - D321
  • [22] SEQUENCE DATABASE SEARCHES AND SEQUENCE ALIGNMENTS
    WATERMAN, M
    FASEB JOURNAL, 1994, 8 (07): : A1260 - A1260
  • [23] Creation of a Database Including a Set of Biological Features Relaed to Protein Sequences and their Corresponding Alignments
    Ortuno, F.
    Pomares, H.
    Rojas, I.
    Valenzuela, O.
    2014 IEEE 27TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2014, : 557 - +
  • [24] DbClustal: rapid and reliable global multiple alignments of protein sequences detected by database searches
    Thompson, JD
    Plewniak, F
    Thierry, JC
    Poch, O
    NUCLEIC ACIDS RESEARCH, 2000, 28 (15) : 2919 - 2926
  • [25] SALSA: improved protein database searching by a new algorithm for assembly of sequence fragments into gapped alignments
    Rognes, T
    Seeberg, E
    BIOINFORMATICS, 1998, 14 (10) : 839 - 845
  • [26] GALA, a database for genomic sequence alignments and annotations
    Giardine, B
    Elnitski, L
    Riemer, C
    Makalowska, L
    Schwartz, S
    Miller, W
    Hardison, RC
    GENOME RESEARCH, 2003, 13 (04) : 732 - 741
  • [27] PASS2: A semi-automated database of Protein Alignments Organised as Structural Superfamilies
    Mallika, V
    Bhaduri, A
    Sowdhamini, R
    NUCLEIC ACIDS RESEARCH, 2002, 30 (01) : 284 - 288
  • [28] POLYMORPHISMS AND SEQUENCE ALIGNMENTS IN THE GENOME SEQUENCE DATABASE
    MCLEOD, M
    THAYER, N
    HARGER, C
    MARCH, S
    MANNING, M
    TROUP, C
    POWER, A
    RIDER, D
    CROWLEY, D
    SMYTH, L
    KEEN, G
    FIELDS, C
    AMERICAN JOURNAL OF HUMAN GENETICS, 1995, 57 (04) : 1535 - 1535
  • [29] A kinase sequence database: sequence alignments and family assignment
    Buzko, O
    Shokat, KM
    BIOINFORMATICS, 2002, 18 (09) : 1274 - 1275
  • [30] CDD: a curated Entrez database of conserved domain alignments
    Marchler-Bauer, A
    Anderson, JB
    DeWeese-Scott, C
    Fedorova, ND
    Geer, LY
    He, SQ
    Hurwitz, DI
    Jackson, JD
    Jacobs, AR
    Lanczycki, CJ
    Liebert, CA
    Liu, CL
    Madej, T
    Marchler, GH
    Mazumder, R
    Nikolskaya, AN
    Panchenko, AR
    Rao, BS
    Shoemaker, BA
    Simonyan, V
    Song, JS
    Thiessen, PA
    Vasudevan, S
    Wang, YL
    Yamashita, RA
    Yin, JJ
    Bryant, SH
    NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 383 - 387