BLASTGrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data

被引:6
|
作者
Neumann, Ralf Stefan [1 ,2 ]
Kumar, Surendra [1 ,2 ]
Haverkamp, Thomas Hendricus Augustus [3 ]
Shalchian-Tabrizi, Kamran [1 ,2 ]
机构
[1] Univ Oslo, Sect Genet & Evolutionary Biol EVOGENE, Oslo, Norway
[2] Univ Oslo, CEDE, Oslo, Norway
[3] Univ Oslo, Dept Biosci, Ctr Ecol & Evolutionary Synth, Oslo, Norway
来源
BMC BIOINFORMATICS | 2014年 / 15卷
关键词
Analysis; BLAST; High-throughput; Taxonomy; Text-mining; Visualization; DISCOVERY; TAXONOMY; OUTPUT;
D O I
10.1186/1471-2105-15-128
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Advances in sequencing efficiency have vastly increased the sizes of biological sequence databases, including many thousands of genome-sequenced species. The BLAST algorithm remains the main search engine for retrieving sequence information, and must consequently handle data on an unprecedented scale. This has been possible due to high-performance computers and parallel processing. However, the raw BLAST output from contemporary searches involving thousands of queries becomes ill-suited for direct human processing. Few programs attempt to directly visualize and interpret BLAST output; those that do often provide a mere basic structuring of BLAST data. Results: Here we present a bioinformatics application named BLASTGrabber suitable for high-throughput sequencing analysis. BLASTGrabber, being implemented as a Java application, is OS-independent and includes a user friendly graphical user interface. Text or XML-formatted BLAST output files can be directly imported, displayed and categorized based on BLAST statistics. Query names and FASTA headers can be analysed by text-mining. In addition to visualizing sequence alignments, BLAST data can be ordered as an interactive taxonomy tree. All modes of analysis support selection, export and storage of data. A Java interface-based plugin structure facilitates the addition of customized third party functionality. Conclusion: The BLASTGrabber application introduces new ways of visualizing and analysing massive BLAST output data by integrating taxonomy identification, text mining capabilities and generic multi-dimensional rendering of BLAST hits. The program aims at a non-expert audience in terms of computer skills; the combination of new functionalities makes the program flexible and useful for a broad range of operations.
引用
收藏
页数:11
相关论文
共 19 条
  • [1] BLASTGrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data
    Ralf Stefan Neumann
    Surendra Kumar
    Thomas Hendricus Augustus Haverkamp
    Kamran Shalchian-Tabrizi
    BMC Bioinformatics, 15
  • [2] BARD: A visualization tool for biological sequence analysis
    Spell, R
    Brady, R
    Dietrich, F
    INFOVIS 2002: IEEE SYMPOSIUM ON INFORMATION VISUALIZATION 2003, PROCEEDINGS, 2003, : 219 - 225
  • [3] Scalable Visualization and Interactive Analysis using Massive Data Streams
    Pascucci, Valerio
    Bremer, Peer-Timo
    Gyulassy, Attila
    Scorzelli, Giorgio
    Christensen, Cameron
    Summa, Brian
    Kumar, Sidharth
    CLOUD COMPUTING AND BIG DATA, 2013, 23 : 212 - 230
  • [4] A visualization tool for exploratory analysis of cyclic multivariate data
    Matthew O. Ward
    Benjamin N. Lipchak
    Metrika, 2000, 51 : 27 - 37
  • [5] A visualization tool for exploratory analysis of cyclic multivariate data
    Ward, MO
    Lipchak, BN
    METRIKA, 2000, 51 (01) : 27 - 37
  • [6] MGAviewer: a desktop visualization tool for analysis of metagenomics alignment data
    Zhu, Zhengwei
    Niu, Beifang
    Chen, Jing
    Wu, Sitao
    Sun, Shulei
    Li, Weizhong
    BIOINFORMATICS, 2013, 29 (01) : 122 - 123
  • [7] EyeClouds: A Visualization and Analysis Tool for Exploring Eye Movement Data
    Burch, Michael
    Veneri, Alberto
    Sun, Bangjie
    PROCEEDINGS OF THE 12TH INTERNATIONAL SYMPOSIUM ON VISUAL INFORMATION COMMUNICATION AND INTERACTION, VINCI 2019, 2019,
  • [8] BiomiX, a user-friendly bioinformatic tool for democratized analysis and integration of multiomics data
    Iperi, Cristian
    Fernandez-Ochoa, Alvaro
    Barturen, Guillermo
    Pers, Jacques-Olivier
    Foulquier, Nathan
    Bettacchioli, Eleonore
    Alarcon-Riquelme, Marta
    PRECISESADS Flow Cytometry Study Grp, Divi
    PRECISESADS Clinical Consortium, Anne
    Cornec, Divi
    Bordron, Anne
    Jamin, Christophe
    BMC BIOINFORMATICS, 2025, 26 (01):
  • [9] SyncPro : A Synchronized Visualization Tool for Differential Analysis of Proteomics Data Sets
    Yung, Ling Sing
    Yang, Chao
    Dakna, Mohammed
    Mischak, Harald
    Yu, Weichuan
    2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2010, : 95 - 100
  • [10] Development of a data visualization and analysis tool to study the particle dynamics in the coastal zone
    Kitsiou, D
    De Madron, XD
    Arnau, PA
    MARINE POLLUTION BULLETIN, 2001, 43 (7-12) : 262 - 269