Unipro UGENE NGS pipelines and components for variant calling, RNA-seq and ChIP-seq data analyses

Times cited: 67
Authors
Golosova, Olga [1]
Henderson, Ross [2]
Vaskin, Yuriy [1]
Gabrielian, Andrei [2]
Grekhov, German [1]
Nagarajan, Vijayaraj [2]
Oler, Andrew J. [2]
Quiñones, Mariam [2]
Hurt, Darrell [2]
Fursov, Mikhail [1]
Huyen, Yentram [2]
Affiliations
[1] Unipro Ctr Informat Technol, Novosibirsk, Russia
[2] NIAID, Bioinformat & Computat Biosci Branch, Off Cyber Infrastruct & Computat Biol, NIH, Bethesda, MD 20892 USA
Source
PEERJ | 2014 / Volume 2
Keywords
Bioinformatics; Next-generation sequencing; Data analysis; ChIP-seq; Variant calling; RNA-seq; READ ALIGNMENT; WORKFLOWS; TOPHAT; GENE;
DOI
10.7717/peerj.644
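Chinese Library Classification number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code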
07 ; 0710 ; 09 ;
Abstract
The advent of Next Generation Sequencing (NGS) technologies has opened new possibilities for researchers. However, the more biology becomes a data-intensive field, the more biologists have to learn how to process and analyze NGS data with complex computational tools. Even with the availability of common pipeline specifications, it is often a time-consuming and cumbersome task for a bench scientist to install and configure the pipeline tools. We believe that a unified, desktop and biologist-friendly front end to NGS data analysis tools will substantially improve productivity in this field. Here we present NGS pipelines "Variant Calling with SAMtools", "Tuxedo Pipeline for RNA-seq Data Analysis" and "Cistrome Pipeline for ChIP-seq Data Analysis" integrated into the Unipro UGENE desktop toolkit. We describe the available UGENE infrastructure that helps researchers run these pipelines on different datasets, store and investigate the results and re-run the pipelines with the same parameters. These pipeline tools are included in the UGENE NGS package. Individual blocks of these pipelines are also available for expert users to create their own advanced workflows.
Pages: 15