Unipro UGENE NGS pipelines and components for variant calling, RNA-seq and ChIP-seq data analyses

被引:68
作者
Golosova, Olga [1 ]
Henderson, Ross [2 ]
Vaskin, Yuriy [1 ]
Gabrielian, Andrei [2 ]
Grekhov, German [1 ]
Nagarajan, Vijayaraj [2 ]
Oler, Andrew J. [2 ]
Nones, Mariam Qui [2 ]
Hurt, Darrell [2 ]
Fursov, Mikhail [1 ]
Huyen, Yentram [2 ]
机构
[1] Unipro Ctr Informat Technol, Novosibirsk, Russia
[2] NIAID, Bioinformat & Computat Biosci Branch, Off Cyber Infrastruct & Computat Biol, NIH, Bethesda, MD 20892 USA
关键词
Bioinformatics; Next-generation sequencing; Data analysis; ChIP-seq; Variant calling; RNA-seq; READ ALIGNMENT; WORKFLOWS; TOPHAT; GENE;
D O I
10.7717/peerj.644
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The advent of Next Generation Sequencing (NGS) technologies has opened new possibilities for researchers. However, the more biology becomes a data-intensive field, the more biologists have to learn how to process and analyze NGS data with complex computational tools. Even with the availability of common pipeline specifications, it is often a time-consuming and cumbersome task for a bench scientist to install and configure the pipeline tools. We believe that a unified, desktop and biologist-friendly front end to NGS data analysis tools will substantially improve productivity in this field. Here we present NGS pipelines "Variant Calling with SAMtools", "Tuxedo Pipeline for RNA-seq Data Analysis" and "Cistrome Pipeline for ChIP-seq Data Analysis" integrated into the Unipro UGENE desktop toolkit. We describe the available UGENE infrastructure that helps researchers run these pipelines on different datasets, store and investigate the results and re-run the pipelines with the same parameters. These pipeline tools are included in the UGENE NGS package. Individual blocks of these pipelines are also available for expert users to create their own advanced workflows.
引用
收藏
页数:15
相关论文
共 17 条
[11]   Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks [J].
Trapnell, Cole ;
Roberts, Adam ;
Goff, Loyal ;
Pertea, Geo ;
Kim, Daehwan ;
Kelley, David R. ;
Pimentel, Harold ;
Salzberg, Steven L. ;
Rinn, John L. ;
Pachter, Lior .
NATURE PROTOCOLS, 2012, 7 (03) :562-578
[12]   Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation [J].
Trapnell, Cole ;
Williams, Brian A. ;
Pertea, Geo ;
Mortazavi, Ali ;
Kwan, Gordon ;
van Baren, Marijke J. ;
Salzberg, Steven L. ;
Wold, Barbara J. ;
Pachter, Lior .
NATURE BIOTECHNOLOGY, 2010, 28 (05) :511-U174
[13]   TopHat: discovering splice junctions with RNA-Seq [J].
Trapnell, Cole ;
Pachter, Lior ;
Salzberg, Steven L. .
BIOINFORMATICS, 2009, 25 (09) :1105-1111
[14]   The Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloud [J].
Wolstencroft, Katherine ;
Haines, Robert ;
Fellows, Donal ;
Williams, Alan ;
Withers, David ;
Owen, Stuart ;
Soiland-Reyes, Stian ;
Dunlop, Ian ;
Nenadic, Aleksandra ;
Fisher, Paul ;
Bhagat, Jiten ;
Belhajjame, Khalid ;
Bacall, Finn ;
Hardisty, Alex ;
de la Hidalga, Abraham Nieva ;
Vargas, Maria P. Balcazar ;
Sufi, Shoaib ;
Goble, Carole .
NUCLEIC ACIDS RESEARCH, 2013, 41 (W1) :W557-W561
[15]   hPDI: a database of experimental human protein-DNA interactions [J].
Xie, Zhi ;
Hu, Shaohui ;
Blackshaw, Seth ;
Zhu, Heng ;
Qian, Jiang .
BIOINFORMATICS, 2010, 26 (02) :287-289
[16]   Model-based Analysis of ChIP-Seq (MACS) [J].
Zhang, Yong ;
Liu, Tao ;
Meyer, Clifford A. ;
Eeckhoute, Jerome ;
Johnson, David S. ;
Bernstein, Bradley E. ;
Nussbaum, Chad ;
Myers, Richard M. ;
Brown, Myles ;
Li, Wei ;
Liu, X. Shirley .
GENOME BIOLOGY, 2008, 9 (09)
[17]   High-resolution DNA-binding specificity analysis of yeast transcription factors [J].
Zhu, Cong ;
Byers, Kelsey J. R. P. ;
McCord, Rachel Patton ;
Shi, Zhenwei ;
Berger, Michael F. ;
Newburger, Daniel E. ;
Saulrieta, Katrina ;
Smith, Zachary ;
Shah, Mita V. ;
Radhakrishnan, Mathangi ;
Philippakis, Anthony A. ;
Hu, Yanhui ;
De Masi, Federico ;
Pacek, Marcin ;
Rolfs, Andreas ;
Murthy, Tal ;
LaBaer, Joshua ;
Bulyk, Martha L. .
GENOME RESEARCH, 2009, 19 (04) :556-566