BAMscale: quantification of next-generation sequencing peaks and generation of scaled coverage tracks

被引:36
作者
Pongor, Lorinc S. [1 ,2 ]
Gross, Jacob M. [1 ,2 ]
Vera Alvarez, Roberto [3 ]
Murai, Junko [1 ,2 ]
Jang, Sang-Min [1 ,2 ]
Zhang, Hongliang [1 ,2 ]
Redon, Christophe [1 ,2 ]
Fu, Haiqing [1 ,2 ]
Huang, Shar-Yin [1 ,2 ]
Thakur, Bhushan [1 ,2 ]
Baris, Adrian [1 ,2 ]
Marino-Ramirez, Leonardo [3 ]
Landsman, David [3 ]
Aladjem, Mirit I. [1 ,2 ]
Pommier, Yves [1 ,2 ]
机构
[1] NCI, Dev Therapeut Branch, Ctr Canc Res, NIH, 37 Convent Dr, Bethesda, MD 20892 USA
[2] NCI, Lab Mol Pharmacol, Ctr Canc Res, NIH, 37 Convent Dr, Bethesda, MD 20892 USA
[3] NIH, Computat Biol Branch, Natl Ctr Biotechnol Informat, NIH, 8600 Rockville Pike, Bethesda, MD 20892 USA
关键词
Histone modifications; Expression; ATAC-seq; ChIP-seq; NS-seq; Replication timing; Replication origins; RNA-seq; SLFN11; GENOME-WIDE; REPLICATION INITIATION; SEQ; LANDSCAPE; DISCOVERY; CHROMATIN; PLATFORM;
D O I
10.1186/s13072-020-00343-x
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background Next-generation sequencing allows genome-wide analysis of changes in chromatin states and gene expression. Data analysis of these increasingly used methods either requires multiple analysis steps, or extensive computational time. We sought to develop a tool for rapid quantification of sequencing peaks from diverse experimental sources and an efficient method to produce coverage tracks for accurate visualization that can be intuitively displayed and interpreted by experimentalists with minimal bioinformatics background. We demonstrate its strength and usability by integrating data from several types of sequencing approaches. Results We have developed BAMscale, a one-step tool that processes a wide set of sequencing datasets. To demonstrate the usefulness of BAMscale, we analyzed multiple sequencing datasets from chromatin immunoprecipitation sequencing data (ChIP-seq), chromatin state change data (assay for transposase-accessible chromatin using sequencing: ATAC-seq, DNA double-strand break mapping sequencing: END-seq), DNA replication data (Okazaki fragments sequencing: OK-seq, nascent-strand sequencing: NS-seq, single-cell replication timing sequencing: scRepli-seq) and RNA-seq data. The outputs consist of raw and normalized peak scores (multiple normalizations) in text format and scaled bigWig coverage tracks that are directly accessible to data visualization programs. BAMScale also includes a visualization module facilitating direct, on-demand quantitative peak comparisons that can be used by experimentalists. Our tool can effectively analyze large sequencing datasets (similar to 100 Gb size) in minutes, outperforming currently available tools. Conclusions BAMscale accurately quantifies and normalizes identified peaks directly from BAM files, and creates coverage tracks for visualization in genome browsers. BAMScale can be implemented for a wide set of methods for calculating coverage tracks, including ChIP-seq and ATAC-seq, as well as methods that currently require specialized, separate tools for analyses, such as splice-aware RNA-seq, END-seq and OK-seq for which no dedicated software is available. BAMscale is freely available on github ().
引用
收藏
页数:13
相关论文
共 52 条
[1]   TPMCalculator: one-step software to quantify mRNA abundance of genomic features [J].
Alvarez, Roberto Vera ;
Pongor, Lorinc Sandor ;
Marino-Ramirez, Leonardo ;
Landsman, David .
BIOINFORMATICS, 2019, 35 (11) :1960-1962
[2]   The mitochondrial type IB topoisomerase drives mitochondrial translation and carcinogenesis [J].
Baechler, S. A. ;
Factor, V. M. ;
Dalla Rosa, I ;
Ravji, A. ;
Becker, D. ;
Khiati, S. ;
Jenkins, L. M. Miller ;
Lang, M. ;
Sourbier, C. ;
Michaels, S. A. ;
Neckers, L. M. ;
Zhang, H. L. ;
Spinazzola, A. ;
Huang, S. N. ;
Marquardt, J. U. ;
Pommier, Y. .
NATURE COMMUNICATIONS, 2019, 10 (1)
[3]   Bivariate Genomic Footprinting Detects Changes in Transcription Factor Activity [J].
Baek, Songjoon ;
Goldstein, Ido ;
Hager, Gordon L. .
CELL REPORTS, 2017, 19 (08) :1710-1722
[4]   High-resolution profiling of histone methylations in the human genome [J].
Barski, Artern ;
Cuddapah, Suresh ;
Cui, Kairong ;
Roh, Tae-Young ;
Schones, Dustin E. ;
Wang, Zhibin ;
Wei, Gang ;
Chepelev, Iouri ;
Zhao, Keji .
CELL, 2007, 129 (04) :823-837
[5]  
Baruzzo G, 2017, NAT METHODS, V14, P135, DOI [10.1038/NMETH.4106, 10.1038/nmeth.4106]
[6]   MLL-Rearranged Leukemia Is Dependent on Aberrant H3K79 Methylation by DOT1L [J].
Bernt, Kathrin M. ;
Zhu, Nan ;
Sinha, Amit U. ;
Vempati, Sridhar ;
Faber, Joerg ;
Krivtsov, Andrei V. ;
Feng, Zhaohui ;
Punt, Natalie ;
Daigle, Amanda ;
Bullinger, Lars ;
Pollock, Roy M. ;
Richon, Victoria M. ;
Kung, Andrew L. ;
Armstrong, Scott A. .
CANCER CELL, 2011, 20 (01) :66-78
[7]   ASCL1 and NEUROD1 Reveal Heterogeneity in Pulmonary Neuroendocrine Tumors and Regulate Distinct Genetic Programs [J].
Borromeo, Mark D. ;
Savage, Trisha K. ;
Kollipara, Rahul K. ;
He, Min ;
Augustyn, Alexander ;
Osborne, Jihan K. ;
Girard, Luc ;
Minna, John D. ;
Gazdar, Adi F. ;
Cobb, Melanie H. ;
Johnson, Jane E. .
CELL REPORTS, 2016, 16 (05) :1259-1272
[8]   High-resolution mapping and characterization of open chromatin across the genome [J].
Boyle, Alan P. ;
Davis, Sean ;
Shulha, Hennady P. ;
Meltzer, Paul ;
Margulies, Elliott H. ;
Weng, Zhiping ;
Furey, Terrence S. ;
Crawford, Gregory E. .
CELL, 2008, 132 (02) :311-322
[9]  
Buenrostro Jason D, 2015, Curr Protoc Mol Biol, V109, DOI 10.1002/0471142727.mb2129s109
[10]   DNA Breaks and End Resection Measured Genome-wide by End Sequencing [J].
Canela, Andres ;
Sridharan, Sriram ;
Sciascia, Nicholas ;
Tubbs, Anthony ;
Meltzer, Paul ;
Sleckman, Barry P. ;
Nussenzweig, Andre .
MOLECULAR CELL, 2016, 63 (05) :898-911