BS-Seeker2: a versatile aligning pipeline for bisulfite sequencing data

Cited by: 286
Authors
Guo, Weilong [1, 2]
Fiziev, Petko [3 ]
Yan, Weihong [4 ]
Cokus, Shawn [2 ]
Sun, Xueguang [5 ]
Zhang, Michael Q. [1, 6]
Chen, Pao-Yang [7 ]
Pellegrini, Matteo [2, 8]
Affiliations
[1] Tsinghua Univ, TNLIST, Ctr Synthet & Syst Biol, Beijing 100084, Peoples R China
[2] Univ Calif Los Angeles, Dept Mol Cell & Dev Biol, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Dept Biol Chem, Los Angeles, CA 90095 USA
[4] Univ Calif Los Angeles, Dept Chem & Biochem, Los Angeles, CA 90095 USA
[5] Zymo Res Corp, Irvine, CA 92614 USA
[6] Univ Texas Dallas, Ctr Syst Biol, Dept Mol & Cell Biol, Richardson, TX 75080 USA
[7] Acad Sinica, Inst Plant & Microbial Biol, Taipei 11529, Taiwan
[8] Univ Calif Los Angeles, Inst Genom & Prote, Los Angeles, CA 90095 USA
Source
BMC GENOMICS | 2013 / Vol. 14
Keywords
DNA methylation; Bisulfite sequencing aligner; WGBS; RRBS; BS Seeker; Bisulfite conversion failure; Galaxy toolshed; ALIGNMENT; EFFICIENT; ACCURACY; IMPROVES;
DOI
10.1186/1471-2164-14-774
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Subject classification codes
071005; 0836; 090102; 100705
Abstract
Background: DNA methylation is an important epigenetic modification involved in many biological processes. Bisulfite treatment coupled with high-throughput sequencing provides an effective approach for studying genome-wide DNA methylation at base resolution. Library types such as whole genome bisulfite sequencing (WGBS) and reduced representation bisulfite sequencing (RRBS) are widely used for generating DNA methylomes, which demands efficient and versatile tools for aligning bisulfite sequencing data. Results: We have developed BS-Seeker2, an updated version of BS Seeker, as a full pipeline for mapping bisulfite sequencing data and generating DNA methylomes. BS-Seeker2 improves mappability over existing aligners by using local alignment. It can also map reads from RRBS libraries by building special indexes, with improved efficiency and accuracy. Moreover, BS-Seeker2 provides an additional function for filtering out reads with incomplete bisulfite conversion, which is useful for minimizing the overestimation of DNA methylation levels. We also defined the CGmap and ATCGmap file formats for full representations of DNA methylomes; these are part of the output of the BS-Seeker2 pipeline, together with BAM and WIG files. Conclusions: Our performance evaluations show that BS-Seeker2 works efficiently and accurately for both WGBS data and RRBS data. BS-Seeker2 is freely available at http://pellegrini.mcdb.ucla.edu/BS_Seeker2/ and on the Galaxy server.
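The abstract notes that reads with incomplete bisulfite conversion inflate methylation estimates. The following is a minimal illustrative sketch of why, not BS-Seeker2's actual implementation: the function names, the non-CpG heuristic, and the threshold `max_non_cpg_c` are all assumptions made for this example. It assumes reads from the strand where an unmethylated cytosine is read as T after conversion, and it assumes that cytosines outside CpG context are mostly unmethylated (typical of mammalian somatic cells, but not of plants).

```python
from typing import Optional


def count_unconverted_non_cpg(read: str) -> int:
    """Count cytosines that are followed by a base other than G.

    A trailing C is skipped because its sequence context is unknown.
    """
    read = read.upper()
    count = 0
    for i in range(len(read) - 1):
        if read[i] == "C" and read[i + 1] != "G":
            count += 1
    return count


def looks_unconverted(read: str, max_non_cpg_c: int = 3) -> bool:
    """Hypothetical filter: flag a read that keeps many non-CpG cytosines.

    The threshold is an illustrative assumption, not a published default.
    """
    return count_unconverted_non_cpg(read) > max_non_cpg_c


def methylation_level(c_count: int, t_count: int) -> Optional[float]:
    """Methylation level at one cytosine: C reads / (C reads + T reads).

    Returns None when the site has no coverage.
    """
    total = c_count + t_count
    if total == 0:
        return None
    return c_count / total


if __name__ == "__main__":
    reads = ["TTTATTTGTT", "CACTCACCAC", "ACGTCATCCG"]
    kept = [r for r in reads if not looks_unconverted(r)]
    print(kept)
    print(methylation_level(3, 1))
```

An unconverted read contributes a C at every cytosine it covers, so leaving it in raises `c_count` at each of those sites regardless of the true methylation state. Removing such reads before counting is what keeps the C / (C + T) ratio from being biased upward.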
Pages: 8