A Bayesian Assignment Method for Ambiguous Bisulfite Short Reads

被引:1
作者
Tran, Hong [1 ]
Wu, Xiaowei [2 ]
Tithi, Saima [1 ]
Sun, Ming-An [3 ]
Xie, Hehuang [3 ,4 ]
Zhang, Liqing [1 ]
机构
[1] Virginia Polytech Inst & State Univ, Dept Comp Sci, Virginia Tech, Blacksburg, VA 24061 USA
[2] Virginia Polytech Inst & State Univ, Dept Stat, Virginia Tech, Blacksburg, VA 24061 USA
[3] Virginia Polytech Inst & State Univ, Virginia Bioinformat Inst, Virginia Tech, Blacksburg, VA 24061 USA
[4] Virginia Polytech Inst & State Univ, Dept Biol Sci, Virginia Tech, Blacksburg, VA 24061 USA
关键词
DNA METHYLATION; ALIGNMENT;
D O I
10.1371/journal.pone.0151826
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
DNA methylation is an epigenetic modification critical for normal development and diseases. The determination of genome-wide DNA methylation at single-nucleotide resolution is made possible by sequencing bisulfite treated DNA with next generation high-throughput sequencing. However, aligning bisulfite short reads to a reference genome remains challenging as only a limited proportion of them (around 50-70%) can be aligned uniquely; a significant proportion, known as multireads, are mapped to multiple locations and thus discarded from downstream analyses, causing financial waste and biased methylation inference. To address this issue, we develop a Bayesian model that assigns multireads to their most likely locations based on the posterior probability derived from information hidden in uniquely aligned reads. Analyses of both simulated data and real hairpin bisulfite sequencing data show that our method can effectively assign approximately 70% of the multireads to their best locations with up to 90% accuracy, leading to a significant increase in the overall mapping efficiency. Moreover, the assignment model shows robust performance with low coverage depth, making it particularly attractive considering the prohibitive cost of bisulfite sequencing. Additionally, results show that longer reads help improve the performance of the assignment model. The assignment model is also robust to varying degrees of methylation and varying sequencing error rates. Finally, incorporating prior knowledge on mutation rate and context specific methylation level into the assignment model increases inference accuracy. The assignment model is implemented in the BAM-ABS package and freely available at https://github.com/zhanglabvt/BAM_ABS.
引用
收藏
页数:17
相关论文
共 22 条
[1]   Comparison of alignment software for genome-wide bisulphite sequence data [J].
Chatterjee, Aniruddha ;
Stockwell, Peter A. ;
Rodger, Euan J. ;
Morison, Ian M. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (10) :e79
[2]   Shotgun bisulphite sequencing of the Arabidopsis genome reveals DNA methylation patterning [J].
Cokus, Shawn J. ;
Feng, Suhua ;
Zhang, Xiaoyu ;
Chen, Zugen ;
Merriman, Barry ;
Haudenschild, Christian D. ;
Pradhan, Sriharsa ;
Nelson, Stanley F. ;
Pellegrini, Matteo ;
Jacobsen, Steven E. .
NATURE, 2008, 452 (7184) :215-219
[3]   DNA methylation and cancer [J].
Das, PM ;
Singal, R .
JOURNAL OF CLINICAL ONCOLOGY, 2004, 22 (22) :4632-4642
[4]   A framework for variation discovery and genotyping using next-generation DNA sequencing data [J].
DePristo, Mark A. ;
Banks, Eric ;
Poplin, Ryan ;
Garimella, Kiran V. ;
Maguire, Jared R. ;
Hartl, Christopher ;
Philippakis, Anthony A. ;
del Angel, Guillermo ;
Rivas, Manuel A. ;
Hanna, Matt ;
McKenna, Aaron ;
Fennell, Tim J. ;
Kernytsky, Andrew M. ;
Sivachenko, Andrey Y. ;
Cibulskis, Kristian ;
Gabriel, Stacey B. ;
Altshuler, David ;
Daly, Mark J. .
NATURE GENETICS, 2011, 43 (05) :491-+
[5]   Distribution, recognition and regulation of non-CpG methylation in the adult mammalian brain [J].
Guo, Junjie U. ;
Su, Yijing ;
Shin, Joo Heon ;
Shin, Jaehoon ;
Li, Hongda ;
Xie, Bin ;
Zhong, Chun ;
Hu, Shaohui ;
Le, Thuc ;
Fan, Guoping ;
Zhu, Heng ;
Chang, Qiang ;
Gao, Yuan ;
Ming, Guo-li ;
Song, Hongjun .
NATURE NEUROSCIENCE, 2014, 17 (02) :215-222
[6]   BS-Seeker2: a versatile aligning pipeline for bisulfite sequencing data [J].
Guo, Weilong ;
Fiziev, Petko ;
Yan, Weihong ;
Cokus, Shawn ;
Sun, Xueguang ;
Zhang, Michael Q. ;
Chen, Pao-Yang ;
Pellegrini, Matteo .
BMC GENOMICS, 2013, 14
[7]   Increased methylation variation in epigenetic domains across cancer types [J].
Hansen, Kasper Daniel ;
Timp, Winston ;
Bravo, Hector Corrada ;
Sabunciyan, Sarven ;
Langmead, Benjamin ;
McDonald, Oliver G. ;
Wen, Bo ;
Wu, Hao ;
Liu, Yun ;
Diep, Dinh ;
Briem, Eirikur ;
Zhang, Kun ;
Irizarry, Rafael A. ;
Feinberg, Andrew P. .
NATURE GENETICS, 2011, 43 (08) :768-U77
[8]   The role of DNA methylation in mammalian epigenetics [J].
Jones, PA ;
Takai, D .
SCIENCE, 2001, 293 (5532) :1068-1070
[9]   Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications [J].
Krueger, Felix ;
Andrews, Simon R. .
BIOINFORMATICS, 2011, 27 (11) :1571-1572
[10]   Hairpin-bisulfite PCR:: Assessing epigenetic methylation patterns on complementary strands of individual DNA molecules [J].
Laird, CD ;
Pleasant, ND ;
Clark, AD ;
Sneeden, JL ;
Hassan, KMA ;
Manley, NC ;
Vary, JC ;
Morgan, T ;
Hansen, RS ;
Stöger, R .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (01) :204-209