dSreg: a Bayesian model to integrate changes in splicing and RNA-binding protein activity

被引:1
作者
Marti-Gomez, Carlos [1 ,2 ]
Lara-Pezzi, Enrique [1 ,2 ]
Sanchez-Cabo, Fatima [1 ,2 ]
机构
[1] Mol Regulat Heart Failure CMG & ELP, Madrid 28029, Spain
[2] Ctr Nacl Invest Cardiovasc CNIC, Bioinformat Unit FSC, Madrid 28029, Spain
关键词
EXPRESSION; SEQUENCE; GENE; QUANTIFICATION; REGULATOR; GENOME; IMPACT;
D O I
10.1093/bioinformatics/btz915
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Alternative splicing (AS) is an important mechanism in the generation of transcript diversity across mammals. AS patterns are dynamically regulated during development and in response to environmental changes. Defects or perturbations in its regulation may lead to cancer or neurological disorders, among other pathological conditions. The regulatory mechanisms controlling AS in a given biological context are typically inferred using a two-step framework: differential AS analysis followed by enrichment methods. These strategies require setting rather arbitrary thresholds and are prone to error propagation along the analysis. Results: To overcome these limitations, we propose dSreg, a Bayesian model that integrates RNA-seq with data from regulatory features, e.g. binding sites of RNA-binding proteins. dSreg identifies the key underlying regulators controlling AS changes and quantifies their activity while simultaneously estimating the changes in exon inclusion rates. dSreg increased both the sensitivity and the specificity of the identified AS changes in simulated data, even at low read coverage. dSreg also showed improved performance when analyzing a collection of knock-down RNA-binding proteins' experiments from ENCODE, as opposed to traditional enrichment methods, such as over-representation analysis and gene set enrichment analysis. dSreg opens the possibility to integrate a large amount of readily available RNA-seq datasets at low coverage for AS analysis and allows more cost-effective RNA-seq experiments.
引用
收藏
页码:2134 / 2141
页数:8
相关论文
共 54 条
[1]   Leveraging transcript quantification for fast computation of alternative splicing profiles [J].
Alamancos, Gael P. ;
Pages, Amadis ;
Trincado, Juan L. ;
Bellora, Nicolas ;
Eyras, Eduardo .
RNA, 2015, 21 (09) :1521-1531
[2]   Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning [J].
Alipanahi, Babak ;
Delong, Andrew ;
Weirauch, Matthew T. ;
Frey, Brendan J. .
NATURE BIOTECHNOLOGY, 2015, 33 (08) :831-+
[3]   Detecting differential usage of exons from RNA-seq data [J].
Anders, Simon ;
Reyes, Alejandro ;
Huber, Wolfgang .
GENOME RESEARCH, 2012, 22 (10) :2008-2017
[4]   Alternative splicing as a regulator of development and tissue identity [J].
Baralle, Francisco E. ;
Giudice, Jimena .
NATURE REVIEWS MOLECULAR CELL BIOLOGY, 2017, 18 (07) :437-451
[5]   Deciphering the splicing code [J].
Barash, Yoseph ;
Calarco, John A. ;
Gao, Weijun ;
Pan, Qun ;
Wang, Xinchen ;
Shai, Ofer ;
Blencowe, Benjamin J. ;
Frey, Brendan J. .
NATURE, 2010, 465 (7294) :53-59
[6]   The Evolutionary Landscape of Alternative Splicing in Vertebrate Species [J].
Barbosa-Morais, Nuno L. ;
Irimia, Manuel ;
Pan, Qun ;
Xiong, Hui Y. ;
Gueroussov, Serge ;
Lee, Leo J. ;
Slobodeniuc, Valentina ;
Kutter, Claudia ;
Watt, Stephen ;
Colak, Recep ;
Kim, TaeHyung ;
Misquitta-Ali, Christine M. ;
Wilson, Michael D. ;
Kim, Philip M. ;
Odom, Duncan T. ;
Frey, Brendan J. ;
Blencowe, Benjamin J. .
SCIENCE, 2012, 338 (6114) :1587-1593
[7]  
Betancourt M.GirolamiM., 2015, Current Trends in Bayesian Methodology with Applications, P79, DOI DOI 10.1201/B18502
[8]   Near-optimal probabilistic RNA-seq quantification (vol 34, pg 525, 2016) [J].
Bray, Nicolas L. ;
Pimentel, Harold ;
Melsted, Pall ;
Pachter, Lior .
NATURE BIOTECHNOLOGY, 2016, 34 (08) :888-888
[9]   Stan: A Probabilistic Programming Language [J].
Carpenter, Bob ;
Gelman, Andrew ;
Hoffman, Matthew D. ;
Lee, Daniel ;
Goodrich, Ben ;
Betancourt, Michael ;
Brubaker, Marcus A. ;
Guo, Jiqiang ;
Li, Peter ;
Riddell, Allen .
JOURNAL OF STATISTICAL SOFTWARE, 2017, 76 (01) :1-29
[10]  
Carvalho C. M., 2009, Artificial intelligence and statistics, V5, P73