Mixture models and wavelet transforms reveal high confidence RNA-protein interaction sites in MOV10 PAR-CLIP data

被引:63
作者
Sievers, Cem [1 ]
Schlumpf, Tommy [1 ]
Sawarkar, Ritwick [1 ]
Comoglio, Federico [1 ]
Paro, Renato [1 ,2 ]
机构
[1] Swiss Fed Inst Technol Zurich, Dept Biosyst Sci & Engn, CH-4058 Basel, Switzerland
[2] Univ Basel, Fac Sci, CH-4056 Basel, Switzerland
关键词
BINDING PROTEIN; SEQUENCES;
D O I
10.1093/nar/gks697
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The Photo-Activatable Ribonucleoside-enhanced CrossLinking and ImmunoPrecipitation (PAR-CLIP) method was recently developed for global identification of RNAs interacting with proteins. The strength of this versatile method results from induction of specific T to C transitions at sites of interaction. However, current analytical tools do not distinguish between non-experimentally and experimentally induced transitions. Furthermore, geometric properties at potential binding sites are not taken into account. To surmount these shortcomings, we developed a two-step algorithm consisting of a non-parametric two-component mixture model and a wavelet-based peak calling procedure. Our algorithm can reduce the number of false positives up to 24% thereby identifying high confidence interaction sites. We successfully employed this approach in conjunction with a modified PAR-CLIP protocol to study the functional role of nuclear Moloney leukemia virus 10, a putative RNA helicase interacting with Argonaute2 and Polycomb. Our method, available as the R package wavClusteR, is generally applicable to any substitution-based inference problem in genomics.
引用
收藏
页数:11
相关论文
共 28 条
[1]  
[Anonymous], 2011, R: A Language and Environment for Statistical Computing
[2]  
[Anonymous], 1992, CBMS-NSF Reg. Conf. Ser. in Appl. Math
[3]  
Bailey T L, 1994, Proc Int Conf Intell Syst Mol Biol, V2, P28
[4]  
Bishop C.M., 2006, J ELECTRON IMAGING, V16, P049901, DOI DOI 10.1117/1.2819119
[5]  
Chendrimada TP, 2007, NATURE, V447, P823, DOI 10.1038/nature05841
[6]   PARalyzer: definition of RNA binding sites from PAR-CLIP short-read sequence data [J].
Corcoran, David L. ;
Georgiev, Stoyan ;
Mukherjee, Neelanjan ;
Gottwein, Eva ;
Skalsky, Rebecca L. ;
Keene, Jack D. ;
Ohler, Uwe .
GENOME BIOLOGY, 2011, 12 (08)
[7]   Improved peak detection in mass spectrum by incorporating continuous wavelet transform-based pattern matching [J].
Du, Pan ;
Kibbe, Warren A. ;
Lin, Simon M. .
BIOINFORMATICS, 2006, 22 (17) :2059-2065
[8]  
Efron B, 1986, STAT SCI, V1, DOI DOI 10.1214/SS/1177013815
[9]   Small silencing RNAs: an expanding universe [J].
Ghildiyal, Megha ;
Zamore, Phillip D. .
NATURE REVIEWS GENETICS, 2009, 10 (02) :94-108
[10]   miRBase: microRNA sequences, targets and gene nomenclature [J].
Griffiths-Jones, Sam ;
Grocock, Russell J. ;
van Dongen, Stijn ;
Bateman, Alex ;
Enright, Anton J. .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D140-D144