Leveraging cross-link modification events in CLIP-seq for motif discovery

被引:29
作者
Bahrami-Samani, Emad [1 ]
Penalva, Luiz O. F. [2 ,3 ]
Smith, Andrew D. [1 ]
Uren, Philip J. [1 ]
机构
[1] Univ So Calif, Los Angeles, CA 90089 USA
[2] Univ Texas Hlth Sci Ctr San Antonio, Childrens Canc Res Inst, San Antonio, TX 78229 USA
[3] Univ Texas Hlth Sci Ctr San Antonio, Dept Cellular & Struct Biol, San Antonio, TX 78229 USA
基金
美国国家卫生研究院;
关键词
RNA-BINDING PROTEIN; GENOME-WIDE ANALYSIS; PAR-CLIP; NUCLEOTIDE-RESOLUTION; SECONDARY STRUCTURE; EXPECTATION MAXIMIZATION; PARTITION-FUNCTION; INTERACTION SITES; MESSENGER-RNAS; STEM-CELLS;
D O I
10.1093/nar/gku1288
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
High-throughput protein-RNA interaction data generated by CLIP-seq has provided an unprecedented depth of access to the activities of RNA-binding proteins (RBPs), the key players in co- and post-transcriptional regulation of gene expression. Motif discovery forms part of the necessary follow-up data analysis for CLIP-seq, both to refine the exact locations of RBP binding sites, and to characterize them. The specific properties of RBP binding sites, and the CLIP-seq methods, provide additional information not usually present in the classic motif discovery problem: the binding site structure, and cross-linking induced events in reads. We show that CLIP-seq data contains clear secondary structure signals, as well as technology- and RBP-specific cross-link signals. We introduce Zagros, a motif discovery algorithm specifically designed to leverage this information and explore its impact on the quality of recovered motifs. Our results indicate that using both secondary structure and cross-link modifications can greatly improve motif discovery on CLIP-seq data. Further, the motifs we recover provide insight into the balance between sequence- and structure-specificity struck by RBP binding.
引用
收藏
页码:95 / 103
页数:9
相关论文
共 45 条
[1]  
BAILEY TL, 1995, MACH LEARN, V21, P51, DOI 10.1007/BF00993379
[2]   Argonaute HITS-CLIP decodes microRNA-mRNA interaction maps [J].
Chi, Sung Wook ;
Zang, Julie B. ;
Mele, Aldo ;
Darnell, Robert B. .
NATURE, 2009, 460 (7254) :479-486
[3]   PARalyzer: definition of RNA binding sites from PAR-CLIP short-read sequence data [J].
Corcoran, David L. ;
Georgiev, Stoyan ;
Mukherjee, Neelanjan ;
Gottwein, Eva ;
Skalsky, Rebecca L. ;
Keene, Jack D. ;
Ohler, Uwe .
GENOME BIOLOGY, 2011, 12 (08)
[4]   Advancing the functional utility of PAR-CLIP by quantifying background binding to mRNAs and lncRNAs [J].
Friedersdorf, Matthew B. ;
Keene, Jack D. .
GENOME BIOLOGY, 2014, 15 (01)
[5]   Identification of protein binding sites on U3 snoRNA and pre-rRNA by UV cross-linking and high-throughput analysis of cDNAs [J].
Granneman, Sander ;
Kudla, Grzegorz ;
Petfalski, Elisabeth ;
Tollervey, David .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (24) :9613-9618
[6]   Transcriptome-wide Identification of RNA-Binding Protein and MicroRNA Target Sites by PAR-CLIP [J].
Hafner, Markus ;
Landthaler, Markus ;
Burger, Lukas ;
Khorshid, Mohsen ;
Hausser, Jean ;
Berninger, Philipp ;
Rothballer, Andrea ;
Ascano, Manuel, Jr. ;
Jungkamp, Anna-Carina ;
Munschauer, Mathias ;
Ulrich, Alexander ;
Wardle, Greg S. ;
Dewell, Scott ;
Zavolan, Mihaela ;
Tuschl, Thomas .
CELL, 2010, 141 (01) :129-141
[7]   Integrative Genome-wide Analysis Reveals Cooperative Regulation of Alternative Splicing by hnRNP Proteins [J].
Huelga, Stephanie C. ;
Vu, Anthony Q. ;
Arnold, Justin D. ;
Liang, Tiffany Y. ;
Liu, Patrick P. ;
Yan, Bernice Y. ;
Donohue, John Paul ;
Shiue, Lily ;
Hoon, Shawn ;
Brenner, Sydney ;
Ares, Manuel, Jr. ;
Yeo, Gene W. .
CELL REPORTS, 2012, 1 (02) :167-178
[8]   Analysis and design of RNA sequencing experiments for identifying isoform regulation [J].
Katz, Yarden ;
Wang, Eric T. ;
Airoldi, Edoardo M. ;
Burge, Christopher B. .
NATURE METHODS, 2010, 7 (12) :1009-U101
[9]   RNAcontext: A New Method for Learning the Sequence and Structure Binding Preferences of RNA-Binding Proteins [J].
Kazan, Hilal ;
Ray, Debashish ;
Chan, Esther T. ;
Hughes, Timothy R. ;
Morris, Quaid .
PLOS COMPUTATIONAL BIOLOGY, 2010, 6 (07) :28
[10]   Distinct binding properties of TIAR RRMs and linker region [J].
Kim, Henry S. ;
Headey, Stephen J. ;
Yoga, Yano M. K. ;
Scanlon, Martin J. ;
Gorospe, Myriam ;
Wilce, Matthew C. J. ;
Wilce, Jacqueline A. .
RNA BIOLOGY, 2013, 10 (04) :579-589