Seten: a tool for systematic identification and comparison of processes, phenotypes, and diseases associated with RNA-binding proteins from condition-specific CLIP-seq profiles

被引:8
作者
Budak, Gungor [1 ]
Srivastava, Rajneesh [1 ]
Janga, Sarath Chandra [1 ,2 ,3 ]
机构
[1] Indiana Univ Purdue Univ, Sch Informat & Comp, Dept Biohlth Informat, Indianapolis, IN 46202 USA
[2] Indiana Univ Sch Med, Ctr Computat Biol & Bioinformat, Indianapolis, IN 46202 USA
[3] Indiana Univ Sch Med, Dept Med & Mol Genet, Indianapolis, IN 46202 USA
关键词
RNA-binding proteins; CLIP (crosslinking and immunoprecipitation); gene set enrichment; functional enrichment; genotype-phenotype; post-transcriptional networks; UBIQUITIN LIGASE; BETA-TRCP; SITES; GENE; TRANSCRIPTOME; DDX6; DISCOVERY; ALIGNMENT; SEQUENCE; LINKING;
D O I
10.1261/rna.059089.116
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
RNA-binding proteins (RBPs) control the regulation of gene expression in eukaryotic genomes at post-transcriptional level by binding to their cognate RNAs. Although several variants of CLIP (crosslinking and immunoprecipitation) protocols are currently available to study the global protein RNA interaction landscape at single-nucleotide resolution in a cell, currently there are very few tools that can facilitate understanding and dissecting the functional associations of RBPs from the resulting binding maps. Here, we present Seten, a web-based and command line tool, which can identify and compare processes, phenotypes, and diseases associated with RBPs from condition-specific CLIP-seq profiles. Seten uses BED files resulting from most peak calling algorithms, which include scores reflecting the extent of binding of an RBP on the target transcript, to provide both traditional functional enrichment as well as gene set enrichment results for a number of gene set collections including BioCarta, KEGG, Reactome, Gene Ontology (GO), Human Phenotype Ontology (HPO), and MalaCards Disease Ontology for several organisms including fruit fly, human, mouse, rat, worm, and yeast. It also provides an option to dynamically compare the associated gene sets across data sets as bubble charts, to facilitate comparative analysis. Benchmarking of Seten using eCLIP data for IGF2BP1, SRSF7, and PTBP1 against their corresponding CRISPR RNA-seq in K562 cells as well as randomized negative controls, demonstrated that its gene set enrichment method outperforms functional enrichment, with scores significantly contributing to the discovery of true annotations. Comparative performance analysis using these CRISPR control data sets revealed significantly higher precision and comparable recall to that observed using ChIP-Enrich. Seten's web interface currently provides precomputed results for about 200 CLIP-seq data sets and both command line as well as web interfaces can be used to analyze CLIP-seq data sets. We highlight several examples to show the utility of Seten for rapid profiling of various CLIP-seq data sets. Seten is available on http://wwvv.iupuledu/similar to sysbio/seten/.
引用
收藏
页码:836 / 846
页数:11
相关论文
共 61 条
[1]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[2]   Heterogeneous nuclear ribonucleoproteins (hnRNPs) in cellular processes: Focus on hnRNP E1's multifunctional regulatory roles [J].
Chaudhury, Arindam ;
Chander, Praveen ;
Howe, Philip H. .
RNA, 2010, 16 (08) :1449-1462
[3]   PIPE-CLIP: a comprehensive online tool for CLIP-seq data analysis [J].
Chen, Beibei ;
Yun, Jonghyun ;
Kim, Min Soo ;
Mendell, Joshua T. ;
Xie, Yang .
GENOME BIOLOGY, 2014, 15 (01)
[4]   Sensitive and highly resolved identification of RNA-protein interaction sites in PAR-CLIP data [J].
Comoglio, Federico ;
Sievers, Cem ;
Paro, Renato .
BMC BIOINFORMATICS, 2015, 16
[5]   PARalyzer: definition of RNA binding sites from PAR-CLIP short-read sequence data [J].
Corcoran, David L. ;
Georgiev, Stoyan ;
Mukherjee, Neelanjan ;
Gottwein, Eva ;
Skalsky, Rebecca L. ;
Keene, Jack D. ;
Ohler, Uwe .
GENOME BIOLOGY, 2011, 12 (08)
[6]  
Croft D, 2014, NUCLEIC ACIDS RES, V42, pD472, DOI [10.1093/nar/gkt1102, 10.1093/nar/gkz1031]
[7]  
D'Agostino Y, 2017, BRIEF FUNCT GENOMICS, DOI [10.1093/bfgp/e1w038, DOI 10.1093/BFGP/E1W038]
[8]   HITS-CLIP: panoramic views of protein-RNA regulation in living cells [J].
Darnell, Robert B. .
WILEY INTERDISCIPLINARY REVIEWS-RNA, 2010, 1 (02) :266-286
[9]   Pseudosubstrate regulation of the SCFβ-TrCP ubiquitin ligase by hnRNP-U [J].
Davis, M ;
Hatzubai, A ;
Andersen, JS ;
Ben-Shushan, E ;
Fisher, GZ ;
Yaron, A ;
Bauskin, A ;
Mercurio, F ;
Mann, M ;
Ben-Neriah, Y .
GENES & DEVELOPMENT, 2002, 16 (04) :439-451
[10]   RIP-chip enrichment analysis [J].
Erhard, Florian ;
Doelken, Lars ;
Zimmer, Ralf .
BIOINFORMATICS, 2013, 29 (01) :77-83