An Annotation Agnostic Algorithm for Detecting Nascent RNA Transcripts in GRO-Seq

被引:13
|
作者
Azofeifa, Joseph G. [1 ]
Allen, Mary A. [2 ]
Lladser, Manuel E. [3 ]
Dowell, Robin D. [2 ,4 ]
机构
[1] Univ Colorado, Dept Comp Sci, Boulder, CO 80309 USA
[2] Univ Colorado, BioFrontiers Inst, Boulder, CO 80309 USA
[3] Univ Colorado, Dept Appl Math, Boulder, CO 80309 USA
[4] Univ Colorado, Dept Mol Cellular & Dev Biol, Boulder, CO 80309 USA
关键词
GRO-seq; nascent transcription; logisitic regression; hidden Markov models; algorithms; experimentation; ELONGATION; INITIATION; ENHANCERS; ELEMENTS; MODELS; SITES; MAP;
D O I
10.1109/TCBB.2016.2520919
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
We present a fast and simple algorithm to detect nascent RNA transcription in global nuclear run-on sequencing (GRO-seq). GRO-seq is a relatively new protocol that captures nascent transcripts from actively engaged polymerase, providing a direct read-out on bona fide transcription. Most traditional assays, such as RNA-seq, measure steady state RNA levels which are affected by transcription, post-transcriptional processing, and RNA stability. GRO-seq data, however, presents unique analysis challenges that are only beginning to be addressed. Here, we describe a new algorithm, Fast Read Stitcher (FStitch), that takes advantage of two popular machine-learning techniques, hidden Markov models and logistic regression, to classify which regions of the genome are transcribed. Given a small user-defined training set, our algorithm is accurate, robust to varying read depth, annotation agnostic, and fast. Analysis of GRO-seq data without a priori need for annotation uncovers surprising new insights into several aspects of the transcription process.
引用
收藏
页码:1070 / 1081
页数:12
相关论文
共 14 条
  • [1] RNA polymerase II activity revealed by GRO-seq and pNET-seq in Arabidopsis
    Jiafu Zhu
    Min Liu
    Xiaobin Liu
    Zhicheng Dong
    Nature Plants, 2018, 4 : 1112 - 1123
  • [2] RNA polymerase II activity revealed by GRO-seq and pNET-seq in Arabidopsis
    Zhu, Jiafu
    Liu, Min
    Liu, Xiaobin
    Dong, Zhicheng
    NATURE PLANTS, 2018, 4 (12) : 1112 - 1123
  • [3] RNA Pol II transcription model and interpretation of GRO-seq data
    Manuel E. Lladser
    Joseph G. Azofeifa
    Mary A. Allen
    Robin D. Dowell
    Journal of Mathematical Biology, 2017, 74 : 77 - 97
  • [4] Protocol for affordable and efficient profiling of nascent RNAs in bread wheat using GRO-seq
    Chen, Yan
    Zhu, Jiafu
    Xie, Yilin
    Li, Zijuan
    Zhang, Yijing
    Liu, Min
    Dong, Zhicheng
    STAR PROTOCOLS, 2022, 3 (03):
  • [5] RNA Pol II transcription model and interpretation of GRO-seq data
    Lladser, Manuel E.
    Azofeifa, Joseph G.
    Allen, Mary A.
    Dowell, Robin D.
    JOURNAL OF MATHEMATICAL BIOLOGY, 2017, 74 (1-2) : 77 - 97
  • [6] Assessment of Gene Annotation Accuracy by Inferring Transcripts from RNA-Seq
    Martin, Jeffrey
    Zhu, Wenhan
    Bergman, Nicholas
    Borodovsky, Mark
    2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2009, : 54 - +
  • [7] Analysis of pro-inflammatory gene activation and RNA processing by RNA-Seq of nascent transcripts
    Bhatt, D.
    Pandya-Jones, A.
    Black, D.
    Smale, S.
    IMMUNOLOGY, 2011, 135 : 21 - 21
  • [8] EU-RNA-seq for in vivo labeling and high throughput sequencing of nascent transcripts
    Palozola, Katherine C.
    Donahue, Greg
    Zaret, Kenneth S.
    STAR PROTOCOLS, 2021, 2 (03):
  • [9] An Efficient Algorithm for Sensitively Detecting Circular RNA from RNA-seq Data
    Zhang, Xuanping
    Wang, Yidan
    Zhao, Zhongmeng
    Wang, Jiayin
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2018, 19 (10)
  • [10] SOAPfuse: an algorithm for identifying fusion transcripts from paired-end RNA-Seq data
    Jia, Wenlong
    Qiu, Kunlong
    He, Minghui
    Song, Pengfei
    Zhou, Quan
    Zhou, Feng
    Yu, Yuan
    Zhu, Dandan
    Nickerson, Michael L.
    Wan, Shengqing
    Liao, Xiangke
    Zhu, Xiaoqian
    Peng, Shaoliang
    Li, Yingrui
    Wang, Jun
    Guo, Guangwu
    GENOME BIOLOGY, 2013, 14 (02):