Accurate assembly of circular RNAs with TERRACE

Authors
Zahin, Tasfia [1]
Shi, Qian [1, 2]
Zang, Xiaofei Carl
Shao, Mingfu [1, 2]
Affiliations
[1] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
[2] Penn State Univ, Huck Inst Life Sci, University Pk, PA 16802 USA
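Funding
National Science Foundation (USA); National Institutes of Health (USA)
Keywords
LANDSCAPE; ABUNDANT
DOI
10.1101/gr.279106.124
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract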
Circular RNAs (circRNAs) are a class of RNA molecules that form a closed loop with their 5' and 3' ends covalently bonded. CircRNAs are known to be more stable than linear RNAs, have distinct properties and functions, and are promising biomarkers. Existing methods for assembling circRNAs rely heavily on annotated transcriptomes and therefore exhibit unsatisfactory accuracy when a high-quality transcriptome is not available. We present TERRACE, a new algorithm for full-length assembly of circRNAs from paired-end total RNA-seq data. TERRACE uses the splice graph as the underlying data structure that organizes the splicing and coverage information. We transform the problem of assembling circRNAs into finding paths that "bridge" the three fragments in the splice graph induced by back-spliced reads. We adopt a definition for optimal bridging paths and a dynamic programming algorithm to calculate such optimal paths. TERRACE features an efficient algorithm to detect back-spliced reads missed by RNA-seq aligners, which contributes to its much-improved sensitivity. It also incorporates a new machine-learning approach trained to assign a confidence score to each assembled circRNA, which is shown to be superior to using abundance for scoring. On both simulations and biological data sets, TERRACE consistently outperforms existing methods by a large margin in sensitivity while achieving better or comparable precision. In particular, when annotations are not provided, TERRACE assembles 123%-413% more correct circRNAs than state-of-the-art methods. TERRACE presents a significant advance in assembling full-length circRNAs from RNA-seq data, and we expect it to be widely used in future research on circRNAs.
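The bridging step described in the abstract can be illustrated with a minimal sketch. The abstract does not state how an "optimal" bridging path is defined, so the code below assumes one plausible choice: the path whose weakest edge has the highest coverage (a bottleneck, or widest, path). It also assumes the splice graph is a directed acyclic graph whose vertices are numbered in topological order. The function name `best_bridging_path` and the toy edge weights are hypothetical and are not taken from TERRACE.

```python
from typing import Dict, List, Optional, Tuple

Edge = Tuple[int, int]


def best_bridging_path(
    num_vertices: int,
    weights: Dict[Edge, float],
    source: int,
    target: int,
) -> Optional[Tuple[List[int], float]]:
    """Return (path, bottleneck) for the widest source->target path, or None.

    Assumption: vertices 0..num_vertices-1 are in topological order, so every
    edge (u, v) satisfies u < v. Edge weights stand in for read coverage.
    """
    # Group outgoing edges by their tail vertex.
    out: Dict[int, List[Tuple[int, float]]] = {}
    for (u, v), w in weights.items():
        if u >= v:
            raise ValueError("edges must go from a lower to a higher index")
        out.setdefault(u, []).append((v, w))

    # best[v] is the largest bottleneck of any path source -> v (-1: unreachable).
    best: List[float] = [-1.0] * num_vertices
    prev: List[Optional[int]] = [None] * num_vertices
    best[source] = float("inf")

    # Dynamic programming: relax edges in topological order.
    for u in range(source, target):
        if best[u] < 0:
            continue
        for v, w in out.get(u, []):
            candidate = min(best[u], w)
            if candidate > best[v]:
                best[v] = candidate
                prev[v] = u

    if best[target] < 0:
        return None

    # Walk the predecessor links back from target to source.
    path = [target]
    while path[-1] != source:
        path.append(prev[path[-1]])
    path.reverse()
    return path, best[target]


# Toy splice graph (hypothetical coverage values).
toy_weights = {(0, 1): 10, (1, 3): 2, (0, 2): 5, (2, 3): 4, (3, 4): 7}
result = best_bridging_path(5, toy_weights, source=0, target=4)
```

In this toy graph the route through vertex 1 has a weakest edge of 2, while the route through vertex 2 has a weakest edge of 4, so the sketch selects the latter. A real assembler would also need to handle the constraints imposed by paired-end fragments and back-splice junctions, which this sketch does not model.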
Pages: 1365-1370
Page count: 6