Evaluating the bias of circRNA predictions from total RNA-Seq data

被引:8
作者
Wang, Jinzeng [1 ,2 ]
Liu, Kang [1 ]
Liu, Ya [1 ]
Lv, Qi [1 ]
Zhang, Fan [1 ,3 ]
Wang, Haiyun [1 ]
机构
[1] Tongji Univ, Sch Life Sci & Technol, Shanghai 200092, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Med, Rui Jin Hosp, Natl Res Ctr Translat Med Shanghai, Shanghai 200025, Peoples R China
[3] Tongji Univ, Sch Med, Shanghai Pulm Hosp, Clin Translat Res Ctr, Shanghai 200433, Peoples R China
基金
中国国家自然科学基金;
关键词
circular RNA; circRNA predictions; total RNA-Seq; CIRI; KNIFE; CIRCULAR RNA; REVEALS;
D O I
10.18632/oncotarget.22972
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
CircRNAs are a group of endogenous noncoding RNAs. The quickly developing high throughput RNA sequencing technologies along with novel bioinformatics approaches have enabled researchers to systematically identify circRNAs and their biological functions in cells. Deep sequencing of rRNA-depleted RNAs treated with RNase R, which digests linear RNAs and leaves circRNAs enriched, is an efficient way to identify circRNAs. However, very few of RNase R treated data are at hand but a large amount of total RNA-Seq data with no sequencing costs is available, for circRNA predictions. In this study, we systematically investigated the prediction bias from total RNA-Seq data as well as the influence of sequencing depth, sequencing quality and single-end or paired-end sequencing strategy on the predictions. We also identified circRNA properties that may contribute to the improved prediction performance. Our analysis shows that circRNA predictions from total RNA-Seq data gain similar to 50% true positive. Sequencing error dramatically worsens the predictions, rather than single-end sequencing strategy or low sequencing depth. However, false positive can be carefully controlled by using data with good quality and narrowing down circRNAs guided by their properties.
引用
收藏
页码:110914 / 110921
页数:8
相关论文
共 50 条
  • [1] Acfs: accurate circRNA identification and quantification from RNA-Seq data
    You, Xintian
    Conrad, Tim O. F.
    SCIENTIFIC REPORTS, 2016, 6
  • [2] CIRCexplorer pipelines for circRNA annotation and quantification from non-polyadenylated RNA-seq datasets
    Ma, Xu-Kai
    Xue, Wei
    Chen, Ling-Ling
    Yang, Li
    METHODS, 2021, 196 : 3 - 10
  • [3] Comprehensive comparison of two types of algorithm for circRNA detection from short-read RNA-Seq
    Liu, Hongfei
    Akhatayeva, Zhanerke
    Pan, Chuanying
    Liao, Mingzhi
    Lan, Xianyong
    BIOINFORMATICS, 2022, 38 (11) : 3037 - 3043
  • [4] Comprehensive Analysis of RNA-Seq in Endometriosis Reveals Competing Endogenous RNA Network Composed of circRNA, lncRNA and mRNA
    Yin, Meichen
    Zhai, Lingyun
    Wang, Jianzhang
    Yu, Qin
    Li, Tiantian
    Xu, Xinxin
    Guo, Xinyue
    Mao, Xinqi
    Zhou, Jianwei
    Zhang, Xinmei
    FRONTIERS IN GENETICS, 2022, 13
  • [5] Deep annotation of long noncoding RNAs by assembling RNA-seq and small RNA-seq data
    Zhang, Jiaming
    Hou, Weibo
    Zhao, Qi
    Xiao, Songling
    Linghu, Hongye
    Zhang, Lixin
    Du, Jiawei
    Cui, Hongdi
    Yang, Xu
    Ling, Shukuan
    Su, Jianzhong
    Kong, Qingran
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2023, 299 (09)
  • [6] Improving RNA-Seq expression estimates by correcting for fragment bias
    Roberts, Adam
    Trapnell, Cole
    Donaghey, Julie
    Rinn, John L.
    Pachter, Lior
    GENOME BIOLOGY, 2011, 12 (03):
  • [7] Utilizing RNA-seq data in monotone iterative generalized linear model to elevate prior knowledge quality of the circRNA-miRNA-mRNA regulatory axisUtilizing RNA-seq Data in Monotone Iterative Generalized...A. Anuarbekov, J. Kléma
    Alikhan Anuarbekov
    Jiří Kléma
    BMC Bioinformatics, 26 (1)
  • [8] Accurate inference of isoforms from multiple sample RNA-Seq data
    Tasnim, Masruba
    Ma, Shining
    Yang, Ei-Wen
    Jiang, Tao
    Li, Wei
    BMC GENOMICS, 2015, 16
  • [9] Transcriptome assembly and quantification from Ion Torrent RNA-Seq data
    Mangul, Serghei
    Caciula, Adrian
    Al Seesi, Sahar
    Brinza, Dumitru
    Mondoiu, Ion
    Zelikovsky, Alex
    BMC GENOMICS, 2014, 15
  • [10] De novo assembly of bacterial transcriptomes from RNA-seq data
    Tjaden, Brian
    GENOME BIOLOGY, 2015, 16