Evaluating the bias of circRNA predictions from total RNA-Seq data

被引:8
|
作者
Wang, Jinzeng [1 ,2 ]
Liu, Kang [1 ]
Liu, Ya [1 ]
Lv, Qi [1 ]
Zhang, Fan [1 ,3 ]
Wang, Haiyun [1 ]
机构
[1] Tongji Univ, Sch Life Sci & Technol, Shanghai 200092, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Med, Rui Jin Hosp, Natl Res Ctr Translat Med Shanghai, Shanghai 200025, Peoples R China
[3] Tongji Univ, Sch Med, Shanghai Pulm Hosp, Clin Translat Res Ctr, Shanghai 200433, Peoples R China
基金
中国国家自然科学基金;
关键词
circular RNA; circRNA predictions; total RNA-Seq; CIRI; KNIFE; CIRCULAR RNA; REVEALS;
D O I
10.18632/oncotarget.22972
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
CircRNAs are a group of endogenous noncoding RNAs. The quickly developing high throughput RNA sequencing technologies along with novel bioinformatics approaches have enabled researchers to systematically identify circRNAs and their biological functions in cells. Deep sequencing of rRNA-depleted RNAs treated with RNase R, which digests linear RNAs and leaves circRNAs enriched, is an efficient way to identify circRNAs. However, very few of RNase R treated data are at hand but a large amount of total RNA-Seq data with no sequencing costs is available, for circRNA predictions. In this study, we systematically investigated the prediction bias from total RNA-Seq data as well as the influence of sequencing depth, sequencing quality and single-end or paired-end sequencing strategy on the predictions. We also identified circRNA properties that may contribute to the improved prediction performance. Our analysis shows that circRNA predictions from total RNA-Seq data gain similar to 50% true positive. Sequencing error dramatically worsens the predictions, rather than single-end sequencing strategy or low sequencing depth. However, false positive can be carefully controlled by using data with good quality and narrowing down circRNAs guided by their properties.
引用
收藏
页码:110914 / 110921
页数:8
相关论文
共 50 条
  • [1] Acfs: accurate circRNA identification and quantification from RNA-Seq data
    You, Xintian
    Conrad, Tim O. F.
    SCIENTIFIC REPORTS, 2016, 6
  • [2] Acfs: accurate circRNA identification and quantification from RNA-Seq data
    Xintian You
    Tim OF Conrad
    Scientific Reports, 6
  • [3] Bias and Correction in RNA-seq Data for Marine Species
    Kai Song
    Li Li
    Guofan Zhang
    Marine Biotechnology, 2017, 19 : 541 - 550
  • [4] Bias and Correction in RNA-seq Data for Marine Species
    Song, Kai
    Li, Li
    Zhang, Guofan
    MARINE BIOTECHNOLOGY, 2017, 19 (05) : 541 - 550
  • [5] A transposable approach to RNA-seq from total RNA
    Scott Kuersten
    Nature Methods, 2013, 10 (9) : 916 - 916
  • [6] A transposable approach to RNA-seq from total RNA
    Scott Kuersten
    Nature Methods, 2012, 9 (6) : i - ii
  • [7] Identification of CNAs from RNA-Seq data
    Iwamoto, Eisuke
    Sanada, Masashi
    Yasuda, Takahiko
    CANCER SCIENCE, 2022, 113 : 1446 - 1446
  • [8] Length bias correction for RNA-seq data in gene set analyses
    Gao, Liyan
    Fang, Zhide
    Zhang, Kui
    Zhi, Degui
    Cui, Xiangqin
    BIOINFORMATICS, 2011, 27 (05) : 662 - 669
  • [9] Transcript length bias in RNA-seq data confounds systems biology
    Alicia Oshlack
    Matthew J Wakefield
    Biology Direct, 4
  • [10] Transcript length bias in RNA-seq data confounds systems biology
    Oshlack, Alicia
    Wakefield, Matthew J.
    BIOLOGY DIRECT, 2009, 4