CircAST: Full-length Assembly and Quantification of Alternatively Spliced Isoforms in Circular RNAs
Cited by: 47
Authors:
Wu, Jing [1,2]
Li, Yan [3,4]
Wang, Cheng [3]
Cui, Yiqiang [3]
Xu, Tianyi [1]
Wang, Chang [3]
Wang, Xiao [1]
Sha, Jiahao [3]
Jiang, Bin [1]
Wang, Kai [5]
Hu, Zhibin [3]
Guo, Xuejiang [3]
Song, Xiaofeng [1]
Affiliations:
[1] Nanjing Univ Aeronaut & Astronaut, Dept Biomed Engn, Nanjing 211106, Peoples R China
[2] Nanjing Med Univ, Sch Biomed Engn & Informat, Nanjing 211166, Peoples R China
[3] Nanjing Med Univ, State Key Lab Reprod Med, Nanjing 211166, Peoples R China
[4] Nanjing Med Univ, Sir Run Run Hosp, Ctr Pathol & Clin Lab, Nanjing 211166, Peoples R China
[5] Childrens Hosp Philadelphia, Ctr Cellular & Mol Therapeut, Philadelphia, PA 19104 USA
Funding:
National Key Research and Development Program of China;
National Natural Science Foundation of China;
Keywords:
Circular RNA;
Full-length reconstruction;
Isoform quantification;
Multiple splice graph model;
Transcriptome;
TRANSLATION;
TRANSCRIPTS;
DATABASE;
REVEALS;
DOI:
10.1016/j.gpb.2019.03.004
Chinese Library Classification number:
Q3 [Genetics];
Discipline classification codes:
071007;
090102;
Abstract:
Circular RNAs (circRNAs), covalently closed continuous RNA loops, are generated from cognate linear RNAs through back splicing events, and alternative splicing events may generate different circRNA isoforms at the same locus. However, the challenges of reconstruction and quantification of alternatively spliced full-length circRNAs remain unresolved. On the basis of the internal structural characteristics of circRNAs, we developed CircAST, a tool to assemble alternatively spliced circRNA transcripts and estimate their expression by using multiple splice graphs. Simulation studies showed that CircAST correctly assembled the full sequences of circRNAs with a sensitivity of 85.63%-94.32% and a precision of 81.96%-87.55%. By assigning reads to specific isoforms, CircAST quantified the expression of circRNA isoforms with correlation coefficients of 0.85-0.99 between theoretical and estimated values. We evaluated CircAST on an in-house mouse testis RNA-seq dataset with RNase R treatment for enriching circRNAs and identified 380 circRNAs with full-length sequences different from those of their corresponding cognate linear RNAs. RT-PCR and Sanger sequencing analyses validated 32 out of 37 randomly selected isoforms, thus further indicating the good performance of CircAST, especially for isoforms with low abundance. We also applied CircAST to published experimental data and observed substantial diversity in circular transcripts across samples, thus suggesting that circRNA expression is highly regulated.
Pages: 522-534
Page count: 13