LIQA: long-read isoform quantification and analysis

Cited by: 41
Authors
Hu, Yu [1]
Fang, Li [1]
Chen, Xuelian [2]
Zhong, Jiang F. [2]
Li, Mingyao [3]
Wang, Kai [1, 4]
Affiliations
[1] Childrens Hosp Philadelphia, Raymond G Perelman Ctr Cellular & Mol Therapeut, Philadelphia, PA 19104 USA
[2] Univ Southern Calif, Keck Sch Med, Dept Otolaryngol, Los Angeles, CA 90033 USA
[3] Univ Penn, Perelman Sch Med, Dept Biostat Epidemiol & Informat, Philadelphia, PA 19104 USA
[4] Univ Penn, Dept Pathol & Lab Med, Perelman Sch Med, Philadelphia, PA 19104 USA
Keywords
RNA-SEQ; MESSENGER-RNA; EXPRESSION; TRANSCRIPTOME; COMPLEXITY; RECEPTORS
DOI
10.1186/s13059-021-02399-8
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification code
071005; 0836; 090102; 100705
Abstract
Long-read RNA sequencing (RNA-seq) technologies can sequence full-length transcripts, facilitating the exploration of isoform-specific gene expression over short-read RNA-seq. We present LIQA to quantify isoform expression and detect differential alternative splicing (DAS) events using long-read direct mRNA sequencing or cDNA sequencing data. LIQA incorporates base pair quality score and isoform-specific read length information in a survival model to assign different weights across reads, and uses an expectation-maximization algorithm for parameter estimation. We apply LIQA to long-read RNA-seq data from the Universal Human Reference, acute myeloid leukemia, and esophageal squamous epithelial cells and demonstrate its high accuracy in profiling alternative splicing events.
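As a rough illustration of the weighted expectation-maximization idea mentioned in the abstract, the sketch below estimates relative isoform abundances for one gene from a read-to-isoform compatibility matrix and per-read weights. It is not LIQA's implementation. The function name `em_isoform_abundance`, the binary compatibility matrix, and the precomputed weights are all assumptions made for illustration. According to the abstract, LIQA derives its weights from base quality scores and a survival model of isoform-specific read length, which this sketch does not model.

```python
def em_isoform_abundance(compat, weights, n_iter=200, tol=1e-9):
    """Weighted EM for relative isoform abundances (illustrative sketch only).

    compat[r][k] is 1 if read r is compatible with isoform k, else 0.
    weights[r] is a hypothetical per-read weight in (0, 1]; how such weights
    are derived is outside the scope of this sketch.
    """
    n_reads = len(compat)
    n_iso = len(compat[0])
    theta = [1.0 / n_iso] * n_iso  # start from uniform abundances

    for _ in range(n_iter):
        counts = [0.0] * n_iso
        # E-step: split each read's weight across its compatible isoforms
        # in proportion to the current abundance estimates.
        for r in range(n_reads):
            denom = sum(theta[k] * compat[r][k] for k in range(n_iso))
            if denom == 0.0:
                continue  # read is compatible with no isoform; skip it
            for k in range(n_iso):
                counts[k] += weights[r] * theta[k] * compat[r][k] / denom

        total = sum(counts)
        if total == 0.0:
            return theta  # no usable reads; keep the current estimate

        # M-step: renormalize the expected weighted counts.
        new_theta = [c / total for c in counts]
        converged = max(abs(a - b) for a, b in zip(theta, new_theta)) < tol
        theta = new_theta
        if converged:
            break
    return theta


# Three reads unique to isoform 0, one unique to isoform 1,
# and two reads compatible with both isoforms.
compat = [[1, 0], [1, 0], [1, 0], [0, 1], [1, 1], [1, 1]]
theta = em_isoform_abundance(compat, [1.0] * len(compat))
```

Lowering the weight of a read reduces its contribution to the expected counts, which is the general mechanism by which a weighted scheme can discount low-quality or likely truncated reads.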
Pages: 21
Related papers
59 in total
[21]
Kent WJ, 2002, GENOME RES, V12, P656, DOI 10.1101/gr.229202
[22]   Alternative splicing isoforms in health and disease [J].
Kim, Hyoung Kyu ;
Pham, Michael Huy Cuong ;
Ko, Kyung Soo ;
Rhee, Byoung Doo ;
Han, Jin .
PFLUGERS ARCHIV-EUROPEAN JOURNAL OF PHYSIOLOGY, 2018, 470 (07) :995-1016
[23]   RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome [J].
Li, Bo ;
Dewey, Colin N. .
BMC BIOINFORMATICS, 2011, 12
[24]   Minimap2: pairwise alignment for nucleotide sequences [J].
Li, Heng .
BIOINFORMATICS, 2018, 34 (18) :3094-3100
[25]   Transcriptome assembly and isoform expression level estimation from biased RNA-Seq reads [J].
Li, Wei ;
Jiang, Tao .
BIOINFORMATICS, 2012, 28 (22) :2914-2921
[26]   RNA splicing is a primary link between genetic variation and disease [J].
Li, Yang I. ;
van de Geijn, Bryce ;
Raj, Anil ;
Knowles, David A. ;
Petti, Allegra A. ;
Golan, David ;
Gilad, Yoav ;
Pritchard, Jonathan K. .
SCIENCE, 2016, 352 (6285) :600-604
[27]
Maric J, 2019, GRAPHMAP2 SPLICE AWA, DOI 10.1101/720458
[28]   RNA Splicing and Disease: Animal Models to Therapies [J].
Montes, Matias ;
Sanford, Brianne L. ;
Comiskey, Daniel F. ;
Chandler, Dawn S. .
TRENDS IN GENETICS, 2019, 35 (01) :68-87
[29]   TIGAR2: sensitive and accurate estimation of transcript isoform expression with longer RNA-Seq reads [J].
Nariai, Naoki ;
Kojima, Kaname ;
Mimori, Takahiro ;
Sato, Yukuto ;
Kawai, Yosuke ;
Yamaguchi-Kabata, Yumi ;
Nagasaki, Masao .
BMC GENOMICS, 2014, 15
[30]   Estimation of alternative splicing isoform frequencies from RNA-Seq data [J].
Nicolae, Marius ;
Mangul, Serghei ;
Mandoiu, Ion I. ;
Zelikovsky, Alex .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2011, 6