IsoTree: De Novo Transcriptome Assembly from RNA-Seq Reads

被引:2
|
作者
Zhao, Jin [1 ]
Feng, Haodi [1 ]
Zhu, Daming [1 ]
Zhang, Chi [2 ]
Xu, Ying [3 ]
机构
[1] Shandong Univ, Sch Comp Sci & Technol, Shun Hua Rd, Jinan 250101, Shandong, Peoples R China
[2] Indiana Univ, Dept Med & Mol Genet, Bloomington, IN 47405 USA
[3] Univ Georgia, Dept Biochem & Mol Biol, Athens, GA 30602 USA
来源
BIOINFORMATICS RESEARCH AND APPLICATIONS (ISBRA 2017) | 2017年 / 10330卷
基金
中国国家自然科学基金;
关键词
ABUNDANCE ESTIMATION; ISOFORM DISCOVERY; EXPRESSION LEVELS; SPLICE JUNCTIONS; RECONSTRUCTION; REVEALS;
D O I
10.1007/978-3-319-59575-7_7
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
High-throughput sequencing of mRNA has made the deep and efficient probing of transcriptomes more affordable. However, the vast amounts of short RNA-seq reads make de novo transcriptome assembly an algorithmic challenge. In this work, we present IsoTree, a novel framework for transcripts reconstruction in the absence of reference genomes. Unlike most of de novo assembly methods that build de Bruijn graph or splicing graph by connecting k-mers which are sets of overlapping substrings generated from reads, IsoTree constructs splicing graph by connecting reads directly. For each splicing graph, IsoTree applies an iterative scheme of mixed integer linear program to build a prefix tree, called isoform tree. Each path from the root node of the isoform tree to a leaf node represents a plausible transcript candidate which will be pruned based on the information of pair-end reads. Experiments showed that IsoTree performs better in recall on both pair-end reads and singleend reads and in precision on pair-end reads compared to other leading transcript assembly programs including Cufflinks, StringTie and Bin-Packer.
引用
收藏
页码:71 / 83
页数:13
相关论文
共 50 条
  • [1] IsoTree: A New Framework for de novo Transcriptome Assembly from RNA-seq Reads
    Zhao, Jin
    Feng, Haodi
    Zhu, Daming
    Zhang, Chi
    Xu, Ying
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (03) : 938 - 948
  • [2] De novo transcriptome assembly of RNA-Seq reads with different strategies
    Chen Geng
    Yin KangPing
    Wang, Charles
    Shi TieLiu
    SCIENCE CHINA-LIFE SCIENCES, 2011, 54 (12) : 1129 - 1133
  • [3] SOAPdenovo-Trans: de novo transcriptome assembly with short RNA-Seq reads
    Xie, Yinlong
    Wu, Gengxiong
    Tang, Jingbo
    Luo, Ruibang
    Patterson, Jordan
    Liu, Shanlin
    Huang, Weihua
    He, Guangzhu
    Gu, Shengchang
    Li, Shengkang
    Zhou, Xin
    Lam, Tak-Wah
    Li, Yingrui
    Xu, Xun
    Wong, Gane Ka-Shu
    Wang, Jun
    BIOINFORMATICS, 2014, 30 (12) : 1660 - 1666
  • [4] Rnnotator: an automated de novo transcriptome assembly pipeline from stranded RNA-Seq reads
    Martin, Jeffrey
    Bruno, Vincent M.
    Fang, Zhide
    Meng, Xiandong
    Blow, Matthew
    Zhang, Tao
    Sherlock, Gavin
    Snyder, Michael
    Wang, Zhong
    BMC GENOMICS, 2010, 11
  • [5] BinPacker: Packing-Based De Novo Transcriptome Assembly from RNA-seq Data
    Liu, Juntao
    Li, Guojun
    Chang, Zheng
    Yu, Ting
    Liu, Bingqiang
    McMullen, Rick
    Chen, Pengyin
    Huang, Xiuzhen
    PLOS COMPUTATIONAL BIOLOGY, 2016, 12 (02)
  • [6] SSP: An interval integer linear programming for de novo transcriptome assembly and isoform discovery of RNA-seq reads
    Safikhani, Zhaleh
    Sadeghi, Mehdi
    Pezeshk, Hamid
    Eslahchi, Changiz
    GENOMICS, 2013, 102 (5-6) : 507 - 514
  • [7] Bridger: a new framework for de novo transcriptome assembly using RNA-seq data
    Chang, Zheng
    Li, Guojun
    Liu, Juntao
    Zhang, Yu
    Ashby, Cody
    Liu, Deli
    Cramer, Carole L.
    Huang, Xiuzhen
    GENOME BIOLOGY, 2015, 16
  • [8] Transcriptome assembly and isoform expression level estimation from biased RNA-Seq reads
    Li, Wei
    Jiang, Tao
    BIOINFORMATICS, 2012, 28 (22) : 2914 - 2921
  • [9] De novo assembly and analysis of RNA-seq data
    Robertson, Gordon
    Schein, Jacqueline
    Chiu, Readman
    Corbett, Richard
    Field, Matthew
    Jackman, Shaun D.
    Mungall, Karen
    Lee, Sam
    Okada, Hisanaga Mark
    Qian, Jenny Q.
    Griffith, Malachi
    Raymond, Anthony
    Thiessen, Nina
    Cezard, Timothee
    Butterfield, Yaron S.
    Newsome, Richard
    Chan, Simon K.
    She, Rong
    Varhol, Richard
    Kamoh, Baljit
    Prabhu, Anna-Liisa
    Tam, Angela
    Zhao, YongJun
    Moore, Richard A.
    Hirst, Martin
    Marra, Marco A.
    Jones, Steven J. M.
    Hoodless, Pamela A.
    Birol, Inanc
    NATURE METHODS, 2010, 7 (11) : 909 - U62
  • [10] De novo assembly of bacterial transcriptomes from RNA-seq data
    Tjaden, Brian
    GENOME BIOLOGY, 2015, 16