CADBURE: A generic tool to evaluate the performance of spliced aligners on RNA-Seq data

被引:7
|
作者
Kumar, Praveen Kumar Raj [1 ]
Hoang, Thanh V. [1 ]
Robinson, Michael L. [1 ]
Tsonis, Panagiotis A. [2 ,3 ]
Liang, Chun [1 ,4 ]
机构
[1] Miami Univ, Dept Biol, Oxford, OH 45056 USA
[2] Univ Dayton, Dept Biol, Dayton, OH 45469 USA
[3] Univ Dayton, Ctr Tissue Regenerat & Engn, Dayton, OH 45469 USA
[4] Miami Univ, Dept Comp Sci & Software Engn, Oxford, OH 45056 USA
来源
SCIENTIFIC REPORTS | 2015年 / 5卷
关键词
TRANSCRIPTOME ANALYSIS; EXPRESSION; ALIGNMENT; READS;
D O I
10.1038/srep13443
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The fundamental task in RNA-Seq-based transcriptome analysis is alignment of millions of short reads to the reference genome or transcriptome. Choosing the right tool for the dataset in hand from many existent RNA-Seq alignment packages remains a critical challenge for downstream analysis. To facilitate this choice, we designed a novel tool for comparing alignment results of user data based on the relative reliability of uniquely aligned reads (CADBURE). CADBURE can easily evaluate different aligners, or different parameter sets using the same aligner, and selects the best alignment result for any RNA-Seq dataset. Strengths of CADBURE include the ability to compare alignment results without the need for synthetic data such as simulated genomes, alignment regeneration and randomly subsampled datasets. The benefit of a CADBURE selected alignment result was supported by differentially expressed gene (DEG) analysis. We demonstrated that the use of CADBURE to select the best alignment from a number of different alignment results could change the number of DEGs by as much as 10%. In particular, the CADBURE selected alignment result favors fewer false positives in the DEG analysis. We also verified differential expression of eighteen genes with RT-qPCR validation experiments. CADBURE is an open source tool (http://cadbure.sourceforge.net/).
引用
收藏
页数:10
相关论文
共 50 条
  • [31] NSMAP: A method for spliced isoforms identification and quantification from RNA-Seq
    Xia, Zheng
    Wen, Jianguo
    Chang, Chung-Che
    Zhou, Xiaobo
    BMC BIOINFORMATICS, 2011, 12
  • [32] Transcript quantification with RNA-Seq data
    Regina Bohnert
    Jonas Behr
    Gunnar Rätsch
    BMC Bioinformatics, 10
  • [33] RNA-Seq Data: A Complexity Journey
    Capobianco, Enrico
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2014, 11 (19): : 123 - 130
  • [34] A global assessment of RNA-seq performance
    Darren J. Burgess
    Nature Reviews Genetics, 2014, 15 (10) : 645 - 645
  • [35] An Efficient and Flexible Method for Deconvoluting Bulk RNA-Seq Data with Single-Cell RNA-Seq Data
    Sun, Xifang
    Sun, Shiquan
    Yang, Sheng
    CELLS, 2019, 8 (10)
  • [36] Crafted experiments to evaluate feature selection methods for single cell RNA-seq data
    Liu, Siyao
    Corcoran, David
    Garcia-Recio, Susana
    Perou, Charles
    Marron, J. S.
    CANCER RESEARCH, 2024, 84 (07)
  • [37] Deep annotation of long noncoding RNAs by assembling RNA-seq and small RNA-seq data
    Zhang, Jiaming
    Hou, Weibo
    Zhao, Qi
    Xiao, Songling
    Linghu, Hongye
    Zhang, Lixin
    Du, Jiawei
    Cui, Hongdi
    Yang, Xu
    Ling, Shukuan
    Su, Jianzhong
    Kong, Qingran
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2023, 299 (09)
  • [38] BEAVR: a browser-based tool for the exploration and visualization of RNA-seq data
    Perampalam, Pirunthan
    Dick, Frederick A.
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [39] DVGfinder: A Metasearch Tool for Identifying Defective Viral Genomes in RNA-Seq Data
    Olmo-Uceda, Maria J.
    Munoz-Sanchez, Juan C.
    Lasso-Giraldo, Wilberth
    Arnau, Vicente
    Diaz-Villanueva, Wladimiro
    Elena, Santiago F.
    VIRUSES-BASEL, 2022, 14 (05):
  • [40] SCRABBLE: single-cell RNA-seq imputation constrained by bulk RNA-seq data
    Peng, Tao
    Zhu, Qin
    Yin, Penghang
    Tan, Kai
    GENOME BIOLOGY, 2019, 20 (1)