TPMCalculator: one-step software to quantify mRNA abundance of genomic features

Cited by: 170
Authors
Alvarez, Roberto Vera [1 ]
Pongor, Lorinc Sandor [1 ,2 ]
Marino-Ramirez, Leonardo [1 ]
Landsman, David [1 ]
Affiliations
[1] Natl Ctr Biotechnol Informat, Computat Biol Branch, Natl Lib Med, NIH, Bethesda, MD 20892 USA
[2] Semmelweis Univ, Dept Pediat 2, H-1094 Budapest, Hungary
Funding
National Institutes of Health (NIH), USA;
Keywords
QUANTIFICATION;
DOI
10.1093/bioinformatics/bty896
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Summary: Quantifying RNA sequencing (RNA-seq) abundance with a normalization method that calculates transcripts per million (TPM) is a key step in comparing multiple samples from different experiments. TPMCalculator is a one-step software tool that processes RNA-seq alignments in BAM format and reports TPM values, raw read counts and feature lengths for genes, transcripts, exons and introns. The program describes the genomic features through a model generated from the gene transfer format (GTF) file used during alignment, and reports the TPM values and the raw read counts for each feature. In this paper, we show the correlation between TPM and FPKM reported by TPMCalculator and RSeQC for 1256 samples from the TCGA-BRCA project. We also show the correlation between the raw read counts reported by TPMCalculator, HTSeq and featureCounts.
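For readers unfamiliar with the normalization named in the abstract, the following is a minimal sketch of the standard TPM definition: read counts are divided by feature length in kilobases, then rescaled so the values sum to one million. It is an illustration only and is not taken from TPMCalculator; the function name `tpm` and its arguments are hypothetical, and the tool's own handling of features, overlaps and read filtering is not represented here.

```python
def tpm(counts, lengths_bp):
    """Compute TPM for parallel sequences of raw read counts and feature lengths.

    Illustrative sketch of the textbook TPM formula; not TPMCalculator's code.
    counts     -- raw read counts, one per feature
    lengths_bp -- feature lengths in base pairs, one per feature
    """
    if len(counts) != len(lengths_bp):
        raise ValueError("counts and lengths_bp must have the same length")
    if any(length <= 0 for length in lengths_bp):
        raise ValueError("feature lengths must be positive")

    # Step 1: length-normalize each count (reads per kilobase).
    rpk = [c / (length / 1000.0) for c, length in zip(counts, lengths_bp)]

    # Step 2: rescale so that all values in the sample sum to one million.
    total = sum(rpk)
    if total == 0:
        return [0.0] * len(rpk)
    return [r / total * 1e6 for r in rpk]


if __name__ == "__main__":
    # Three hypothetical features whose counts are proportional to their lengths,
    # so each receives the same TPM value.
    print(tpm([10, 20, 30], [1000, 2000, 3000]))
```

Because the rescaling is applied per sample, TPM values in every sample sum to the same total, which is what makes them easier to compare across samples than length-normalized counts alone.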
Pages: 1960-1962
Page count: 3
Related papers
9 in total
[1]   HTSeq-a Python framework to work with high-throughput sequencing data [J].
Anders, Simon ;
Pyl, Paul Theodor ;
Huber, Wolfgang .
BIOINFORMATICS, 2015, 31 (02) :166-169
[2]   Comprehensive molecular portraits of human breast tumours [J].
Koboldt, Daniel C. ;
Fulton, Robert S. ;
McLellan, Michael D. ;
Schmidt, Heather ;
Kalicki-Veizer, Joelle ;
McMichael, Joshua F. ;
Fulton, Lucinda L. ;
Dooling, David J. ;
Ding, Li ;
Mardis, Elaine R. ;
Wilson, Richard K. ;
Ally, Adrian ;
Balasundaram, Miruna ;
Butterfield, Yaron S. N. ;
Carlsen, Rebecca ;
Carter, Candace ;
Chu, Andy ;
Chuah, Eric ;
Chun, Hye-Jung E. ;
Coope, Robin J. N. ;
Dhalla, Noreen ;
Guin, Ranabir ;
Hirst, Carrie ;
Hirst, Martin ;
Holt, Robert A. ;
Lee, Darlene ;
Li, Haiyan I. ;
Mayo, Michael ;
Moore, Richard A. ;
Mungall, Andrew J. ;
Pleasance, Erin ;
Robertson, A. Gordon ;
Schein, Jacqueline E. ;
Shafiei, Arash ;
Sipahimalani, Payal ;
Slobodan, Jared R. ;
Stoll, Dominik ;
Tam, Angela ;
Thiessen, Nina ;
Varhol, Richard J. ;
Wye, Natasja ;
Zeng, Thomas ;
Zhao, Yongjun ;
Birol, Inanc ;
Jones, Steven J. M. ;
Marra, Marco A. ;
Cherniack, Andrew D. ;
Saksena, Gordon ;
Onofrio, Robert C. ;
Pho, Nam H. .
NATURE, 2012, 490 (7418) :61-70
[3]   featureCounts: an efficient general purpose program for assigning sequence reads to genomic features [J].
Liao, Yang ;
Smyth, Gordon K. ;
Shi, Wei .
BIOINFORMATICS, 2014, 30 (07) :923-930
[4]   Mapping and quantifying mammalian transcriptomes by RNA-Seq [J].
Mortazavi, Ali ;
Williams, Brian A. ;
McCue, Kenneth ;
Schaeffer, Lorian ;
Wold, Barbara .
NATURE METHODS, 2008, 5 (07) :621-628
[5]   Salmon provides fast and bias-aware quantification of transcript expression [J].
Patro, Rob ;
Duggal, Geet ;
Love, Michael I. ;
Irizarry, Rafael A. ;
Kingsford, Carl .
NATURE METHODS, 2017, 14 (04) :417-+
[6]   Sayers EW, 2019, NUCLEIC ACIDS RES, V47, pD23, DOI [10.1093/nar/gky1069, 10.1093/nar/gkr1184, 10.1093/nar/gks1189]
[7]   Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation [J].
Trapnell, Cole ;
Williams, Brian A. ;
Pertea, Geo ;
Mortazavi, Ali ;
Kwan, Gordon ;
van Baren, Marijke J. ;
Salzberg, Steven L. ;
Wold, Barbara J. ;
Pachter, Lior .
NATURE BIOTECHNOLOGY, 2010, 28 (05) :511-U174
[8]   Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples [J].
Wagner, Gunter P. ;
Kin, Koryu ;
Lynch, Vincent J. .
THEORY IN BIOSCIENCES, 2012, 131 (04) :281-285
[9]   RSeQC: quality control of RNA-seq experiments [J].
Wang, Liguo ;
Wang, Shengqin ;
Li, Wei .
BIOINFORMATICS, 2012, 28 (16) :2184-2185