Soft truncation thresholding for gene set analysis of RNA-seq data: Application to a vaccine study

被引:14
作者
Fridley, Brooke L. [1 ]
Jenkins, Gregory D. [2 ]
Grill, Diane E. [2 ]
Kennedy, Richard B. [3 ,4 ,5 ]
Poland, Gregory A. [3 ,4 ,5 ]
Oberg, Ann L. [2 ,4 ]
机构
[1] Univ Kansas, Med Ctr, Dept Biostat, Kansas City, KS 66160 USA
[2] Mayo Clin, Coll Med, Dept Hlth Sci Res, Div Biomed Stat & Informat, Rochester, MN 55905 USA
[3] Mayo Clin, Coll Med, Dept Mol Med, Rochester, MN 55905 USA
[4] Mayo Clin, Mayo Vaccine Res Grp, Rochester, MN 55905 USA
[5] Mayo Clin, Program Translat Immunovirol & Biodef, Rochester, MN 55905 USA
关键词
IMMUNE-RESPONSES; MICROARRAY DATA; EXPRESSION;
D O I
10.1038/srep02898
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gene set analysis (GSA) has been used for analysis of microarray data to aid the interpretation and to increase statistical power. With the advent of next-generation sequencing, the use of GSA is even more relevant, as studies are often conducted on a small number of samples. We propose the use of soft truncation thresholding and the Gamma Method (GM) to determine significant gene set (GS), where a generalized linear model is used to assess per-gene significance. The approach was compared to other methods using an extensive simulation study and RNA-seq data from smallpox vaccine study. The GM was found to outperform other proposed methods. Application of the GM to the smallpox vaccine study found the GSs to be moderately associated with response, including focal adhesion (p = 0.04) and extracellular matrix receptor interaction (p = 0.05). The application of GSA to RNA-seq data will provide new insights into the genomic basis of complex traits.
引用
收藏
页数:6
相关论文
共 28 条
[1]   Microarray data analysis: from disarray to consolidation and consensus [J].
Allison, DB ;
Cui, XQ ;
Page, GP ;
Sabripour, M .
NATURE REVIEWS GENETICS, 2006, 7 (01) :55-65
[2]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[3]  
[Anonymous], GENES IMMUN IN PRESS
[4]   Genome-wide gene-set analysis for identification of pathways associated with alcohol dependence [J].
Biernacka, Joanna M. ;
Geske, Jennifer ;
Jenkins, Gregory D. ;
Colby, Colin ;
Rider, David N. ;
Karpyak, Victor M. ;
Choi, Doo-Sup ;
Fridley, Brooke L. .
INTERNATIONAL JOURNAL OF NEUROPSYCHOPHARMACOLOGY, 2013, 16 (02) :271-278
[5]   Use of the gamma method for self-contained gene-set analysis of SNP data [J].
Biernacka, Joanna M. ;
Jenkins, Gregory D. ;
Wang, Liewei ;
Moyer, Ann M. ;
Fridley, Brooke L. .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2012, 20 (05) :565-571
[6]   Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments [J].
Bullard, James H. ;
Purdom, Elizabeth ;
Hansen, Kasper D. ;
Dudoit, Sandrine .
BMC BIOINFORMATICS, 2010, 11
[7]   Insights into Colon Cancer Etiology via a Regularized Approach to Gene Set Analysis of GWAS Data [J].
Chen, Lin S. ;
Hutter, Carolyn M. ;
Potter, John D. ;
Liu, Yan ;
Prentice, Ross L. ;
Peters, Ulrike ;
Hsu, Li .
AMERICAN JOURNAL OF HUMAN GENETICS, 2010, 86 (06) :860-871
[8]   Improving gene set analysis of microarray data by SAM-GS [J].
Dinu, Irina ;
Potter, John D. ;
Mueller, Thomas ;
Liu, Qi ;
Adewale, Adeniyi J. ;
Jhangri, Gian S. ;
Einecke, Gunilla ;
Famulski, Konrad S. ;
Halloran, Philip ;
Yasui, Yutaka .
BMC BIOINFORMATICS, 2007, 8 (1)
[9]   ON TESTING THE SIGNIFICANCE OF SETS OF GENES [J].
Efron, Bradley ;
Tibshirani, Robert .
ANNALS OF APPLIED STATISTICS, 2007, 1 (01) :107-129
[10]   Self-Contained Gene-Set Analysis of Expression Data: An Evaluation of Existing and Novel Methods [J].
Fridley, Brooke L. ;
Jenkins, Gregory D. ;
Biernacka, Joanna M. .
PLOS ONE, 2010, 5 (09) :1-9