Sparse non-negative generalized PCA with applications to metabolomics

被引:35
作者
Allen, Genevera I. [1 ,2 ]
Maletic-Savatic, Mirjana [1 ]
机构
[1] Texas Childrens Hosp, Jan & Dan Duncan Neurol Res Inst, Baylor Coll Med, Dept Pediat, Houston, TX 77030 USA
[2] Rice Univ, Dept Stat, Houston, TX 77005 USA
关键词
PRINCIPAL COMPONENT ANALYSIS; NEURAL PROGENITOR CELLS; MATRIX FACTORIZATION; H-1-NMR; LASSO; IDENTIFICATION; DECOMPOSITION; TECHNOLOGIES; TOXICOLOGY; SELECTION;
D O I
10.1093/bioinformatics/btr522
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Nuclear magnetic resonance (NMR) spectroscopy has been used to study mixtures of metabolites in biological samples. This technology produces a spectrum for each sample depicting the chemical shifts at which an unknown number of latent metabolites resonate. The interpretation of this data with common multivariate exploratory methods such as principal components analysis (PCA) is limited due to high-dimensionality, non-negativity of the underlying spectra and dependencies at adjacent chemical shifts. Results: We develop a novel modification of PCA that is appropriate for analysis of NMR data, entitled Sparse Non-Negative Generalized PCA. This method yields interpretable principal components and loading vectors that select important features and directly account for both the non-negativity of the underlying spectra and dependencies at adjacent chemical shifts. Through the reanalysis of experimental NMR data on five purified neural cell types, we demonstrate the utility of our methods for dimension reduction, pattern recognition, sample exploration and feature selection. Our methods lead to the identification of novel metabolites that reflect the differences between these cell types.
引用
收藏
页码:3029 / 3035
页数:7
相关论文
共 33 条
[1]  
Allen G., 2011, TR201103 RIC U
[2]   NMR-based metabonomic approaches for evaluating physiological influences on biofluid composition [J].
Bollard, ME ;
Stanley, EG ;
Lindon, JC ;
Nicholson, JK ;
Holmes, E .
NMR IN BIOMEDICINE, 2005, 18 (03) :143-162
[3]   NMR-based metabolic profiling and metabonomic approaches to problems in molecular toxicology [J].
Coen, Muireann ;
Holmes, Elaine ;
Lindon, John C. ;
Nicholson, Jeremy K. .
CHEMICAL RESEARCH IN TOXICOLOGY, 2008, 21 (01) :9-27
[4]   Curve-fitting method for direct quantitation of compounds in complex biological mixtures using 1H NMR:: Application in metabonomic toxicology studies [J].
Crockford, DJ ;
Keun, HC ;
Smith, LM ;
Holmes, E ;
Nicholson, JK .
ANALYTICAL CHEMISTRY, 2005, 77 (14) :4556-4562
[5]  
de Graaf R.A., 2007, IN VIVO NMR SPECTROS
[6]   Measuring the metabolome: current analytical technologies [J].
Dunn, WB ;
Bailey, NJC ;
Johnson, HE .
ANALYST, 2005, 130 (05) :606-625
[7]   Bioinformatic methods in NMR-based metabolic profiling [J].
Ebbels, Timothy M. D. ;
Cavill, Rachel .
PROGRESS IN NUCLEAR MAGNETIC RESONANCE SPECTROSCOPY, 2009, 55 (04) :361-374
[8]   Regularization Paths for Generalized Linear Models via Coordinate Descent [J].
Friedman, Jerome ;
Hastie, Trevor ;
Tibshirani, Rob .
JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (01) :1-22
[9]   Metabolomics by numbers: acquiring and understanding global metabolite data [J].
Goodacre, R ;
Vaidyanathan, S ;
Dunn, WB ;
Harrigan, GG ;
Kell, DB .
TRENDS IN BIOTECHNOLOGY, 2004, 22 (05) :245-252
[10]   Metabolomics: Current technologies and future trends [J].
Hollywood, Katherine ;
Brison, Daniel R. ;
Goodacre, Royston .
PROTEOMICS, 2006, 6 (17) :4716-4723