MetaboLyzer: A Novel Statistical Workflow for Analyzing Postprocessed LC-MS Metabolomics Data

Cited by: 85
Authors
Mak, Tytus D. [1]
Laiakis, Evagelia C. [2]
Goudarzi, Maryam [2]
Fornace, Albert J., Jr. [1, 2, 3]
Affiliations
[1] Georgetown Univ, Med Ctr, Lombardi Comprehens Canc Ctr, Washington, DC 20057 USA
[2] Georgetown Univ, Med Ctr, Washington, DC 20057 USA
[3] King Abdulaziz Univ, Ctr Excellence Genom Med Res CEGMR, Jeddah 22254, Saudi Arabia
Keywords
KEGG;
DOI
10.1021/ac402477z
Chinese Library Classification number
O65 [Analytical Chemistry];
Discipline classification code
070302; 081704;
Abstract
Metabolomics, the global study of small molecules in a particular system, has in the past few years risen to become a primary -omics platform for the study of metabolic processes. With the ever-increasing pool of quantitative data yielded from metabolomic research, specialized methods and tools with which to analyze and extract meaningful conclusions from these data are becoming increasingly crucial. Furthermore, the depth of knowledge and expertise required to undertake a metabolomics-oriented study is a daunting obstacle to investigators new to the field. As such, we have created a new statistical analysis workflow, MetaboLyzer, which aims both to simplify analysis for investigators new to metabolomics and to provide experienced investigators the flexibility to conduct sophisticated analysis. MetaboLyzer's workflow is specifically tailored to the unique characteristics and idiosyncrasies of postprocessed liquid chromatography-mass spectrometry (LC-MS)-based metabolomic data sets. It utilizes a wide gamut of statistical tests, procedures, and methodologies that belong to classical biostatistics, as well as several novel statistical techniques that we have developed specifically for metabolomics data. Furthermore, MetaboLyzer conducts rapid putative ion identification and putative biologically relevant analysis via incorporation of four major small molecule databases: KEGG, HMDB, Lipid Maps, and BioCyc. MetaboLyzer incorporates these aspects into a comprehensive workflow that outputs easy-to-understand, statistically significant, and potentially biologically relevant information in the form of heatmaps, volcano plots, 3D visualization plots, correlation maps, and metabolic pathway hit histograms. For demonstration purposes, a urine metabolomics data set from a previously reported radiobiology study, in which samples were collected from mice exposed to gamma radiation, was analyzed. MetaboLyzer was able to identify 243 statistically significant ions out of a total of 1942. Numerous putative metabolites and pathways were found to be biologically significant from the putative ion identification workflow.
Pages: 506-513
Number of pages: 8