Data Normalization of 1H NMR Metabolite Fingerprinting Data Sets in the Presence of Unbalanced Metabolite Regulation

Cited by: 28
Authors
Hochrein, Jochen [1]
Zacharias, Helena U. [1]
Taruttis, Franziska [1]
Samol, Claudia [1]
Engelmann, Julia C. [1]
Spang, Rainer [1]
Oefner, Peter J. [1]
Gronwald, Wolfram [1]
Affiliations
[1] Univ Regensburg, Inst Funct Genom, D-93053 Regensburg, Germany
Keywords
NMR; data normalization; metabolomics; unbalanced regulation; confounding factors; STATISTICAL-METHODS; METABOLOMICS DATA; NMR; URINE;
DOI
10.1021/acs.jproteome.5b00192
Chinese Library Classification
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Data normalization is an essential step in NMR-based metabolomics. Conducted properly, it improves data quality and removes unwanted biases. The choice of the appropriate normalization method is critical and depends on the inherent properties of the data set in question. In particular, the presence of unbalanced metabolic regulation, where the different specimens and cohorts under investigation do not contain approximately equal shares of up- and down-regulated features, may strongly influence data normalization. Here, we demonstrate the suitability of the Shapiro-Wilk test to detect such unbalanced regulation. Next, employing a Latin-square design consisting of eight metabolites spiked into a urine specimen at eight different known concentrations, we show that commonly used normalization and scaling methods fail to retrieve true metabolite concentrations in the presence of increasing amounts of glucose added to simulate unbalanced regulation. However, by learning the normalization parameters on a subset of nonregulated features only, Linear Baseline Normalization, Probabilistic Quotient Normalization, and Variance Stabilization Normalization were found to account well for different dilutions of the samples without distorting the true spike-in levels even in the presence of marked unbalanced metabolic regulation. Finally, the methods described were applied successfully to a real world example of unbalanced regulation, namely, a set of plasma specimens collected from patients with and without acute kidney injury after cardiac surgery with cardiopulmonary bypass use.
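The idea of learning normalization parameters on nonregulated features only can be illustrated for Probabilistic Quotient Normalization. The following is a minimal sketch under stated assumptions, not the authors' implementation: the function name `pqn_on_subset`, the toy intensities, and the choice of the feature-wise median as reference spectrum are all hypothetical, and the set of nonregulated features is assumed to be known in advance.

```python
from statistics import median


def pqn_on_subset(spectra, reference_idx):
    """Probabilistic quotient normalization with the dilution factor
    estimated only from the features listed in reference_idx.

    spectra: list of samples, each a list of feature intensities.
    reference_idx: indices of features assumed to be nonregulated.
    """
    # Reference spectrum (assumption): feature-wise median across samples,
    # computed for the reference features only.
    ref = {j: median(s[j] for s in spectra) for j in reference_idx}

    normalized = []
    for s in spectra:
        # Quotients of this sample against the reference spectrum.
        quotients = [s[j] / ref[j] for j in reference_idx if ref[j] != 0]
        # The median quotient is taken as the sample's dilution factor.
        factor = median(quotients)
        # The factor is applied to every feature, regulated or not.
        normalized.append([x / factor for x in s])
    return normalized


# Toy data (hypothetical): three dilutions of the same base spectrum.
base = [10.0, 20.0, 30.0, 40.0, 5.0]
samples = [
    base[:],                  # undiluted
    [x * 0.5 for x in base],  # twofold diluted
    [x * 2.0 for x in base],  # twofold concentrated
]
# Feature 4 is strongly up-regulated in the second sample only.
samples[1][4] = 500.0

normalized = pqn_on_subset(samples, reference_idx=[0, 1, 2, 3])
```

Because feature 4 is excluded from the estimate of the dilution factor, the four reference features are brought back to the same level in all three samples while the elevated feature keeps its increase relative to the others.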
Pages: 3217-3228
Page count: 12