Group aggregating normalization method for the preprocessing of NMR-based metabolomic data

被引:22
作者
Dong, Jiyang [1 ,2 ]
Cheng, Kian-Kai [2 ,3 ,4 ]
Xu, Jingjing [1 ]
Chen, Zhong [1 ]
Griffin, Julian L. [2 ]
机构
[1] Xiamen Univ, Dept Elect Sci, Xiamen 361005, Peoples R China
[2] Univ Cambridge, Dept Biochem, Cambridge CB2 1GA, England
[3] Univ Teknol Malaysia, Dept Bioproc Engn, Skudai 81310, Johor, Malaysia
[4] Univ Teknol Malaysia, Inst Bioprod Dev, Skudai 81310, Johor, Malaysia
关键词
Data normalization; NMR-based metabolomics; Principal component analysis; Dilution effect; Group aggregating normalization; PRINCIPAL COMPONENT ANALYSIS; TIME-DOMAIN ALGORITHM; 1D H-1-NMR DATA; SPECTRAL NORMALIZATION; METABONOMICS; SPECTROSCOPY; NOISE;
D O I
10.1016/j.chemolab.2011.06.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data normalization plays a crucial role in metabolomics to take into account the inevitable variation in sample concentration and the efficiency of sample preparation procedure. The conventional methods such as constant sum normalization (CSN) and probabilistic quotient normalization (PQN) are widely used, but both methods have their own shortcomings. In the current study, a new data normalization method called group aggregating normalization (GAN) is proposed, by which the samples were normalized so that they aggregate close to their group centers in a principal component analysis (PCA) subspace. This is in contrast with CSN and PQN which rely on a constant reference for all samples. The evaluation of GAN method using both simulated and experimental metabolomic data demonstrated that GAN produces more robust model in the subsequent multivariate data analysis, more superior than both CSN and PQN methods. The current study also demonstrated that some of the differential metabolites identified using the CSN or PQN method could be false positives due to improper data normalization. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:123 / 132
页数:10
相关论文
共 25 条
  • [1] A solution to the 1D NMR alignment problem using an extended generalized fuzzy Hough transform and mode support
    Alm, Erik
    Torgrip, Ralf J. O.
    Aberg, K. Magnus
    Schuppe-Koistinen, Ina
    Lindberg, Johan
    [J]. ANALYTICAL AND BIOANALYTICAL CHEMISTRY, 2009, 395 (01) : 213 - 223
  • [2] [Anonymous], SYSTEMS INDIRECT OBS
  • [3] NMR-based metabonomic approaches for evaluating physiological influences on biofluid composition
    Bollard, ME
    Stanley, EG
    Lindon, JC
    Nicholson, JK
    Holmes, E
    [J]. NMR IN BIOMEDICINE, 2005, 18 (03) : 143 - 162
  • [4] Brindle JT, 2002, NAT MED, V8, P1439, DOI 10.1038/nm802
  • [5] CIBAGEIGY AG, 1977, WISSENSCHAFTLICHE TA, V1
  • [6] Scaling and normalization effects in NMR spectroscopic metabonomic data sets
    Craig, A
    Cloareo, O
    Holmes, E
    Nicholson, JK
    Lindon, JC
    [J]. ANALYTICAL CHEMISTRY, 2006, 78 (07) : 2262 - 2267
  • [7] Probabilistic quotient normalization as robust method to account for dilution of complex biological mixtures.: Application in 1H NMR metabonomics
    Dieterle, Frank
    Ross, Alfred
    Schlotterbeck, Gotz
    Senn, Hans
    [J]. ANALYTICAL CHEMISTRY, 2006, 78 (13) : 4281 - 4290
  • [8] Eriksson L., 2013, Multi-and megavariate data analysis basic principles and applications, V3rd ed.
  • [9] Correction of urine cotinine concentrations for creatinine excretion: Is it useful?
    Jatlow, P
    McKee, S
    O'Malley, SS
    [J]. CLINICAL CHEMISTRY, 2003, 49 (11) : 1932 - 1934
  • [10] Characterization of the measurement error structure in 1D 1H NMR data for metabolomics studies
    Karakach, Tobias K.
    Wentzell, Peter D.
    Walter, John A.
    [J]. ANALYTICA CHIMICA ACTA, 2009, 636 (02) : 163 - 174