Evaluation of Peak Picking Quality in LC-MS Metabolomics Data

被引:52
作者
Brodsky, Leonid [1 ,2 ]
Moussaieff, Arieh [1 ]
Shahaf, Nir [1 ]
Aharoni, Asaph [1 ]
Rogachev, Ilana [1 ]
机构
[1] Weizmann Inst Sci, Dept Plant Sci, IL-76100 Rehovot, Israel
[2] Univ Haifa, Inst Evolut, IL-31905 Haifa, Israel
关键词
MASS-SPECTROMETRY DATA; IDENTIFICATION; METABOLITES; PROFILE;
D O I
10.1021/ac101216e
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The output of LC-MS metabolomics experiments consists of mass-peak intensities identified through a peak-picking/alignment procedure. Besides imperfections in biological samples and instrumentation, data accuracy is highly dependent on the applied algorithms and their parameters. Consequently, quality control (QC) is essential for further data analysis. Here, we present a QC approach that is based on discrepancies between replicate samples. First, the quantile normalization of per-sample log-signal distributions is applied to each group of biologically homogeneous samples. Next, the overall quality of each replicate group is characterized by the Z-transformed correlation coefficients between samples. This general QC allows a tuning of the procedure's parameters which minimizes the inter-replicate discrepancies in the generated output. Subsequently, an in-depth QC measure detects local neighborhoods on a template of aligned chromatograms that are enriched by divergences between intensity profiles of replicate samples. These neighborhoods are determined through a segmentation algorithm. The retention time (RT)-m/z positions of the neighborhoods with local divergences are indicative of either: incorrect alignment of chromatographic features, technical problems in the chromatograms, or to a true biological discrepancy between replicates for particular metabolites. We expect this method to aid in the accurate analysis of metabolomics data and in the development of new peak-picking/alignment procedures.
引用
收藏
页码:9177 / 9187
页数:11
相关论文
共 25 条
  • [1] Large-scale human metabolomics studies: A strategy for data (pre-) processing and validation
    Bijlsma, S
    Bobeldijk, L
    Verheij, ER
    Ramaker, R
    Kochhar, S
    Macdonald, IA
    van Ommen, B
    Smilde, AK
    [J]. ANALYTICAL CHEMISTRY, 2006, 78 (02) : 567 - 574
  • [2] A comparison of normalization methods for high density oligonucleotide array data based on variance and bias
    Bolstad, BM
    Irizarry, RA
    Åstrand, M
    Speed, TP
    [J]. BIOINFORMATICS, 2003, 19 (02) : 185 - 193
  • [3] BOLSTAD BM, 2001, UNPUB
  • [4] A binary search approach to whole-genome data analysis
    Brodsky, Leonid
    Kogan, Simon
    BenJacob, Eshel
    Nevo, Eviatar
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (39) : 16893 - 16898
  • [5] COMPUTER METHODS IN ANALYTICAL MASS SPECTROMETRY - IDENTIFICATION OF AN UNKNOWN COMPOUND IN A CATALOG
    CRAWFORD, LR
    MORRISON, JD
    [J]. ANALYTICAL CHEMISTRY, 1968, 40 (10) : 1464 - &
  • [6] Equating, or Correction for Between-Block Effects with Application to Body Fluid LC-MS and NMR Metabolomics Data Sets
    Draisma, Harmen H. M.
    Reijmers, Theo H.
    van der Kloet, Frans
    Bobeldijk-Pastorova, Ivana
    Spies-Faber, Elly
    Vogels, Jack T. W. E.
    Meulman, Jacqueline J.
    Boomsma, Dorret I.
    van der Greef, Jan
    Hankemeier, Thomas
    [J]. ANALYTICAL CHEMISTRY, 2010, 82 (03) : 1039 - 1046
  • [7] Quality control for plant metabolomics: reporting MSI-compliant studies
    Fiehn, Oliver
    Wohlgemuth, Gert
    Scholz, Martin
    Kind, Tobias
    Lee, Do Yup
    Lu, Yun
    Moon, Stephanie
    Nikolau, Basil
    [J]. PLANT JOURNAL, 2008, 53 (04) : 691 - 704
  • [8] Maximum likelihood estimation of optimal scaling factors for expression array normalization
    Hartemink, AJ
    Gifford, DK
    Jaakkola, TS
    Young, RA
    [J]. MICROARRAYS: OPTICAL TECHNOLOGIES AND INFORMATICS, 2001, 4266 : 132 - 140
  • [9] Automated quantitative analysis of complex lipidomes by liquid chromatography/mass spectrometry
    Hermansson, M
    Uphoff, A
    Kakela, R
    Somerharju, P
    [J]. ANALYTICAL CHEMISTRY, 2005, 77 (07) : 2166 - 2175
  • [10] MZmine: toolbox for processing and visualization of mass spectrometry based molecular profile data
    Katajamaa, M
    Miettinen, J
    Oresic, M
    [J]. BIOINFORMATICS, 2006, 22 (05) : 634 - 636