Instrumental Drift in Untargeted Metabolomics: Optimizing Data Quality with Intrastudy QC Samples

被引:14
作者
Maertens, Andre [1 ,2 ]
Holle, Johannes [3 ]
Mollenhauer, Brit [4 ,5 ]
Wegner, Andre [1 ]
Kirwan, Jennifer [6 ]
Hiller, Karsten [1 ]
机构
[1] Tech Univ Carolo Wilhelmina Braunschweig, Braunschweig Integrated Ctr Syst Biol, Dept Bioinformat & Biochem, D-38118 Braunschweig, Germany
[2] Phys Tech Bundesanstalt, D-38116 Braunschweig, Germany
[3] Univ Med Berlin, Dept Pediat Gastroenterol Nephrol & Metab Dis, D-13353 Berlin, Germany
[4] Univ Med Ctr Gottingen, Dept Neurol, D-37073 Gottingen, Germany
[5] Paracelsus Elena Klin, D-34128 Kassel, Germany
[6] Univ Med Berlin, Berlin Inst Hlth Charite, D-10117 Berlin, Germany
关键词
metabolomics; quality control; analytical variation; batch effects; MISSING VALUE IMPUTATION; MASS-SPECTROMETRY DATA; LARGE-SCALE; GAS-CHROMATOGRAPHY; EXPERIMENTAL-DESIGN; HUMAN SERUM; HPLC-MS; ALIGNMENT; PLASMA; BATCH;
D O I
10.3390/metabo13050665
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Untargeted metabolomics is an important tool in studying health and disease and is employed in fields such as biomarker discovery and drug development, as well as precision medicine. Although significant technical advances were made in the field of mass-spectrometry driven metabolomics, instrumental drifts, such as fluctuations in retention time and signal intensity, remain a challenge, particularly in large untargeted metabolomics studies. Therefore, it is crucial to consider these variations during data processing to ensure high-quality data. Here, we will provide recommendations for an optimal data processing workflow using intrastudy quality control (QC) samples that identifies errors resulting from instrumental drifts, such as shifts in retention time and metabolite intensities. Furthermore, we provide an in-depth comparison of the performance of three popular batch-effect correction methods of different complexity. By using different evaluation metrics based on QC samples and a machine learning approach based on biological samples, the performance of the batch-effect correction methods were evaluated. Here, the method TIGER demonstrated the overall best performance by reducing the relative standard deviation of the QCs and dispersion-ratio the most, as well as demonstrating the highest area under the receiver operating characteristic with three different probabilistic classifiers (Logistic regression, Random Forest, and Support Vector Machine). In summary, our recommendations will help to generate high-quality data that are suitable for further downstream processing, leading to more accurate and meaningful insights into the underlying biological processes.
引用
收藏
页数:16
相关论文
共 57 条
  • [1] Method Validation for Preparing Serum and Plasma Samples from Human Blood for Downstream Proteomic, Metabolomic, and Circulating Nucleic Acid-Based Applications
    Ammerlaan, Wim
    Trezzi, Jean-Pierre
    Lescuyer, Pierre
    Mathay, Conny
    Hiller, Karsten
    Betsou, Fay
    [J]. BIOPRESERVATION AND BIOBANKING, 2014, 12 (04) : 269 - 280
  • [2] UPLC-ESI-QTOF/MS and multivariate data analysis for blood plasma and serum metabolomics: Effect of experimental artefacts and anticoagulant
    Barri, Thaer
    Dragsted, Lars Ove
    [J]. ANALYTICA CHIMICA ACTA, 2013, 768 : 118 - 128
  • [3] High-throughput 1H NMR-based metabolic analysis of human serum and urine for large-scale epidemiological studies:: validation study
    Barton, Richard H.
    Nicholson, Jeremy K.
    Elliott, Paul
    Holmes, Elaine
    [J]. INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2008, 37 : 31 - 40
  • [4] Development and Performance of a Gas Chromatography-Time-of-Flight Mass Spectrometry Analysis for Large-Scale Nontargeted Metabolomic Studies of Human Serum
    Begley, Paul
    Francis-McIntyre, Sue
    Dunn, Warwick B.
    Broadhurst, David I.
    Halsall, Antony
    Tseng, Andy
    Knowles, Joshua
    Goodacre, Royston
    Kell, Douglas B.
    [J]. ANALYTICAL CHEMISTRY, 2009, 81 (16) : 7038 - 7046
  • [5] XCMS2:: Processing tandem mass spectrometry data for metabolite identification and structural characterization
    Benton, H. P.
    Wong, D. M.
    Trauger, S. A.
    Siuzdak, G.
    [J]. ANALYTICAL CHEMISTRY, 2008, 80 (16) : 6382 - 6389
  • [6] Removal of batch effects using stratified subsampling of metabolomic data for in vitro endocrine disruptors screening
    Boccard, Julien
    Tonoli, David
    Strajhar, Petra
    Jeanneret, Fabienne
    Odermatt, Alex
    Rudaz, Serge
    [J]. TALANTA, 2019, 195 : 77 - 86
  • [7] Guidelines and considerations for the use of system suitability and quality control samples in mass spectrometry assays applied in untargeted clinical metabolomic studies
    Broadhurst, David
    Goodacre, Royston
    Reinke, Stacey N.
    Kuligowski, Julia
    Wilson, Ian D.
    Lewis, Matthew R.
    Dunn, Warwick B.
    [J]. METABOLOMICS, 2018, 14 (06)
  • [8] Large-scale untargeted LC-MS metabolomics data correction using between-batch feature alignment and cluster-based within-batch signal intensity drift correction
    Brunius, Carl
    Shi, Lin
    Landberg, Rikard
    [J]. METABOLOMICS, 2016, 12 (11)
  • [9] Instrumental and experimental effects in LC-MS-based metabolomics
    Burton, Lyle
    Ivosev, Gordana
    Tate, Stephen
    Impey, Gary
    Wingate, Julie
    Bonner, Ron
    [J]. JOURNAL OF CHROMATOGRAPHY B-ANALYTICAL TECHNOLOGIES IN THE BIOMEDICAL AND LIFE SCIENCES, 2008, 871 (02): : 227 - 235
  • [10] Pyrolysis-mass spectrometry for rapid classification of oysters according to rearing area
    Cardinal, M
    Viallon, C
    Thonat, C
    Berdagué, JL
    [J]. ANALUSIS, 2000, 28 (09) : 825 - 829