A Comprehensive Mass Spectrometry-Based Workflow for Clinical Metabolomics Cohort Studies

被引:6
作者
Shi, Zhan [1 ]
Li, Haohui [1 ]
Zhang, Wei [2 ]
Chen, Youxiang [2 ]
Zeng, Chunyan [2 ]
Kang, Xiuhua [2 ]
Xu, Xinping [2 ]
Xia, Zhenkun [3 ]
Qing, Bei [3 ]
Yuan, Yunchang [3 ]
Song, Guodong [4 ]
Caldana, Camila [5 ]
Hu, Junyuan [1 ]
Willmitzer, Lothar [5 ]
Li, Yan [1 ]
机构
[1] Metanotitia Inc, 59 Gaoxin South 9th Rd,Yuehai St, Shenzhen 518056, Peoples R China
[2] Nanchang Univ, Affiliated Hosp 1, 17 Yongwaizheng St, Nanchang 330209, Peoples R China
[3] Cent South Univ, Xiangya Hosp 2, Changsha 410011, Peoples R China
[4] Tianjin Med Univ, Hosp 2, 23 Pingjiang Rd, Tianjin 300211, Peoples R China
[5] Max Planck Inst Mol Plant Physiol, Potsdam Sci Pk,Muehlenberg 1, D-14476 Potsdam, Germany
基金
中国国家自然科学基金;
关键词
metabolomics; clinical cohort; LC-MS; GC-MS; quality control; data normalization; data modeling; BIOMARKER DISCOVERY; COLORECTAL-CANCER; ANNOTATION; DIAGNOSIS; QUANTIFICATION; SERUM; C-13; NMR;
D O I
10.3390/metabo12121168
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
As a comprehensive analysis of all metabolites in a biological system, metabolomics is being widely applied in various clinical/health areas for disease prediction, diagnosis, and prognosis. However, challenges remain in dealing with the metabolomic complexity, massive data, metabolite identification, intra- and inter-individual variation, and reproducibility, which largely limit its widespread implementation. This study provided a comprehensive workflow for clinical metabolomics, including sample collection and preparation, mass spectrometry (MS) data acquisition, and data processing and analysis. Sample collection from multiple clinical sites was strictly carried out with standardized operation procedures (SOP). During data acquisition, three types of quality control (QC) samples were set for respective MS platforms (GC-MS, LC-MS polar, and LC-MS lipid) to assess the MS performance, facilitate metabolite identification, and eliminate contamination. Compounds annotation and identification were implemented with commercial software and in-house-developed PAppLine (TM) and Ulib(MS) library. The batch effects were removed using a deep learning model method (NormAE). Potential biomarkers identification was performed with tree-based modeling algorithms including random forest, AdaBoost, and XGBoost. The modeling performance was evaluated using the F1 score based on a 10-times repeated trial for each. Finally, a sub-cohort case study validated the reliability of the entire workflow.
引用
收藏
页数:20
相关论文
共 58 条
  • [1] Mass spectrometry-based metabolomics: a guide for annotation, quantification and best reporting practices
    Alseekh, Saleh
    Aharoni, Asaph
    Brotman, Yariv
    Contrepois, Kevin
    D'Auria, John
    Ewald, Jan
    Ewald, Jennifer C.
    Fraser, Paul D.
    Giavalisco, Patrick
    Hall, Robert D.
    Heinemann, Matthias
    Link, Hannes
    Luo, Jie
    Neumann, Steffen
    Nielsen, Jens
    de Souza, Leonardo Perez
    Saito, Kazuki
    Sauer, Uwe
    Schroeder, Frank C.
    Schuster, Stefan
    Siuzdak, Gary
    Skirycz, Aleksandra
    Sumner, Lloyd W.
    Snyder, Michael P.
    Tang, Huiru
    Tohge, Takayuki
    Wang, Yulan
    Wen, Weiwei
    Wu, Si
    Xu, Guowang
    Zamboni, Nicola
    Fernie, Alisdair R.
    [J]. NATURE METHODS, 2021, 18 (07) : 747 - 756
  • [2] Metabolomics 20years on: what have we learned and what hurdles remain?
    Alseekh, Saleh
    Fernie, Alisdair R.
    [J]. PLANT JOURNAL, 2018, 94 (06) : 933 - 942
  • [3] Taking the leap between analytical chemistry and artificial intelligence: A tutorial review
    Ayres, Lucas B.
    Gomez, Federico J. V.
    Linton, Jeb R.
    Silva, Maria F.
    Garcia, Carlos D.
    [J]. ANALYTICA CHIMICA ACTA, 2021, 1161
  • [4] Metabolic fingerprinting of high-fat plasma samples processed by centrifugation- and filtration-based protein precipitation delineates significant differences in metabolite information coverage
    Barri, Thaer
    Holmer-Jensen, Jens
    Hermansen, Kjeld
    Dragsted, Lars O.
    [J]. ANALYTICA CHIMICA ACTA, 2012, 718 : 47 - 57
  • [5] Human Biospecimen Research: Experimental Protocol and Quality Control Tools
    Betsou, Fotini
    Barnes, Rebecca
    Burke, Thomas
    Coppola, Domenico
    DeSouza, Yvonne
    Eliason, James
    Glazer, Barbara
    Horsfall, David
    Kleeberger, Cynthia
    Lehmann, Sylvain
    Prasad, Anil
    Skubitz, Amy
    Somiari, Stella
    Guntell, Elaine
    [J]. CANCER EPIDEMIOLOGY BIOMARKERS & PREVENTION, 2009, 18 (04) : 1017 - 1025
  • [6] Guidelines and considerations for the use of system suitability and quality control samples in mass spectrometry assays applied in untargeted clinical metabolomic studies
    Broadhurst, David
    Goodacre, Royston
    Reinke, Stacey N.
    Kuligowski, Julia
    Wilson, Ian D.
    Lewis, Matthew R.
    Dunn, Warwick B.
    [J]. METABOLOMICS, 2018, 14 (06)
  • [7] Reliability of Serum Metabolites over a Two-Year Period: A Targeted Metabolomic Approach in Fasting and Non-Fasting Samples from EPIC
    Carayol, Marion
    Licaj, Idlir
    Achaintre, David
    Sacerdote, Carlotta
    Vineis, Paolo
    Key, Timothy J.
    Moret, N. Charlotte Onland
    Scalbert, Augustin
    Rinaldi, Sabina
    Ferrari, Pietro
    [J]. PLOS ONE, 2015, 10 (08):
  • [8] Metabolomics: an emerging but powerful tool for precision medicine
    Clish, Clary B.
    [J]. COLD SPRING HARBOR MOLECULAR CASE STUDIES, 2015, 1 (01):
  • [9] Clinical metabolomics paves the way towards future healthcare strategies
    Collino, Sebastiano
    Martin, Francois-Pierre J.
    Rezzi, Serge
    [J]. BRITISH JOURNAL OF CLINICAL PHARMACOLOGY, 2013, 75 (03) : 619 - 629
  • [10] TargetSearch - a Bioconductor package for the efficient preprocessing of GC-MS metabolite profiling data
    Cuadros-Inostroza, Alvaro
    Caldana, Camila
    Redestig, Henning
    Kusano, Miyako
    Lisec, Jan
    Pena-Cortes, Hugo
    Willmitzer, Lothar
    Hannah, Matthew A.
    [J]. BMC BIOINFORMATICS, 2009, 10