Network Marker Selection for Untargeted LC-MS Metabolomics Data

被引:11
|
作者
Cai, Qingpo [1 ]
Alvarez, Jessica A. [2 ]
Kang, Jian [3 ]
Yu, Tianwei [1 ]
机构
[1] Emory Univ, Dept Biostat & Bioinformat, Atlanta, GA 30322 USA
[2] Emory Univ, Dept Med, Atlanta, GA 30322 USA
[3] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
关键词
metabolic network data; optimal matching; feature selection; BODY-MASS INDEX; OXIDATIVE STRESS; OBESITY; ASSOCIATION; PATHWAY; TISSUE; INFLAMMATION; ANNOTATION;
D O I
10.1021/acs.jproteome.6b00861
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Untargeted metabolomics using high-resolution liquid chromatographymass spectrometry (LCMS) is becoming one of the major areas of high-throughput biology. Functional analysis, that is, analyzing the data based on metabolic pathways or the genome-scale metabolic network, is critical in feature selection and interpretation of metabolomics data. One of the main challenges in the functional analyses is the lack of the feature identity in the LCMS data itself. By matching mass-to-charge ratio (m/z) values of the features to theoretical values derived from known metabolites, some features can be matched to one or more known metabolites. When multiple matchings occur, in most cases only one of the matchings can be true. At the same time, some known metabolites are missing in the measurements. Current network/pathway analysis methods ignore the uncertainty in metabolite identification and the missing observations, which could lead to errors in the selection of significant subnetworks/pathways. In this paper, we propose a flexible network feature selection framework that combines metabolomics data with the genome-scale metabolic network. The method adopts a sequential feature screening procedure and machine learning-based criteria to select important subnetworks and identify the optimal feature matching simultaneously. Simulation studies show that the proposed method has a much higher sensitivity than the commonly used maximal matching approach. For demonstration, we apply the method on a cohort of healthy subjects to detect subnetworks associated with the body mass index (BMI). The method identifies several subnetworks that are supported by the current literature, as well as detects some subnetworks with plausible new functional implications. The R code is available at http://web1.sph.emory.edu/users/ty u8/MSS.
引用
收藏
页码:1261 / 1269
页数:9
相关论文
共 50 条
  • [1] HeuSMA: A Multigradient LC-MS Strategy for Improving Peak Identification in Untargeted Metabolomics
    Chen, Yao-Yu
    An, Na
    Wang, Yan-Zhen
    Mei, Peng-Cheng
    Hao, Jun-Di
    Liu, Song-Mei
    Zhu, Quan-Fei
    Feng, Yu-Qi
    ANALYTICAL CHEMISTRY, 2025, : 7719 - 7728
  • [2] MS-CleanR: A Feature-Filtering Workflow for Untargeted LC-MS Based Metabolomics
    Fraisier-Vannier, Ophelie
    Chervin, Justine
    Cabanac, Guillaume
    Puech, Virginie
    Fournier, Sylvie
    Durand, Virginie
    Amiel, Aurelien
    Andre, Olivier
    Benamar, Omar Abdelaziz
    Dumas, Bernard
    Tsugawa, Hiroshi
    Marti, Guillaume
    ANALYTICAL CHEMISTRY, 2020, 92 (14) : 9971 - 9981
  • [3] Untargeted LC-MS Metabolomics Differentiates Between Virulent and Avirulent Clinical Strains of Pseudomonas aeruginosa
    Depke, Tobias
    Thoeming, Janne Gesine
    Kordes, Adrian
    Haeussler, Susanne
    Broenstrup, Mark
    BIOMOLECULES, 2020, 10 (07) : 1 - 21
  • [4] LC-MS based untargeted metabolomics studies of the metabolic response of Ginkgo biloba extract on arsenism patients
    Li, Weiwei
    Chen, Xiong
    Yao, Maolin
    Sun, Baofei
    Zhu, Kai
    Wang, Wenjuan
    Zhang, Aihua
    ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY, 2024, 274
  • [5] Missing value imputation for LC-MS metabolomics data by incorporating metabolic network and adduct ion relations
    Jin, Zhuxuan
    Kang, Jian
    Yu, Tianwei
    BIOINFORMATICS, 2018, 34 (09) : 1555 - 1561
  • [6] Using Untargeted LC-MS Metabolomics to Identify the Association of Biomarkers in Cattle Feces with Marbling Standard Longissimus Lumborum
    Chen, Dong
    Su, Minchao
    Zhu, He
    Zhong, Gang
    Wang, Xiaoyan
    Ma, Weimin
    Wanapat, Metha
    Tan, Zhiliang
    ANIMALS, 2022, 12 (17):
  • [7] MCnebula: Critical Chemical Classes for the Classification and Boost Identification by Visualization for Untargeted LC-MS/MS Data Analysis
    Huang, Lichuang
    Shan, Qiyuan
    Lyu, Qiang
    Zhang, Shuosheng
    Wang, Lu
    Cao, Gang
    ANALYTICAL CHEMISTRY, 2023, 95 (26) : 9940 - 9948
  • [8] Is Current Practice Adhering to Guidelines Proposed for Metabolite Identification in LC-MS Untargeted Metabolomics? A Meta-Analysis of the Literature
    Kodra, Dritan
    Pousinis, Petros
    Vorkas, Panagiotis A.
    Kademoglou, Katerina
    Liapikos, Theodoros
    Pechlivanis, Alexandros
    Virgiliou, Christina
    Wilson, Ian D.
    Gika, Helen
    Theodoridis, Georgios
    JOURNAL OF PROTEOME RESEARCH, 2022, 21 (03) : 590 - 598
  • [9] Comparing univariate filtration preceding and succeeding PLS-DA analysis on the differential variables/metabolites identified from untargeted LC-MS metabolomics data
    Xu, Suyun
    Bai, Caihong
    Chen, Yanli
    Yu, Lingling
    Wu, Wenjun
    Hu, Kaifeng
    ANALYTICA CHIMICA ACTA, 2024, 1287
  • [10] Metabolic signatures and risk of type 2 diabetes in a Chinese population: an untargeted metabolomics study using both LC-MS and GC-MS
    Lu, Yonghai
    Wang, Yeli
    Ong, Choon-Nam
    Subramaniam, Tavintharan
    Choi, Hyung Won
    Yuan, Jian-Min
    Koh, Woon-Puay
    Pan, An
    DIABETOLOGIA, 2016, 59 (11) : 2349 - 2359