Partial Least Squares Discriminant Analysis and Bayesian Networks for Metabolomic Prediction of Childhood Asthma

被引:25
|
作者
Kelly, Rachel S. [1 ,2 ]
McGeachie, Michael J. [1 ,2 ]
Lee-Sarwar, Kathleen A. [1 ,2 ,3 ]
Kachroo, Priyadarshini [1 ,2 ]
Chu, Su H. [1 ,2 ]
Virkud, Yamini V. [1 ,4 ]
Huang, Mengna [1 ,2 ]
Litonjua, Augusto A. [1 ,2 ,5 ]
Weiss, Scott T. [1 ,2 ]
Lasky-Su, Jessica [1 ,2 ]
机构
[1] Brigham & Womens Hosp, Channing Div Network Med, 75 Francis St, Boston, MA 02115 USA
[2] Harvard Med Sch, Boston, MA 02115 USA
[3] Brigham & Womens Hosp, Div Rheumatol Immunol & Allergy, 75 Francis St, Boston, MA 02115 USA
[4] Massachusetts Gen Hosp Children, Dept Pediat, Boston, MA 02114 USA
[5] Univ Rochester, Med Ctr, Dept Pediat, Div Pediat Pulm Med, Rochester, NY 14642 USA
来源
METABOLITES | 2018年 / 8卷 / 04期
关键词
Partial Least-Squares Discriminant analysis; Bayesian networks; asthma; arginine metabolism; overfitting; OPERATING CHARACTERISTIC CURVES; CLASSIFICATION; MICROBIOME;
D O I
10.3390/metabo8040068
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
To explore novel methods for the analysis of metabolomics data, we compared the ability of Partial Least Squares Discriminant Analysis (PLS-DA) and Bayesian networks (BN) to build predictive plasma metabolite models of age three asthma status in 411 three year olds (n = 59 cases and 352 controls) from the Vitamin D Antenatal Asthma Reduction Trial (VDAART) study. The standard PLS-DA approach had impressive accuracy for the prediction of age three asthma with an Area Under the Curve Convex Hull (AUCCH) of 81%. However, a permutation test indicated the possibility of overfitting. In contrast, a predictive Bayesian network including 42 metabolites had a significantly higher AUCCH of 92.1% (p for difference <0.001), with no evidence that this accuracy was due to overfitting. Both models provided biologically informative insights into asthma; in particular, a role for dysregulated arginine metabolism and several exogenous metabolites that deserve further investigation as potential causative agents. As the BN model outperformed the PLS-DA model in both accuracy and decreased risk of overfitting, it may therefore represent a viable alternative to typical analytical approaches for the investigation of metabolomics data.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Application of partial least squares discriminant analysis and variable selection procedures: a 2D-PAGE proteomic study
    Marengo, Emilio
    Robotti, Elisa
    Bobba, Marco
    Milli, Alberto
    Campostrini, Natascia
    Righetti, Sabina Carla
    Cecconi, Daniela
    Righetti, Pier Giorgio
    ANALYTICAL AND BIOANALYTICAL CHEMISTRY, 2008, 390 (05) : 1327 - 1342
  • [32] Gait variability-based classification of the stages of the cognitive decline using partial least squares-discriminant analysis
    Kwak, Kiyoung
    Kostic, Emilija
    Kim, Dongwook
    SCIENCE PROGRESS, 2023, 106 (04)
  • [33] Efficient and Simplified Modeling for Kerosene Processing Quality Detection Using Partial Least Squares-Discriminant Analysis Regression
    Issa, Hayder M.
    Salih, Rezan H. Hama
    ARO-THE SCIENTIFIC JOURNAL OF KOYA UNIVERSITY, 2024, 12 (01): : 135 - 142
  • [34] Discrimination of healthy and osteoarthritic articular cartilages by Fourier transform infrared imaging and partial least squares-discriminant analysis
    Zhang, Xue-Xi
    Yin, Jian-Hua
    Mao, Zhi-Hua
    Xia, Yang
    JOURNAL OF BIOMEDICAL OPTICS, 2015, 20 (06)
  • [35] The importance of balanced data sets for partial least squares discriminant analysis: classification problems using hyperspectral imaging data
    Lindstrom, Susanne W.
    Geladi, Paul
    Jonsson, Oskar
    Pettersson, Fredrik
    JOURNAL OF NEAR INFRARED SPECTROSCOPY, 2011, 19 (04) : 233 - 241
  • [36] Combining bootstrap and uninformative variable elimination: Chemometric identification of metabonomic biomarkers by nonparametric analysis of discriminant partial least squares
    Sun, Xiao-Ming
    Yu, Xiao-Ping
    Liu, Yun
    Xu, Lu
    Di, Duo-Long
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2012, 115 : 37 - 43
  • [37] Application of partial least squares discriminant analysis and variable selection procedures: a 2D-PAGE proteomic study
    Emilio Marengo
    Elisa Robotti
    Marco Bobba
    Alberto Milli
    Natascia Campostrini
    Sabina Carla Righetti
    Daniela Cecconi
    Pier Giorgio Righetti
    Analytical and Bioanalytical Chemistry, 2008, 390 : 1327 - 1342
  • [38] Qualitative in situ analysis of multiple solid-state forms using spectroscopy and partial least squares discriminant modeling
    Kogermann, Karin
    Aaltonen, Jaakko
    Strachan, Clare J.
    Pollanen, Kati
    Veski, Peep
    Heinamaki, Jyrki
    Yliruusi, Jouko
    Rantanen, Jukka
    JOURNAL OF PHARMACEUTICAL SCIENCES, 2007, 96 (07) : 1802 - 1820
  • [39] Discrimination of Salix caprea, Salix gracilistyla, and Their Interspecific Hybrid Using Vegetative Characteristics and Partial Least Squares Discriminant Analysis
    Seo, Han-Na
    Lim, Hyo-In
    Kim, Yong-Yul
    Chae, Seung-Beom
    Cho, Wonwoo
    HORTSCIENCE, 2021, 56 (10) : 1230 - 1238
  • [40] Numerically stable locality-preserving partial least squares discriminant analysis for efficient dimensionality reduction and classification of high-dimensional data
    Ahmad, Noor Atinah
    HELIYON, 2024, 10 (04)