Partial Least Squares Discriminant Analysis and Bayesian Networks for Metabolomic Prediction of Childhood Asthma

Cited by: 25
Authors
Kelly, Rachel S. [1, 2]
McGeachie, Michael J. [1, 2]
Lee-Sarwar, Kathleen A. [1, 2, 3]
Kachroo, Priyadarshini [1, 2]
Chu, Su H. [1, 2]
Virkud, Yamini V. [1, 4]
Huang, Mengna [1, 2]
Litonjua, Augusto A. [1, 2, 5]
Weiss, Scott T. [1, 2]
Lasky-Su, Jessica [1, 2]
Affiliations
[1] Brigham & Womens Hosp, Channing Div Network Med, 75 Francis St, Boston, MA 02115 USA
[2] Harvard Med Sch, Boston, MA 02115 USA
[3] Brigham & Womens Hosp, Div Rheumatol Immunol & Allergy, 75 Francis St, Boston, MA 02115 USA
[4] Massachusetts Gen Hosp Children, Dept Pediat, Boston, MA 02114 USA
[5] Univ Rochester, Med Ctr, Dept Pediat, Div Pediat Pulm Med, Rochester, NY 14642 USA
Source
METABOLITES | 2018 / Vol. 8 / Issue 04
Keywords
Partial Least-Squares Discriminant Analysis; Bayesian networks; asthma; arginine metabolism; overfitting; operating characteristic curves; classification; microbiome
DOI
10.3390/metabo8040068
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
To explore novel methods for the analysis of metabolomics data, we compared the ability of Partial Least Squares Discriminant Analysis (PLS-DA) and Bayesian networks (BN) to build predictive plasma metabolite models of asthma status at age three in 411 three-year-olds (59 cases and 352 controls) from the Vitamin D Antenatal Asthma Reduction Trial (VDAART). The standard PLS-DA approach had impressive accuracy for the prediction of age-three asthma, with an Area Under the Curve Convex Hull (AUCCH) of 81%. However, a permutation test indicated the possibility of overfitting. In contrast, a predictive Bayesian network including 42 metabolites had a significantly higher AUCCH of 92.1% (p for difference < 0.001), with no evidence that this accuracy was due to overfitting. Both models provided biologically informative insights into asthma; in particular, they pointed to a role for dysregulated arginine metabolism and to several exogenous metabolites that deserve further investigation as potential causative agents. Because the BN model outperformed the PLS-DA model in accuracy and showed a lower risk of overfitting, it may represent a viable alternative to typical analytical approaches for the investigation of metabolomics data.
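The permutation check mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the data are synthetic, the model is a single-component PLS fit on centered (unscaled) features, and the statistic is a plain rank-based AUC on the training data rather than the AUCCH reported in the paper. All function names (`auc`, `fit_pls1`, `permutation_test`, `make_data`) are hypothetical and chosen only for illustration.

```python
import random

def auc(scores, labels):
    """Rank-based AUC (Mann-Whitney); ties between a case and a control count 0.5."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def fit_pls1(X, y):
    """First PLS component: weights proportional to Xc^T yc on centered data."""
    n, p = len(X), len(X[0])
    col_means = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(y) / n
    w = [sum((X[i][j] - col_means[j]) * (y[i] - y_mean) for i in range(n))
         for j in range(p)]
    norm = sum(v * v for v in w) ** 0.5 or 1.0  # guard against a zero vector
    return col_means, [v / norm for v in w]

def project(X, col_means, w):
    """Component scores t = Xc w, used here as the discriminant score."""
    return [sum((x - m) * wj for x, m, wj in zip(row, col_means, w)) for row in X]

def fitted_auc(X, y):
    """Fit on (X, y) and return the AUC on the same data (resubstitution)."""
    col_means, w = fit_pls1(X, y)
    return auc(project(X, col_means, w), y)

def permutation_test(X, y, n_perm=200, seed=0):
    """Refit on shuffled labels to build a null distribution of the AUC."""
    rng = random.Random(seed)
    observed = fitted_auc(X, y)
    null = []
    for _ in range(n_perm):
        y_perm = y[:]
        rng.shuffle(y_perm)  # breaks any real link between X and y
        null.append(fitted_auc(X, y_perm))
    # Add-one correction keeps the p-value strictly positive.
    p_value = (1 + sum(a >= observed for a in null)) / (1 + n_perm)
    return observed, null, p_value

def make_data(n=40, p=30, n_cases=8, shift=1.5, seed=1):
    """Synthetic stand-in (assumption): only the first two features carry signal."""
    rng = random.Random(seed)
    y = [1] * n_cases + [0] * (n - n_cases)
    X = [[rng.gauss(0, 1) + (shift if yi == 1 and j < 2 else 0.0)
          for j in range(p)] for yi in y]
    return X, y

X, y = make_data()
observed, null, p_value = permutation_test(X, y)
print(f"observed AUC: {observed:.3f}")
print(f"mean AUC under permuted labels: {sum(null) / len(null):.3f}")
print(f"permutation p-value: {p_value:.3f}")
```

With many features relative to samples, the mean AUC under permuted labels tends to sit well above 0.5 even though the labels are random. That gap is the reason an apparently high accuracy needs to be compared against a permutation null before it is trusted.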
Pages: 18
Related papers
50 in total
  • [21] A tutorial review: Metabolomics and partial least squares-discriminant analysis - a marriage of convenience or a shotgun wedding
    Gromski, Piotr S.
    Muhamadali, Howbeer
    Ellis, David I.
    Xu, Yun
    Correa, Elon
    Turner, Michael L.
    Goodacre, Royston
    ANALYTICA CHIMICA ACTA, 2015, 879 : 10 - 23
  • [22] Least squares Support Vector Machine regression for discriminant analysis
    Van Gestel, T
    Suykens, JAK
    De Brabanter, J
    De Moor, B
    Vandewalle, J
    IJCNN'01: INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2001 : 2445 - 2450
  • [23] PARTIAL LEAST SQUARES PREDICTION IN HIGH-DIMENSIONAL REGRESSION
    Cook, R. Dennis
    Forzani, Liliana
    ANNALS OF STATISTICS, 2019, 47 (02) : 884 - 908
  • [24] Bankruptcy prediction using Partial Least Squares Logistic Regression
    Ben Jabeur, Sami
    JOURNAL OF RETAILING AND CONSUMER SERVICES, 2017, 36 : 197 - 202
  • [25] Partial Least Squares Discriminant Analysis Model Based on Variable Selection Applied to Identify the Adulterated Olive Oil
    Xinhui Li
    Sulan Wang
    Weimin Shi
    Qi Shen
    Food Analytical Methods, 2016, 9 : 1713 - 1718
  • [26] Identification of edible oils using terahertz spectroscopy combined with genetic algorithm and partial least squares discriminant analysis
    Yin, Ming
    Tang, Shoufeng
    Tong, Minming
    ANALYTICAL METHODS, 2016, 8 (13) : 2794 - 2798
  • [27] Boosting partial least-squares discriminant analysis with application to near infrared spectroscopic tea variety discrimination
    Tan, Shi-Miao
    Luo, Rui-Min
    Zhou, Yan-Ping
    Xu, Hui
    Song, Dan-Dan
    Ze, Tan
    Yang, Tian-Ming
    Nie, Yan
    JOURNAL OF CHEMOMETRICS, 2012, 26 (01) : 34 - 39
  • [28] A least squares formulation of multi-label linear discriminant analysis
    Shu, Xin
    Xu, Huanliang
    Tao, Liang
    NEUROCOMPUTING, 2015, 156 : 221 - 230
  • [29] Discrimination Between Producing Regions of Brazilian Propolis by UV-VIS Spectroscopy and Partial Least Squares Discriminant Analysis
    Nascimento Paganotti, Rosilene Silva
    Rezende, Jeob de Castro
    Sanches Barbeira, Paulo Jorge
    CURRENT ANALYTICAL CHEMISTRY, 2014, 10 (04) : 537 - 544
  • [30] Nondestructive Discrimination of Pharmaceutical Preparations Using Near-Infrared Spectroscopy and Partial Least-Squares Discriminant Analysis
    Chen, Hui
    Lin, Zan
    Tan, Chao
    ANALYTICAL LETTERS, 2018, 51 (04) : 564 - 574