Predictive factors for allergy at 4-6 years of age based on machine learning: A pilot study

被引:5
作者
Kamphorst, Kim [1 ,2 ,7 ]
Lopez-Rincon, Alejandro [3 ,6 ]
Vlieger, Arine M. [2 ]
Garssen, Johan [3 ,5 ]
van 't Riet, Esther [4 ]
van Elburg, Ruurd M. [1 ]
机构
[1] Univ Amsterdam, Amsterdam UMC, Emma Childrens Hosp, Dept Pediat Amsterdam Gastroenterol Metab & Nutr,A, Amsterdam, Netherlands
[2] St Antonius Hosp, Dept Pediat, Nieuwegein, Netherlands
[3] Univ Utrecht, Utrecht Inst Pharmaceut Sci, Div Pharmacol, Utrecht, Netherlands
[4] Univ Med Ctr Utrecht, Dept Strategy & Policy, Res Off, Utrecht, Netherlands
[5] Danone Nutr Res, Utrecht, Netherlands
[6] Univ Med Ctr Utrecht, Julius Ctr Hlth Sci & Primary Care, Dept Data Sci, Utrecht, Netherlands
[7] Amsterdam UMC, Emma Childrens Hosp, Room H8 235, Meibergdreef 9, NL-1105 AZ Amsterdam, Netherlands
关键词
Prediction; Feature selection; AI; Artificial intelligence; Atopic disorders; Cytokines; CLASSIFICATION;
D O I
10.1016/j.phanu.2022.100326
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Background: In Europe, allergic diseases are the most common chronic childhood illnesses and the result of a complex interplay between genetics and environmental factors. A new approach for analyzing this complex data is to employ machine learning (ML) algorithms. Therefore, the aim of this pilot study was to find predictors for the presence of parental-reported allergy at 4-6 years of age by using feature selection in ML.Methods: A recursive ensemble feature selection (REFS) was used, with a 20% step reduction and with eight different classifiers in the ensemble, and resampling given the class unbalance. Thereafter, the Receiver Oper-ating Characteristic Curves for five different classifiers, not included in the original ensemble feature selection technique, were calculated.Results: In total, 130 children (14 with and 116 without parental-reported allergy) and 248 features were included in the ML analyses. The REFS algorithm showed a result of 20 features and particularly, the Multi-layer Perceptron Classifier had an area under the curve (AUC) of 0.86 (SD 0.08). The features predictive for allergy were: tobacco exposure during pregnancy, atopic parents, gestational age, days of: diarrhea, cough, rash, and fever during first year of life, ever being exposed to antibiotics, Resistin, IL-27, MMP9, CXCL8, CCL13, Vimentin, IL-4, CCL22, GAL1, IL-6, LIGHT, and GMCSF.Conclusions: This ML model shows that a combination of environmental exposures and cytokines can predict later allergy with an AUC of 0.86 despite the small sample size. In the future, our ML model still needs to be externally validated.
引用
收藏
页数:6
相关论文
共 29 条
[1]  
[Anonymous], 2004, P 21 INT C MACH LEAR
[2]   Learning from neighbours [J].
Bala, V ;
Goyal, S .
REVIEW OF ECONOMIC STUDIES, 1998, 65 (03) :595-621
[3]   Pasting small votes for classification in large databases and on-line [J].
Breiman, L .
MACHINE LEARNING, 1999, 36 (1-2) :85-103
[4]  
Breiman L., 2001, MACH LEARN, V45, P5
[5]   Mechanisms of allergic disease - environmental and genetic determinants for the development of allergy [J].
Campbell, D. E. ;
Boyle, R. J. ;
Thornton, C. A. ;
Prescott, S. L. .
CLINICAL AND EXPERIMENTAL ALLERGY, 2015, 45 (05) :844-858
[6]  
COX DR, 1958, J R STAT SOC B, V20, P215
[7]  
Crammer K, 2006, J MACH LEARN RES, V7, P551
[8]   Testing children for allergies: why, how, who and when An updated statement of the European Academy of Allergy and Clinical Immunology (EAACI) Section on Pediatrics and the EAACI-Clemens von Pirquet Foundation [J].
Eigenmann, P. A. ;
Atanaskovic-Markovic, M. ;
Hourihane, J. O'B ;
Lack, G. ;
Lau, S. ;
Matricardi, P. M. ;
Muraro, A. ;
Baranova, L. Namazova ;
Nieto, A. ;
Papadopoulos, N. G. ;
Rethy, L. A. ;
Roberts, G. ;
Rudzeviciene, O. ;
Wahn, U. ;
Wickman, M. ;
Host, A. .
PEDIATRIC ALLERGY AND IMMUNOLOGY, 2013, 24 (02) :195-209
[9]  
Fonti V., 2017, VU AMSTERDAM RES PAP, V30, P1
[10]   Greedy function approximation: A gradient boosting machine [J].
Friedman, JH .
ANNALS OF STATISTICS, 2001, 29 (05) :1189-1232