Predicting low cognitive ability at age 5 years using perinatal data and machine learning

Times cited: 1
Authors
Bowe, Andrea K. [1]
Lightbody, Gordon [1, 2]
O'Boyle, Daragh S. [1]
Staines, Anthony [3]
Murray, Deirdre M. [1, 4]
Affiliations
[1] Univ Coll Cork, INFANT Res Ctr, Cork, Ireland
[2] Univ Coll Cork, Dept Elect & Elect Engn, Cork, Ireland
[3] Dublin City Univ, Sch Nursing Psychotherapy & Community Hlth, Dublin, Ireland
[4] Cork Univ Hosp, Dept Paediat, Cork, Ireland
Funding
Wellcome Trust (UK);
Keywords
SOCIOECONOMIC-STATUS; EARLY INTERVENTION; HOME-ENVIRONMENT; CUMULATIVE RISK; INTELLIGENCE; EXPECTATIONS; IQ;
DOI
10.1038/s41390-023-02914-6
Chinese Library Classification (CLC) number
R72 [Pediatrics];
Subject classification code
100202;
Abstract
Background: There are no early, accurate, scalable methods for identifying infants at high risk of poor cognitive outcomes in childhood. We aim to develop an explainable predictive model, using machine learning and population-based cohort data, for this purpose.
Methods: Data were from 8858 participants in the Growing Up in Ireland cohort, a nationally representative study of infants and their primary caregivers (PCGs). Maternal, infant, and socioeconomic characteristics were collected at 9-months and cognitive ability measured at age 5 years. Data preprocessing, synthetic minority oversampling, and feature selection were performed prior to training a variety of machine learning models using ten-fold cross validated grid search to tune hyperparameters. Final models were tested on an unseen test set.
Results: A random forest (RF) model containing 15 participant-reported features in the first year of infant life, achieved an area under the receiver operating characteristic curve (AUROC) of 0.77 for predicting low cognitive ability at age 5. This model could detect 72% of infants with low cognitive ability, with a specificity of 66%.
Conclusions: Model performance would need to be improved before consideration as a population-level screening tool. However, this is a first step towards early, individual, risk stratification to allow targeted childhood screening.
Impact:
This study is among the first to investigate whether machine learning methods can be used at a population-level to predict which infants are at high risk of low cognitive ability in childhood.
A random forest model using 15 features which could be easily collected in the perinatal period achieved an AUROC of 0.77 for predicting low cognitive ability.
Improved predictive performance would be required to implement this model at a population level but this may be a first step towards early, individual, risk stratification.
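The abstract reports three evaluation metrics: AUROC, sensitivity (the share of low-ability children detected), and specificity. The following is a minimal sketch of how these quantities are defined, in plain Python. The labels, scores, and the 0.5 threshold are invented toy values for illustration only; they are not data from the study, and the function names are hypothetical rather than taken from the authors' code.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(
    y_true: Sequence[int], y_pred: Sequence[int]
) -> Tuple[float, float]:
    """Return (sensitivity, specificity) for binary labels coded 0/1."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # missed positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false alarms
    return tp / (tp + fn), tn / (tn + fp)


def auroc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUROC as the probability that a randomly chosen positive case
    receives a higher score than a randomly chosen negative case
    (ties count as one half)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Toy example (assumed values, NOT study data): 1 = low cognitive ability.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.7, 0.6, 0.3, 0.8, 0.4, 0.35, 0.2, 0.1, 0.05]

threshold = 0.5  # assumed cut-off; the study's threshold is not stated here
y_pred = [1 if s >= threshold else 0 for s in scores]

sens, spec = sensitivity_specificity(y_true, y_pred)
area = auroc(y_true, scores)
print(f"sensitivity={sens:.2f} specificity={spec:.2f} AUROC={area:.2f}")
```

Sensitivity and specificity depend on the chosen threshold, whereas AUROC summarises ranking quality across all thresholds, which is why the abstract reports both.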
Pages: 1254-1264
Page count: 11
Related papers
61 in total
[1]   Early life determinants of low IQ at age 6 in children from the 2004 Pelotas Birth Cohort: a predictive approach [J].
Alberto Camargo-Figuera, Fabio ;
Barros, Aluisio J. D. ;
Santos, Ina S. ;
Matijasevich, Alicia ;
Barros, Fernando C. .
BMC PEDIATRICS, 2014, 14
[2]   Emotional Profile and Intellectual Functioning: A Comparison Among Children With Borderline Intellectual Functioning, Average Intellectual Functioning, and Gifted Intellectual Functioning [J].
Alesi, Marianna ;
Rappo, Gaetano ;
Pepi, Annamaria .
SAGE OPEN, 2015, 5 (03)
[3]  
American Psychiatric Association, 2013, DIAGN STAT MAN MENT, DOI 10.1176/appi.books.9780890425596
[4]  
[Anonymous], 2023, ICD-11 for mortality and morbidity statistics
[5]   Machine learning in clinical and epidemiological research: isn't it time for biostatisticians to work on it? [J].
Azzolina, Danila ;
Baldi, Ileana ;
Barbati, Giulia ;
Berchialla, Paola ;
Bottigliengo, Daniele ;
Bucci, Andrea ;
Calza, Stefano ;
Dolce, Pasquale ;
Edefonti, Valeria ;
Faragalli, Andrea ;
Fiorito, Giovanni ;
Gandin, Ilaria ;
Giudici, Fabiola ;
Gregori, Dario ;
Gregorio, Caterina ;
Ieva, Francesca ;
Lanera, Corrado ;
Lorenzoni, Giulia ;
Marchioni, Michele ;
Milanese, Alberto ;
Ricotti, Andrea ;
Sciannameo, Veronica ;
Solinas, Giuliana ;
Vezzoli, Marika .
EPIDEMIOLOGY BIOSTATISTICS AND PUBLIC HEALTH, 2019, 16 (04)
[6]   Association of Socioeconomic Status and Brain Injury With Neurodevelopmental Outcomes of Very Preterm Children [J].
Benavente-Fernandez, Isabel ;
Synnes, Anne ;
Grunau, Ruth E. ;
Chau, Vann ;
Ramraj, Chantel ;
Glass, Torin ;
Cayam-Rand, Dalit ;
Siddiqi, Arjumand ;
Miller, Steven P. .
JAMA NETWORK OPEN, 2019, 2 (05)
[7]   SMOTE for high-dimensional class-imbalanced data [J].
Blagus, Rok ;
Lusa, Lara .
BMC BIOINFORMATICS, 2013, 14
[8]   Predicting Low Cognitive Ability at Age 5-Feature Selection Using Machine Learning Methods and Birth Cohort Data [J].
Bowe, Andrea K. ;
Lightbody, Gordon ;
Staines, Anthony ;
Kiely, Mairead E. ;
McCarthy, Fergus P. ;
Murray, Deirdre M. .
INTERNATIONAL JOURNAL OF PUBLIC HEALTH, 2022, 67
[9]   Big data, machine learning, and population health: predicting cognitive outcomes in childhood [J].
Bowe, Andrea K. ;
Lightbody, Gordon ;
Staines, Anthony ;
Murray, Deirdre M. .
PEDIATRIC RESEARCH, 2023, 93 (02) :300-307
[10]   The predictive value of the ages and stages questionnaire in late infancy for low average cognitive ability at age 5 [J].
Bowe, Andrea K. ;
Hourihane, Jonathan ;
Staines, Anthony ;
Murray, Deirdre M. .
ACTA PAEDIATRICA, 2022, 111 (06) :1194-1200