A comparison of 12 machine learning models developed to predict ploidy, using a morphokinetic meta-dataset of 8147 embryos

被引:25
作者
Bamford, Thomas [1 ,3 ]
Easter, Christina [1 ]
Montgomery, Sue [2 ]
Smith, Rachel [2 ]
Dhillon-Smith, Rima K. [1 ]
Barrie, Amy [2 ]
Campbell, Alison [2 ]
Coomarasamy, Arri [1 ]
机构
[1] Univ Birmingham, Inst Metab & Syst Res, Coll Med & Dent Sci, Tommys Natl Ctr Miscarriage Res, Edgbaston, England
[2] Care Fertil Headquarters, Nottingham, England
[3] CARE Fertil Manchester, 108-112 Daisy Bank Rd, Manchester M14 5QH, England
关键词
morphokinetcs; ploidy; model; artificial intelligence; machine learning; euploid; aneuploid; TIME; ANEUPLOIDY; RISK; CLASSIFICATION; DIAGNOSIS;
D O I
10.1093/humrep/dead034
中图分类号
R71 [妇产科学];
学科分类号
100211 ;
摘要
STUDY QUESTION Are machine learning methods superior to traditional statistics in predicting blastocyst ploidy status using morphokinetic and clinical biodata? SUMMARY ANSWER Mixed effects logistic regression performed better than all machine learning methods for ploidy prediction using our dataset of 8147 embryos. WHAT IS KNOWN ALREADY Morphokinetic timings have been demonstrated to be delayed in aneuploid embryos. Machine learning and statistical models are increasingly being built, however, until now they have been limited by data insufficiency. STUDY DESIGN, SIZE, DURATION This is a multicentre cohort study. Data were obtained from 8147 biopsied blastocysts from 1725 patients, treated from 2012 to 2020. PARTICIPANTS/MATERIALS, SETTING, METHODS All embryos were cultured in a time-lapse system at nine IVF clinics in the UK. A total of 3004 euploid embryos and 5023 aneuploid embryos were included in the final verified dataset. We developed a total of 12 models using four different approaches: mixed effects multivariable logistic regression, random forest classifiers, extreme gradient boosting, and deep learning. For each of the four algorithms, two models were created, the first consisting of 22 covariates using 8027 embryos (Dataset 1) and the second, a dataset of 2373 embryos and 26 covariates (Dataset 2). Four final models were created by switching the target outcome from euploid to aneuploid for each algorithm (Dataset 1). Models were validated using internal-external cross-validation and external validation. MAIN RESULTS AND THE ROLE OF CHANCE All morphokinetic variables were significantly delayed in aneuploid embryos. The likelihood of euploidy was significantly increased the more expanded the blastocyst (P < 0.001) and the better the trophectoderm grade (P < 0.01). Univariable analysis showed no association with ploidy status for morula or cleavage stage fragmentation, morula grade, fertilization method, sperm concentration, or progressive motility. Male age did not correlate with the percentage of euploid embryos when stratified for female age. Multinucleation at the two-cell or four-cell stage was not associated with ploidy status. The best-performing model was logistic regression built using the larger dataset with 22 predictors (F1 score 0.59 for predicting euploidy; F1 score 0.77 for predicting aneuploidy; AUC 0.71; 95% CI 0.67-0.73). The best-performing models using the algorithms from random forest, extreme gradient boosting, and deep learning achieved an AUC of 0.68, 0.63, and 0.63, respectively. When using only morphokinetic predictors the AUC was 0.61 for predicting ploidy status, whereas a model incorporating only embryo grading was unable to discriminate aneuploid embryos (AUC = 0.52). The ploidy prediction model's performance improved with increasing age of the egg provider. LIMITATIONS, REASONS FOR CAUTION The models have not been validated in a prospective study design or yet been used to determine whether they improve clinical outcomes WIDER IMPLICATIONS OF THE FINDINGS This model may aid decision-making, particularly where pre-implantation genetic testing for aneuploidy is not permitted or for prioritizing embryos for biopsy. STUDY FUNDING/COMPETING INTEREST(S) No specific funding was sought for this study; university funds supported the first author. A.Ca. is a minor shareholder of participating centres.
引用
收藏
页码:569 / 581
页数:13
相关论文
共 54 条
[1]   Interpretable, not black-box, artificial intelligence should be used for embryo selection [J].
Afnan, Michael Anis Mihdi ;
Liu, Yanhe ;
Conitzer, Vincent ;
Rudin, Cynthia ;
Mishra, Abhishek ;
Savulescu, Julian ;
Afnan, Masoud .
HUMAN REPRODUCTION OPEN, 2021, 2021 (04)
[2]   Time-lapse systems for embryo incubation and assessment in assisted reproduction [J].
Armstrong, Sarah ;
Bhide, Priya ;
Jordan, Vanessa ;
Pacey, Allan ;
Marjoribanks, Jane ;
Farquhar, Cindy .
COCHRANE DATABASE OF SYSTEMATIC REVIEWS, 2019, (05)
[3]  
Ayyadevara V.K., 2018, PROMACHINE LEARNING, P105, DOI DOI 10.1007/978-1-4842-3564-5_5
[4]   Morphological and morphokinetic associations with aneuploidy: a systematic review and meta-analysis [J].
Bamford, Thomas ;
Barrie, Amy ;
Montgomery, Sue ;
Dhillon-Smith, Rima ;
Campbell, Alison ;
Easter, Christina ;
Coomarasamy, Arri .
HUMAN REPRODUCTION UPDATE, 2022, 28 (05) :656-686
[5]  
Barrie A., 2021, HUM FERTIL, V24, P46
[6]   Preliminary investigation of the prevalence and implantation potential of abnormal embryonic phenotypes assessed using time-lapse imaging [J].
Barrie, Amy ;
Homburg, Roy ;
McDowell, Garry ;
Brown, Jeremy ;
Kingsland, Charles ;
Troup, Stephen .
REPRODUCTIVE BIOMEDICINE ONLINE, 2017, 34 (05) :455-462
[7]   Increasing the probability of selecting chromosomally normal embryos by time-lapse morphokinetics analysis [J].
Basile, Natalia ;
del Carmen Nogales, Maria ;
Bronet, Fernando ;
Florensa, Mireia ;
Riqueiros, Marissa ;
Rodrigo, Lorena ;
Garcia-Velasco, Juan ;
Meseguer, Marcos .
FERTILITY AND STERILITY, 2014, 101 (03) :699-+
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   Retrospective analysis of outcomes after IVF using an aneuploidy risk model derived from time-lapse imaging without PGS [J].
Campbell, Alison ;
Fishel, Simon ;
Bowman, Natalie ;
Duffy, Samantha ;
Sedler, Mark ;
Thornton, Simon .
REPRODUCTIVE BIOMEDICINE ONLINE, 2013, 27 (02) :140-146