Identification of Diagnostic CpG Signatures in Patients with Gestational Diabetes Mellitus via Epigenome-Wide Association Study Integrated with Machine Learning

被引:7
作者
Liu, Yan [1 ]
Geng, Hui [2 ]
Duan, Bide [2 ]
Yang, Xiuzhi [2 ]
Ma, Airong [2 ]
Ding, Xiaoyan [2 ]
机构
[1] Nankai Univ, Tianjin Cent Hosp 1, Dept Obstet, Tianjin 300192, Peoples R China
[2] Zibo Cent Hosp, Dept Obstet, Zibo 255000, Shandong, Peoples R China
关键词
DNA METHYLATION PROFILES; BIOMARKERS; GENE; PATHOPHYSIOLOGY; VALIDATION; PREDICTION; DERIVATION; PROGNOSIS; EXPOSURE; PLACENTA;
D O I
10.1155/2021/1984690
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background. Gestational diabetes mellitus (GDM) is the most prevalent metabolic disease during pregnancy, but the diagnosis is controversial and lagging partly due to the lack of useful biomarkers. CpG methylation is involved in the development of GDM. However, the specific CpG methylation sites serving as diagnostic biomarkers of GDM remain unclear. Here, we aimed to explore CpG signatures and establish the predicting model for the GDM diagnosis. Methods. DNA methylation data of GSE88929 and GSE102177 were obtained from the GEO database, followed by the epigenome-wide association study (EWAS). GO and KEGG pathway analyses were performed by using the clusterProfiler package of R. The PPI network was constructed in the STRING database and Cytoscape software. The SVM model was established, in which the beta-values of selected CpG sites were the predictor variable and the occurrence of GDM was the outcome variable. Results. We identified 62 significant CpG methylation sites in the GDM samples compared with the control samples. GO and KEGG analyses based on the 62 CpG sites demonstrated that several essential cellular processes and signaling pathways were enriched in the system. A total of 12 hub genes related to the identified CpG sites were found in the PPI network. The SVM model based on the selected CpGs within the promoter region, including cg00922748, cg05216211, cg05376185, cg06617468, cg17097119, and cg22385669, was established, and the AUC values of the training set and testing set in the model were 0.8138 and 0.7576. The AUC value of the independent validation set of GSE102177 was 0.6667. Conclusion. We identified potential diagnostic CpG signatures by EWAS integrated with the SVM model. The SVM model based on the identified 6 CpG sites reliably predicted the GDM occurrence, contributing to the diagnosis of GDM. Our finding provides new insights into the cross-application of EWAS and machine learning in GDM investigation.
引用
收藏
页数:10
相关论文
共 81 条
[1]   Inflammatory and Other Biomarkers: Role in Pathophysiology and Prediction of Gestational Diabetes Mellitus [J].
Abell, Sally K. ;
De Courten, Barbora ;
Boyle, Jacqueline A. ;
Teede, Helena J. .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2015, 16 (06) :13442-13473
[2]   Machine Learning and the Cancer-Diagnosis Problem - No Gold Standard [J].
Adamson, Adewole S. ;
Welch, H. Gilbert .
NEW ENGLAND JOURNAL OF MEDICINE, 2019, 381 (24) :2285-2287
[3]   Bioinformatics Tools for Genome-Wide Epigenetic Research [J].
Angarica, Vladimir Espinosa ;
del Sol, Antonio .
NEUROEPIGENOMICS IN AGING AND DISEASE, 2017, 978 :489-512
[4]   Classification and Diagnosis of Diabetes [J].
不详 .
DIABETES CARE, 2015, 38 :S8-S16
[5]   Prediction of gestational diabetes based on nationwide electronic health records [J].
Artzi, Nitzan Shalom ;
Shilo, Smadar ;
Hadar, Eran ;
Rossman, Hagai ;
Barbash-Hazan, Shiri ;
Ben-Haroush, Avi ;
Balicer, Ran D. ;
Feldman, Becca ;
Wiznitzer, Arnon ;
Segal, Eran .
NATURE MEDICINE, 2020, 26 (01) :71-+
[6]   Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays [J].
Aryee, Martin J. ;
Jaffe, Andrew E. ;
Corrada-Bravo, Hector ;
Ladd-Acosta, Christine ;
Feinberg, Andrew P. ;
Hansen, Kasper D. ;
Irizarry, Rafael A. .
BIOINFORMATICS, 2014, 30 (10) :1363-1369
[7]   Epigenome-wide and transcriptome-wide analyses reveal gestational diabetes is associated with alterations in the human leukocyte antigen complex [J].
Binder, Alexandra M. ;
LaRocca, Jessica ;
Lesseur, Corina ;
Marsit, Carmen J. ;
Michels, Karin B. .
CLINICAL EPIGENETICS, 2015, 7
[8]   Analysing and interpreting DNA methylation data [J].
Bock, Christoph .
NATURE REVIEWS GENETICS, 2012, 13 (10) :705-719
[9]   Possible contribution of (pro) renin receptor to development of gestational diabetes mellitus [J].
Bokuda, Kanako ;
Ichihara, Atsuhiro .
WORLD JOURNAL OF DIABETES, 2014, 5 (06) :912-916
[10]   The potential role of biomarkers in predicting gestational diabetes [J].
Brink, Huguette S. ;
van der Lely, Aart Jan ;
van der Linden, Joke .
ENDOCRINE CONNECTIONS, 2016, 5 (05) :R26-R34