Identification and validation of cuproptosis related genes and signature markers in bronchopulmonary dysplasia disease using bioinformatics analysis and machine learning

被引:9
作者
Jia, Mingxuan [1 ]
Li, Jieyi [2 ]
Zhang, Jingying [2 ]
Wei, Ningjing [3 ]
Yin, Yating [3 ]
Chen, Hui [2 ]
Yan, Shixing [4 ]
Wang, Yong [2 ]
机构
[1] Adcote Sch Shanghai, Shanghai 200000, Peoples R China
[2] Shanghai Literature Inst Tradit Chinese Med, Shanghai 200000, Peoples R China
[3] ChengZheng Wisdom Shanghai Hlth Sci & Technol Co L, Shanghai 200000, Peoples R China
[4] Shanghai Daosh Med Technol Co Ltd, Shanghai 200000, Peoples R China
关键词
Bronchopulmonary dysplasia disease; Cuproptosis; Machine learning; Biomarkers; Bioinformatics analysis;
D O I
10.1186/s12911-023-02163-x
中图分类号
R-058 [];
学科分类号
摘要
BackgroundBronchopulmonary Dysplasia (BPD) has a high incidence and affects the health of preterm infants. Cuproptosis is a novel form of cell death, but its mechanism of action in the disease is not yet clear. Machine learning, the latest tool for the analysis of biological samples, is still relatively rarely used for in-depth analysis and prediction of diseases.Methods and resultsFirst, the differential expression of cuproptosis-related genes (CRGs) in the GSE108754 dataset was extracted and the heat map showed that the expression of NFE2L2 gene was significantly higher in the control group whereas the expression of GLS gene was significantly higher in the treatment group. Chromosome location analysis showed that both the genes were positively correlated and associated with chromosome 2. The results of immune infiltration and immune cell differential analysis showed differences in the four immune cells, significantly in Monocytes cells. Five new pathways were analyzed through two subgroups based on consistent clustering of CRG expression. Weighted correlation network analysis (WGCNA) set the screening condition to the top 25% to obtain the disease signature genes. Four machine learning algorithms: Generalized Linear Models (GLM), Random Forest (RF), Support Vector Machine (SVM), and Extreme Gradient Boosting (XGB) were used to screen the disease signature genes, and the final five marker genes for disease prediction. The models constructed by GLM method were proved to be more accurate in the validation of two datasets, GSE190215 and GSE188944.ConclusionWe eventually identified two copper death-associated genes, NFE2L2 and GLS. A machine learning model-GLM was constructed to predict the prevalence of BPD disease, and five disease signature genes NFATC3, ERMN, PLA2G4A, MTMR9LP and LOC440700 were identified. These genes that were bioinformatics analyzed could be potential targets for identifying BPD disease and treatment.
引用
收藏
页数:11
相关论文
共 56 条
[1]   The adverse impact of obesity on heart rate variability is modified by a NFE2L2 gene variant: The SAPALDIA cohort [J].
Adam, Martin ;
Imboden, Medea ;
Schaffner, Emmanuel ;
Boes, Eva ;
Kronenberg, Florian ;
Pons, Marco ;
Bettschart, Robert ;
Barthelemy, Jean-Claude ;
Schindler, Christian ;
Probst-Hensch, Nicole .
INTERNATIONAL JOURNAL OF CARDIOLOGY, 2017, 228 :341-346
[2]   Altered Long Noncoding RNA Expression Profile in Multiple Myeloma Patients with Bisphosphonate-Induced Osteonecrosis of the Jaw [J].
Allegra, Alessandro ;
Mania, Manuela ;
D'Ascola, Angela ;
Oteri, Giacomo ;
Siniscalchi, Enrico Nastro ;
Avenoso, Angela ;
Innao, Vanessa ;
Scuruchi, Michele ;
Allegra, Andrea Gaetano ;
Musolino, Caterina ;
Campo, Salvatore .
BIOMED RESEARCH INTERNATIONAL, 2020, 2020
[3]   Protein Function Analysis through Machine Learning [J].
Avery, Chris ;
Patterson, John ;
Grear, Tyler ;
Frater, Theodore ;
Jacobs, Donald J. .
BIOMOLECULES, 2022, 12 (09)
[4]   Identification of a novel cuproptosis-related gene signature and integrative analyses in patients with lower-grade gliomas [J].
Bao, Jia-hao ;
Lu, Wei-cheng ;
Duan, Hao ;
Ye, Ya-qi ;
Li, Jiang-bo ;
Liao, Wen-ting ;
Li, Yong-chun ;
Sun, Yang-peng .
FRONTIERS IN IMMUNOLOGY, 2022, 13
[5]   NeuralNetTools: Visualization and Analysis Tools for Neural Networks [J].
Beck, Marcus W. .
JOURNAL OF STATISTICAL SOFTWARE, 2018, 85 (11)
[6]   Constructing bi-plots for random forest: Tutorial [J].
Blanchet, Lionel ;
Vitale, Raffaele ;
van Vorstenbosch, Robert ;
Stavropoulos, George ;
Pender, John ;
Jonkers, Daisy ;
van Schooten, Frederik-Jan ;
Smolinska, Agnieszka .
ANALYTICA CHIMICA ACTA, 2020, 1131 :146-155
[7]   Retip: Retention Time Prediction for Compound Annotation in Untargeted Metabolomics [J].
Bonini, Paolo ;
Kind, Tobias ;
Tsugawa, Hiroshi ;
Barupal, Dinesh Kumar ;
Fiehn, Oliver .
ANALYTICAL CHEMISTRY, 2020, 92 (11) :7515-7522
[8]   A broad cuproptosis landscape in inflammatory bowel disease [J].
Chen, Yuan ;
Li, Xinfang ;
Sun, Ran ;
Ji, Jiamin ;
Yang, Fan ;
Tian, Weiliang ;
Ji, Wu ;
Huang, Qian .
FRONTIERS IN IMMUNOLOGY, 2022, 13
[9]   Whole-genome sequencing of 508 patients identifies key molecular features associated with poor prognosis in esophageal squamous cell carcinoma [J].
Cui, Yongping ;
Chen, Hongyan ;
Xi, Ruibin ;
Cui, Heyang ;
Zhao, Yahui ;
Xu, Enwei ;
Yan, Ting ;
Lu, Xiaomei ;
Huang, Furong ;
Kong, Pengzhou ;
Li, Yang ;
Zhu, Xiaolin ;
Wang, Jiawei ;
Zhu, Wenjie ;
Wang, Jie ;
Ma, Yanchun ;
Zhou, Yong ;
Guo, Shiping ;
Zhang, Ling ;
Liu, Yiqian ;
Wang, Bin ;
Xi, Yanfeng ;
Sun, Ruifang ;
Yu, Xiao ;
Zhai, Yuanfang ;
Wang, Fang ;
Yang, Jian ;
Yang, Bin ;
Cheng, Caixia ;
Liu, Jing ;
Song, Bin ;
Li, Hongyi ;
Wang, Yi ;
Zhang, Yingchun ;
Cheng, Xiaolong ;
Zhan, Qimin ;
Li, Yanhong ;
Liu, Zhihua .
CELL RESEARCH, 2020, 30 (10) :902-913
[10]   Bronchopulmonary dysplasia: a crime of opportunity? [J].
Ericsson, Aaron C. .
EUROPEAN RESPIRATORY JOURNAL, 2020, 55 (05)