Bipolar disorder: Construction and analysis of a joint diagnostic model using random forest and feedforward neural networks

被引:1
|
作者
Sun, Ping [1 ,2 ]
Wang, Xiangwen [1 ,3 ]
Wang, Shenghai [1 ]
Jia, Xueyu [8 ]
Feng, Shunkang [1 ]
Chen, Jun [2 ,4 ,5 ,6 ]
Fang, Yiru [2 ,4 ,5 ,6 ,7 ]
机构
[1] Qingdao Mental Hlth Ctr, Qingdao 266034, Shandong, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Med, Shanghai Mental Hlth Ctr, Clin Res Ctr, Shanghai 200030, Peoples R China
[3] Jining Med Univ, Res Inst Mental Hlth, Sch Mental Hlth, Jining 272002, Shandong, Peoples R China
[4] Shanghai Jiao Tong Univ, Ruijin Hosp, Dept Psychiat, Sch Med, Shanghai 200025, Peoples R China
[5] Shanghai Jiao Tong Univ, Ruijin Hosp, Affect Disorders Ctr, Sch Med, Shanghai 200025, Peoples R China
[6] Shanghai Key Lab Psychot Disorders, Shanghai 201108, Peoples R China
[7] Chinese Acad Sci, State Key Lab Neurosci, Shanghai Inst Biol Sci, Shanghai 200031, Peoples R China
[8] Qingdao Univ, Dept Med, Qingdao 266000, Shandong, Peoples R China
来源
IBRO NEUROSCIENCE REPORTS | 2024年 / 17卷
基金
中国国家自然科学基金;
关键词
Bipolar disorder; Machine learning; Neural networks; Diagnostic models; GENE-EXPRESSION;
D O I
10.1016/j.ibneur.2024.07.007
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Background: To construct a diagnostic model for Bipolar Disorder (BD) depressive phase using peripheral tissue RNA data from patients and combining Random Forest with Feedforward Neural Network methods. Methods: Datasets GSE23848, GSE39653, and GSE69486 were selected, and differential gene expression analysis was conducted using the limma package in R. Key genes from the differentially expressed genes were identified using the Random Forest method. These key genes' expression levels in each sample were used to train a Feedforward Neural Network model. Techniques like L1 regularization, early stopping, and dropout layers were employed to prevent model overfitting. Model performance was then validated, followed by GO, KEGG, and protein-protein interaction network analyses. Results: The final model was a Feedforward Neural Network with two hidden layers and two dropout layers, comprising 2345 trainable parameters. Model performance on the validation set, assessed through 1000 bootstrap resampling iterations, demonstrated a specificity of 0.769(95% CI 0.571-1.000), sensitivity of 0.818 (95% CI 0.533-1.000), AUC value of 0.832 (95 % CI 0.642-0.979), and accuracy of 0.792 (95 % CI 0.625-0.958). Enrichment analysis of key genes indicated no significant enrichment in any known pathways. Conclusion: Key genes with biological significance were identified based on the decrease in Gini coefficient within the Random Forest model. The combined use of Random Forest and Feedforward Neural Network to establish a diagnostic model showed good classification performance in Bipolar Disorder.
引用
收藏
页码:145 / 153
页数:9
相关论文
共 45 条
  • [31] Analysis of an individual-based influenza epidemic model using random forest metamodels and adaptive sequential sampling
    Edali, Mert
    Yucel, Gonenc
    SYSTEMS RESEARCH AND BEHAVIORAL SCIENCE, 2020, 37 (06) : 936 - 958
  • [32] bvnGPS: a generalizable diagnostic model for acute bacterial and viral infection using integrative host transcriptomics and pretrained neural networks
    Li, Qizhi
    Zheng, Xubin
    Xie, Jize
    Wang, Ran
    Li, Mengyao
    Wong, Man-Hon
    Leung, Kwong-Sak
    Li, Shuai
    Geng, Qingshan
    Cheng, Lixin
    BIOINFORMATICS, 2023, 39 (03)
  • [33] A Meta-Model for The Design of Soft Pneumatic Actuators Using Neural Networks and Finite Element Analysis
    Ligthart, Philip Frederik
    Venter, Martin Philip
    ADVANCED THEORY AND SIMULATIONS, 2025,
  • [34] Integrative analysis of signaling and metabolic pathways, immune infiltration patterns, and machine learning-based diagnostic model construction in major depressive disorder
    Lei Tang
    Liling Wu
    Mengqin Dai
    Nian Liu
    Lu liu
    Scientific Reports, 15 (1)
  • [35] Local PM2.5 Hotspot Detector at 300 m Resolution: A Random Forest-Convolutional Neural Network Joint Model Jointly Trained on Satellite Images and Meteorology
    Zheng, Tongshu
    Bergin, Michael
    Wang, Guoyin
    Carlson, David
    REMOTE SENSING, 2021, 13 (07)
  • [36] Predicting tensile-shear strength of nugget using M5P model tree and random forest: An analysis
    Dang, Subrat Kumar
    Singh, Kulwant
    COMPUTERS IN INDUSTRY, 2021, 124
  • [37] A Churn Prediction Model Using Random Forest: Analysis of Machine Learning Techniques for Churn Prediction and Factor Identification in Telecom Sector
    Ullah, Irfan
    Raza, Basit
    Malik, Ahmad Kamran
    Imran, Muhammad
    Ul Islam, Saif
    Kim, Sung Won
    IEEE ACCESS, 2019, 7 : 60134 - 60149
  • [38] Continuous polymerization in tubular reactors with prepolymerization: analysis using two-dimensional phenomenological model and hybrid model with neural networks
    Nogueira, AL
    Lona, LMF
    Machado, RAF
    JOURNAL OF APPLIED POLYMER SCIENCE, 2004, 91 (02) : 871 - 882
  • [39] Spatio-Temporal Analysis of Heavy Metals in Arid Soils at the Catchment Scale Using Digital Soil Assessment and a Random Forest Model
    Taghizadeh-Mehrjardi, Ruhollah
    Fathizad, Hassan
    Ali Hakimzadeh Ardakani, Mohammad
    Sodaiezadeh, Hamid
    Kerry, Ruth
    Heung, Brandon
    Scholten, Thomas
    REMOTE SENSING, 2021, 13 (09)
  • [40] A COMPARATIVE STUDY OF FORECASTING CORPORATE CREDIT RATINGS USING ARTIFICIAL NEURAL NETWORKS, SUPPORT VECTOR MACHINE, RANDOM FOREST, THE NAIVE BAYES, DECISION TREE AND K-NEAREST NEIGHBOR
    Al-Sayed, Dalia Adel Abbas
    Awad, Wael Abdel Qader
    Salem, Mohamed Talaat Mohamed
    ADVANCES AND APPLICATIONS IN STATISTICS, 2024, 91 (02) : 125 - 139