Bipolar disorder: Construction and analysis of a joint diagnostic model using random forest and feedforward neural networks

被引:1
|
作者
Sun, Ping [1 ,2 ]
Wang, Xiangwen [1 ,3 ]
Wang, Shenghai [1 ]
Jia, Xueyu [8 ]
Feng, Shunkang [1 ]
Chen, Jun [2 ,4 ,5 ,6 ]
Fang, Yiru [2 ,4 ,5 ,6 ,7 ]
机构
[1] Qingdao Mental Hlth Ctr, Qingdao 266034, Shandong, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Med, Shanghai Mental Hlth Ctr, Clin Res Ctr, Shanghai 200030, Peoples R China
[3] Jining Med Univ, Res Inst Mental Hlth, Sch Mental Hlth, Jining 272002, Shandong, Peoples R China
[4] Shanghai Jiao Tong Univ, Ruijin Hosp, Dept Psychiat, Sch Med, Shanghai 200025, Peoples R China
[5] Shanghai Jiao Tong Univ, Ruijin Hosp, Affect Disorders Ctr, Sch Med, Shanghai 200025, Peoples R China
[6] Shanghai Key Lab Psychot Disorders, Shanghai 201108, Peoples R China
[7] Chinese Acad Sci, State Key Lab Neurosci, Shanghai Inst Biol Sci, Shanghai 200031, Peoples R China
[8] Qingdao Univ, Dept Med, Qingdao 266000, Shandong, Peoples R China
来源
IBRO NEUROSCIENCE REPORTS | 2024年 / 17卷
基金
中国国家自然科学基金;
关键词
Bipolar disorder; Machine learning; Neural networks; Diagnostic models; GENE-EXPRESSION;
D O I
10.1016/j.ibneur.2024.07.007
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Background: To construct a diagnostic model for Bipolar Disorder (BD) depressive phase using peripheral tissue RNA data from patients and combining Random Forest with Feedforward Neural Network methods. Methods: Datasets GSE23848, GSE39653, and GSE69486 were selected, and differential gene expression analysis was conducted using the limma package in R. Key genes from the differentially expressed genes were identified using the Random Forest method. These key genes' expression levels in each sample were used to train a Feedforward Neural Network model. Techniques like L1 regularization, early stopping, and dropout layers were employed to prevent model overfitting. Model performance was then validated, followed by GO, KEGG, and protein-protein interaction network analyses. Results: The final model was a Feedforward Neural Network with two hidden layers and two dropout layers, comprising 2345 trainable parameters. Model performance on the validation set, assessed through 1000 bootstrap resampling iterations, demonstrated a specificity of 0.769(95% CI 0.571-1.000), sensitivity of 0.818 (95% CI 0.533-1.000), AUC value of 0.832 (95 % CI 0.642-0.979), and accuracy of 0.792 (95 % CI 0.625-0.958). Enrichment analysis of key genes indicated no significant enrichment in any known pathways. Conclusion: Key genes with biological significance were identified based on the decrease in Gini coefficient within the Random Forest model. The combined use of Random Forest and Feedforward Neural Network to establish a diagnostic model showed good classification performance in Bipolar Disorder.
引用
收藏
页码:145 / 153
页数:9
相关论文
共 45 条
  • [11] Establishment of a Combined Diagnostic Model of Abdominal Aortic Aneurysm with Random Forest and Artificial Neural Network
    Duan, Yixuan
    Xie, Enrui
    Liu, Chang
    Sun, Jingjing
    Deng, Jie
    BIOMED RESEARCH INTERNATIONAL, 2022, 2022
  • [12] Construction of a Diagnostic Algorithm for Diagnosis of Adult Asthma Using Machine Learning with Random Forest and XGBoost
    Tomita, Katsuyuki
    Yamasaki, Akira
    Katou, Ryohei
    Ikeuchi, Tomoyuki
    Touge, Hirokazu
    Sano, Hiroyuki
    Tohda, Yuji
    DIAGNOSTICS, 2023, 13 (19)
  • [13] Automated Risk Analysis for Construction Contracts Using Neural Networks
    Hamdy, Khaled
    AbdelRasheed, Ibrahim
    Essawy, Yasmeen A. S.
    ElDeen, Ahmed Gamal
    JOURNAL OF LEGAL AFFAIRS AND DISPUTE RESOLUTION IN ENGINEERING AND CONSTRUCTION, 2024, 16 (04)
  • [14] Construction and analysis of the protein-protein interaction networks for schizophrenia, bipolar disorder, and major depression
    Sheng-An Lee
    Theresa Tsun-Hui Tsao
    Ko-Chun Yang
    Han Lin
    Yu-Lun Kuo
    Chien-Hsiang Hsu
    Wen-Kuei Lee
    Kuo-Chuan Huang
    Cheng-Yan Kao
    BMC Bioinformatics, 12
  • [15] Predicting types of occupational accidents at construction sites in Korea using random forest model
    Kang, Kyungsu
    Ryu, Hanguk
    SAFETY SCIENCE, 2019, 120 : 226 - 236
  • [16] Classification model in different forest strata in a floodplain environment using artificial neural networks
    Silva, Anthoinny Vittoria dos Santos
    de Souza, Rodrigo Galvao Teixeira
    Liarte, Gabriel Victor Caetano Carvalho
    Pinho, Bianca Caterine Piedade
    de Oliveira, Cinthia Pereira
    Gonzales, Duberli Geomar Elera
    de Lima, Robson Borges
    de Abreu, Jadson Coelho
    REVISTA FORESTAL MESOAMERICA KURU-RFMK, 2022, 19 (45): : 64 - 70
  • [17] Development and analysis of a comprehensive diagnostic model for aortic valve calcification using machine learning methods and artificial neural networks
    Xiong, Tao
    Chen, Yan
    Han, Shen
    Zhang, Tian-Chen
    Pu, Lei
    Fan, Yu-Xin
    Fan, Wei-Chen
    Zhang, Ya-Yong
    Li, Ya-Xiong
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2022, 9
  • [18] Stochastic approach to texture analysis using probabilistic neural networks and Markov random fields
    Dehmeshki, J
    Daemi, F
    Hatfield, F
    Rashidi, M
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XX, 1997, 3164 : 346 - 358
  • [19] Construction and analysis of a joint diagnostic model of machine learning for cryptorchidism based on single-cell sequencing
    Chen, Yuehua
    Zhou, Xiaomeng
    Ji, Linghua
    Zhao, Jun
    Xian, Hua
    Xu, Yunzhao
    Wang, Ziheng
    Ge, Wenliang
    BIRTH DEFECTS RESEARCH, 2024, 116 (03):
  • [20] Bioinformatics Analysis of circRNA Expression and Construction of "circRNA-miRNA-mRNA" Competing Endogenous RNAs Networks in Bipolar Disorder Patients
    Fu, Yonghui
    He, Wenfeng
    Zhou, Chaoxiong
    Fu, Xia
    Wan, Qigen
    He, Ling
    Wei, Bo
    FRONTIERS IN GENETICS, 2021, 12