Enhancing the efficacy of depression detection system using optimal feature selection from EHR

被引:18
作者
Bhadra, Sweta [1 ]
Kumar, Chandan Jyoti [1 ]
机构
[1] Cotton Univ, Dept Comp Sci & Informat Technol, Gauhati, India
关键词
Depression; machine learning; genetic algorithm; firefly algorithm; particle swarm optimization; EXTRACTION; ALGORITHM; ENSEMBLE; DIAGNOSIS; RECORDS; MODEL;
D O I
10.1080/10255842.2023.2181660
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Diagnosing depression at an early stage is crucial and majorly depends on the clinician's skill. The present work aims to develop an automated tool for assisting the diagnostic procedure of depression using multiple machine-learning techniques. The dataset of sample size 4184 used in this study contains biometric and demographic information of individuals with or without depression, accessed from the University of Nice Sophia-Antipolis. The Artificial Neural Network (ANN), Support Vector Machine (SVM), Random Forest (RF) and Extreme Gradient Boosting (XGBoost) are used for classifying the depressed from the control group. To enhance the computational efficiency, various feature selection algorithms like Recursive Feature Elimination (RFE), Mutual Information (MI) and three bio-inspired techniques, viz. Particle Swarm Optimization (PSO), Genetic Algorithm (GA) and Firefly Algorithms (FA) have been incorporated. To enhance the feature selection process further, majority voting is carried out in all possible combinations of three, four and five feature selection techniques. These feature selection techniques bring down the feature set size significantly to a mean of 33 from the actual size of 61 which is a reduction of 45.90%. The classification accuracy of the enhanced model varies between 84.18% and 88.46%, which is a significant improvement in performance as compared to the pre-existing models (83.76-85.89%). The proposed predictive models outperform the pre-existing classification models without feature selection and thereby enhancing both the performance and efficiency of the diagnostic process.
引用
收藏
页码:222 / 236
页数:15
相关论文
共 62 条
[1]   Computer-Aided Diagnosis of Depression Using EEG Signals [J].
Acharya, U. Rajendra ;
Sudarshan, Vidya K. ;
Adeli, Hojjat ;
Santhosh, Jayasree ;
Koh, Joel E. W. ;
Adeli, Amir .
EUROPEAN NEUROLOGY, 2015, 73 (5-6) :329-336
[2]   Feature Selection Using Particle Swarm Optimization in Intrusion Detection [J].
Ahmad, Iftikhar .
INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2015,
[3]   RUN beyond the metaphor: An efficient optimization algorithm based on Runge Kutta method [J].
Ahmadianfar, Iman ;
Heidari, Ali Asghar ;
Gandomi, Amir H. ;
Chu, Xuefeng ;
Chen, Huiling .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 181
[4]   Estimation of postpartum depression risk from electronic health records using machine learning [J].
Amit, Guy ;
Girshovitz, Irena ;
Marcus, Karni ;
Zhang, Yiye ;
Pathak, Jyotishman ;
Bar, Vered ;
Akiva, Pinchas .
BMC PREGNANCY AND CHILDBIRTH, 2021, 21 (01)
[5]  
[Anonymous], 2022, DEPRESSION RATES 202
[6]   The prevalence and correlates of depression, anxiety, and stress in a sample of college students [J].
Beiter, R. ;
Nash, R. ;
McCrady, M. ;
Rhoades, D. ;
Linscomb, M. ;
Clarahan, M. ;
Sammut, S. .
JOURNAL OF AFFECTIVE DISORDERS, 2015, 173 :90-96
[7]  
Beraha M, 2019, IEEE IJCNN
[8]   An insight into diagnosis of depression using machine learning techniques: a systematic review [J].
Bhadra, Sweta ;
Kumar, Chandan Jyoti .
CURRENT MEDICAL RESEARCH AND OPINION, 2022, 38 (05) :749-771
[9]   Feature-level fusion approaches based on multimodal EEG data for depression recognition [J].
Cai, Hanshu ;
Qu, Zhidiao ;
Li, Zhe ;
Zhang, Yi ;
Hu, Xiping ;
Hu, Bin .
INFORMATION FUSION, 2020, 59 (59) :127-138
[10]   A survey on feature selection methods [J].
Chandrashekar, Girish ;
Sahin, Ferat .
COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (01) :16-28