A CONTEMPORARY MULTI-OBJECTIVE FEATURE SELECTION MODEL FOR DEPRESSION DETECTION USING A HYBRID PBGSK OPTIMIZATION ALGORITHM

被引:1
作者
Priya, Santhosam Kavi [1 ]
Karthika, Kasirajan Pon [1 ]
机构
[1] Mepco Schlenk Engn Coll Autonomous, Dept Comp Sci & Engn, Sivakasi 626005, Tamil Nadu, India
关键词
depression detection; text classification; dimensionality reduction; hybrid feature selection; binary gaining-sharing knowledge-based optimization; TEXT CLASSIFICATION; POSTS;
D O I
10.34768/amcs-2023-0010
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Depression is one of the primary causes of global mental illnesses and an underlying reason for suicide. The user generated text content available in social media forums offers an opportunity to build automatic and reliable depression detection models. The core objective of this work is to select an optimal set of features that may help in classifying depressive contents posted on social media. To this end, a novel multi-objective feature selection technique (EFS-pBGSK) and machine learning algorithms are employed to train the proposed model. The novel feature selection technique incorporates a binary gaining-sharing knowledge-based optimization algorithm with population reduction (pBGSK) to obtain the optimized features from the original feature space. The extensive feature selector (EFS) is used to filter out the excessive features based on their ranking. Two text depression datasets collected from Twitter and Reddit forums are used for the evaluation of the proposed feature selection model. The experimentation is carried out using naive Bayes (NB) and support vector machine (SVM) classifiers for five different feature subset sizes (10, 50, 100, 300 and 500). The experimental outcome indicates that the proposed model can achieve superior performance scores. The top results are obtained using the SVM classifier for the SDD dataset with 0.962 accuracy, 0.929 F1 score, 0.0809 log-loss and 0.0717 mean absolute error (MAE). As a result, the optimal combination of features selected by the proposed hybrid model significantly improves the performance of the depression detection system.
引用
收藏
页码:117 / 131
页数:15
相关论文
共 44 条
[1]   Metaheuristic Algorithms on Feature Selection: A Survey of One Decade of Research (2009-2019) [J].
Agrawal, Prachi ;
Abutarboush, Hattan F. ;
Ganesh, Talari ;
Mohamed, Ali Wagdy .
IEEE ACCESS, 2021, 9 :26766-26791
[2]   A novel binary gaining-sharing knowledge-based optimization algorithm for feature selection [J].
Agrawal, Prachi ;
Ganesh, Talari ;
Mohamed, Ali Wagdy .
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (11) :5989-6008
[3]  
Asim M. N., 2017, COMP FEATURE SELECTI, P1, DOI DOI 10.1109/INTELLECT.2017.8277634
[4]  
Babu Nirmal Varghese, 2022, SN Comput Sci, V3, P74, DOI [10.1007/s42979-021-00958-1, 10.1007/s42979-021-00958-1]
[5]   A text classification framework for simple and effective early depression detection over social media streams [J].
Burdisso, Sergio G. ;
Errecalde, Marcelo ;
Montes-y-Gomez, Manuel .
EXPERT SYSTEMS WITH APPLICATIONS, 2019, 133 :182-197
[6]   Feature selection for text classification with Naive Bayes [J].
Chen, Jingnian ;
Huang, Houkuan ;
Tian, Shengfeng ;
Qu, Youli .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) :5432-5435
[7]   A textual-based featuring approach for depression detection using machine learning classifiers and social media texts [J].
Chiong, Raymond ;
Budhi, Gregorius Satia ;
Dhakal, Sandeep ;
Chiong, Fabian .
COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 135
[8]   Feature selection for text classification: A review [J].
Deng, Xuelian ;
Li, Yuqing ;
Weng, Jian ;
Zhang, Jilian .
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (03) :3797-3816
[9]   A Depression Recognition Method for College Students Using Deep Integrated Support Vector Algorithm [J].
Ding, Yan ;
Chen, Xuemei ;
Fu, Qiming ;
Zhong, Shan .
IEEE ACCESS, 2020, 8 (08) :75616-75629
[10]   Feature selection and classification using support vector machine and decision tree [J].
Durgalakshmi, B. ;
Vijayakumar, V. .
COMPUTATIONAL INTELLIGENCE, 2020, 36 (04) :1480-1492