Early Detection of Depression: Social Network Analysis and Random Forest Techniques

被引:88
作者
Cacheda, Fidel [1 ,2 ]
Fernandez, Diego [1 ,2 ]
Novoa, Francisco J. [1 ,2 ]
Carneiro, Victor [1 ,2 ]
机构
[1] Univ A Coruna, Fac Comp Sci, Dept Comp Sci, Campus Elvina, La Coruna 15071, Spain
[2] Univ A Coruna, Ctr Informat & Commun Technol Res, La Coruna, Spain
关键词
depression; major depressive disorder; social media; artificial intelligence; machine learning; MENTAL-DISORDERS; HEALTH; MEDIA; PREVENTION;
D O I
10.2196/12554
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Major depressive disorder (MDD) or depression is among the most prevalent psychiatric disorders, affecting more than 300 million people globally. Early detection is critical for rapid intervention, which can potentially reduce the escalation of the disorder. Objective: This study used data from social media networks to explore various methods of early detection of MDDs based on machine learning. We performed a thorough analysis of the dataset to characterize the subjects' behavior based on different aspects of their writings: textual spreading, time gap, and time span. Methods: We proposed 2 different approaches based on machine learning singleton and dual. The former uses 1 random forest (RF) classifier with 2 threshold functions, whereas the latter uses 2 independent RF classifiers, one to detect depressed subjects and another to identify nondepressed individuals. In both cases, features are defined from textual, semantic, and writing similarities. Results: The evaluation follows a time-aware approach that rewards early detections and penalizes late detections. The results show how a dual model performs significantly better than the singleton model and is able to improve current state-of-the-art detection models by more than 10%. Conclusions: Given the results, we consider that this study can help in the development of new solutions to deal with the early detection of depression on social networks.
引用
收藏
页数:18
相关论文
共 49 条
  • [1] Feature normalization and likelihood-based similarity measures for image retrieval
    Aksoy, S
    Haralick, RM
    [J]. PATTERN RECOGNITION LETTERS, 2001, 22 (05) : 563 - 582
  • [2] Viewpoint The Importance of Reviewing the Code
    Anel, Juan A.
    [J]. COMMUNICATIONS OF THE ACM, 2011, 54 (05) : 40 - 41
  • [3] [Anonymous], 2013, WILL COMPR PLAN 2013
  • [4] [Anonymous], 2009, ACM SIGKDD explorations newsletter, DOI 10.1145/1656274.1656278
  • [5] Balani Sairam, 2015, P 33 ANN ACM C EXT A, P1373, DOI DOI 10.1145/2702613.2732733
  • [6] Beck A.T., 1996, Psychol. Assess, DOI DOI 10.1037/T00742-000
  • [7] A Collaborative Approach to Identifying Social Media Markers of Schizophrenia by Employing Machine Learning and Clinical Appraisals
    Birnbaum, Michael L.
    Ernala, Sindhu Kiranmai
    Rizvi, Asra F.
    De Choudhury, Munmun
    Kane, John M.
    [J]. JOURNAL OF MEDICAL INTERNET RESEARCH, 2017, 19 (08)
  • [8] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [9] Measuring depression severity in general practice: discriminatory performance of the PHQ-9, HADS-D, and BDI-II
    Cameron, Isobel M.
    Cardy, Amanda
    Crawford, John R.
    du Toit, Schalk W.
    Hay, Steven
    Lawton, Kenneth
    Mitchell, Kenneth
    Sharma, Sumit
    Shivaprasad, Shilpa
    Winning, Sally
    Reid, Ian C.
    [J]. BRITISH JOURNAL OF GENERAL PRACTICE, 2011, 61 (588) : e419 - e426
  • [10] Pandemics in the Age of Twitter: Content Analysis of Tweets during the 2009 H1N1 Outbreak
    Chew, Cynthia
    Eysenbach, Gunther
    [J]. PLOS ONE, 2010, 5 (11):