Multimodal region-based behavioral modeling for suicide risk screening

被引:0
作者
Alghowinem, Sharifa [1 ,2 ]
Zhang, Xiajie [1 ]
Breazeal, Cynthia [1 ]
Park, Hae Won [1 ]
机构
[1] MIT, Personal Robot Grp, Media Lab, Cambridge, MA 02139 USA
[2] Prince Sultan Univ, Comp & Informat Sci Coll, Riyadh, Saudi Arabia
来源
FRONTIERS IN COMPUTER SCIENCE | 2023年 / 5卷
关键词
suicide risk screening; nonverbal behavior; speech prosody; region-based behavior analysis; multimodal fusion; deep learning automatic suicide risk screening; DEPRESSION; ANXIETY; GAZE;
D O I
10.3389/fcomp.2023.990426
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
IntroductionSuicide is a leading cause of death around the world, interpolating a huge suffering to the families and communities of the individuals. Such pain and suffering are preventable with early screening and monitoring. However, current suicide risk identification relies on self-disclosure and/or the clinician's judgment. Research question/statmentTherefore, we investigate acoustic and nonverbal behavioral markers that are associated with different levels of suicide risks through a multimodal approach for suicide risk detection.Given the differences in the behavioral dynamics between subregions of facial expressions and body gestures in terms of timespans, we propose a novel region-based multimodal fusion. MethodsWe used a newly collected video interview dataset of young Japanese who are at risk of suicide to extract engineered features and deep representations from the speech, regions of the face (i.e., eyes, nose, mouth), regions of the body (i.e., shoulders, arms, legs), as well as the overall combined regions of face and body. ResultsThe results confirmed that behavioral dynamics differs between regions, where some regions benefit from a shorter timespans, while other regions benefit from longer ones. Therefore, a region-based multimodal approach is more informative in terms of behavioral markers and accounts for both subtle and strong behaviors. Our region-based multimodal results outperformed the single modality, reaching a sample-level accuracy of 96% compared with the highest single modality that reached sample-level accuracy of 80%. Interpretation of the behavioral markers, showed the higher the suicide risk levels, the lower the expressivity, movement and energy observed from the subject. Moreover, the high-risk suicide group express more disgust and contact avoidance, while the low-risk suicide group express self-soothing and anxiety behaviors. DiscussionEven though multimodal analysis is a powerful tool to enhance the model performance and its reliability, it is important to ensure through a careful selection that a strong behavioral modality (e.g., body movement) does not dominate another subtle modality (e.g., eye blink). Despite the small sample size, our unique dataset and the current results adds a new cultural dimension to the research on nonverbal markers of suicidal risks. Given a larger dataset, future work on this method can be useful in helping psychiatrists with the assessment of suicide risk and could have several applications to identify those at risk.
引用
收藏
页数:20
相关论文
共 69 条
  • [1] Beyond the Words: Analysis and Detection of Self-Disclosure Behavior during Robot Positive Psychology Interaction
    Alghowinem, Sharifa
    Jeong, Sooyeon
    Arias, Kika
    Picard, Rosalind
    Breazeal, Cynthia
    Park, Hae Won
    [J]. 2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [2] Alghowinem S., 2021, PROC 16 IEEE INT C A, P01
  • [3] Alghowinem S, 2023, IEEE T AFFECT COMPUT, V14, P133, DOI [10.1109/TAFFC.2020.3035535, 10.1109/taffc.2020.3035535]
  • [4] Multimodal Depression Detection: Fusion Analysis of Paralinguistic, Head Pose and Eye Gaze Behaviors
    Alghowinem, Sharifa
    Goecke, Roland
    Wagner, Michael
    Epps, Julien
    Hyett, Matthew
    Parker, Gordon
    Breakspear, Michael
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2018, 9 (04) : 478 - 490
  • [5] Cross-Cultural Depression Recognition from Vocal Biomarkers
    Alghowinem, Sharifa
    Goecke, Roland
    Epps, Julien
    Wagner, Michael
    Cohn, Jeffrey
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1943 - 1947
  • [6] Alghowinem S, 2013, IEEE IMAGE PROC, P4220, DOI 10.1109/ICIP.2013.6738869
  • [7] Head Pose and Movement Analysis as an Indicator of Depression
    Alghowinem, Sharifa
    Goecke, Roland
    Wagner, Michael
    Parker, Gordon
    Breakspear, Michael
    [J]. 2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2013, : 283 - 288
  • [8] Baltrusaitis T, 2015, IEEE INT CONF AUTOMA
  • [9] The Dempster-Shafer theory of evidence: an alternative approach to multicriteria decision modelling
    Beynon, M
    Curry, B
    Morgan, P
    [J]. OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2000, 28 (01): : 37 - 50
  • [10] INTONATION AND GESTURE
    BOLINGER, D
    [J]. AMERICAN SPEECH, 1983, 58 (02) : 156 - 174