Stress detection from audio on multiple window analysis size in a public speaking task

被引:15
|
作者
Soury, Mariette [1 ]
Devillers, Laurence [1 ]
机构
[1] LIMSI CNRS, Orsay, France
来源
2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII) | 2013年
关键词
stress; prosodic cues; public speaking task; SPEECH;
D O I
10.1109/ACII.2013.93
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech production modifications are one of the many indications of stress in humans. A job interview simulation task permitted the collection of a multimodal corpus, including physiological data. Physiological cues of stress are reliable on long periods, and require invasive sensors. Human voice variations have been proved to be a non-invasive stress cue. In this paper, we focus on a frame-wise detection of stress on several window analysis sizes and analyze the behavior of different audio features classes. We trained our system on 19 subjects, and test it on 10 other subjects. Our best system obtains a 71.9 percent Unweighted Average Recall on 5s windows.
引用
收藏
页码:529 / 533
页数:5
相关论文
共 9 条
  • [1] Psychological stress, appraisal, emotion and cardiovascular response in a public speaking task
    Feldman, PJ
    Cohen, S
    Hamrick, N
    Lepore, SJ
    PSYCHOLOGY & HEALTH, 2004, 19 (03) : 353 - 368
  • [2] Multimodal Expressions of Stress during a Public Speaking Task Collection, Annotation and Global Analyses
    Giraud, Tom
    Soury, Mariette
    Hua, Jiewen
    Delaborde, Agnes
    Tahon, Marie
    Jauregui, David Antonio Gomez
    Eyharabide, Victoria
    Filaire, Edith
    Le Scanff, Christine
    Devillers, Laurence
    Isableu, Brice
    Martin, Jean Claude
    2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2013, : 417 - 422
  • [3] Multimodal Stress Detection from Multiple Assessments
    Aigrain, Jonathan
    Spodenkiewicz, Michel
    Dubuisson, Severine
    Detyniecki, Marcin
    Cohen, David
    Chetouani, Mohamed
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2018, 9 (04) : 491 - 506
  • [4] Real-time Stress Detection Model and Voice Analysis: An Integrated VR-based Game for Training Public Speaking Skills
    Arushi
    Dillon, Roberto
    Teoh, Ai Ni
    2021 IEEE CONFERENCE ON GAMES (COG), 2021, : 994 - 997
  • [5] Emotion detection from multilingual audio using deep analysis
    Bhattacharya, Sudipta
    Borah, Samarjeet
    Mishra, Brojo Kishore
    Mondal, Atreyee
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (28) : 41309 - 41338
  • [6] Emotion detection from multilingual audio using deep analysis
    Sudipta Bhattacharya
    Samarjeet Borah
    Brojo Kishore Mishra
    Atreyee Mondal
    Multimedia Tools and Applications, 2022, 81 : 41309 - 41338
  • [7] Enabling COVID-19 Detection from Multiple Audio Recordings: A Preliminary Comparison Between Cough, Breath, and Speech Signals
    Ponsiglione, Alfonso Maria
    Angelone, Francesca
    Sparaco, Rossella
    Piccolo, Salvatore
    Parrish, Amy
    Calcagno, Andrea
    Fournier, Guillaume
    Martins, Ayana de Brito
    Cordella, Fulvio
    Arienzo, Arianna
    Castella, Lorenzo
    Vitale, Vincenzo Norman
    Amato, Francesco
    Romano, Maria
    9TH EUROPEAN MEDICAL AND BIOLOGICAL ENGINEERING CONFERENCE, VOL 1, EMBEC 2024, 2024, 112 : 373 - 383
  • [8] Ultra-Short Window Length and Feature Importance Analysis for Cognitive Load Detection from Wearable Sensors
    Tervonen, Jaakko
    Pettersson, Kati
    Mantyjarvi, Jani
    ELECTRONICS, 2021, 10 (05) : 1 - 19
  • [9] Numerical Modeling and Onsite Detection Analysis of Upward Mining Feasibility of Residual Coal from Multi-Gobs in Close-Multiple Coal Seams
    Li, Yang
    Ren, Yuqi
    Lei, Xinghai
    Wang, Nan
    Jin, Xiangyang
    MINING METALLURGY & EXPLORATION, 2023, 40 (04) : 1153 - 1169