Stress detection from audio on multiple window analysis size in a public speaking task

Cited by: 15

Authors:

Soury, Mariette [1]

Devillers, Laurence [1]

Affiliations:

[1] LIMSI CNRS, Orsay, France

Source:

2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII) | 2013

Keywords:

stress; prosodic cues; public speaking task; SPEECH;

DOI:

10.1109/ACII.2013.93

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Changes in speech production are one of the many indicators of stress in humans. A job interview simulation task allowed the collection of a multimodal corpus that includes physiological data. Physiological cues of stress are reliable over long periods, and they require invasive sensors. Variations in the human voice have been shown to be a non-invasive stress cue. In this paper, we focus on frame-wise detection of stress over several analysis window sizes and analyze the behavior of different classes of audio features. We trained our system on 19 subjects and tested it on 10 other subjects. Our best system obtains an Unweighted Average Recall of 71.9 percent on 5 s windows.
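The abstract reports results as Unweighted Average Recall (UAR) on fixed-length analysis windows. The sketch below is only an illustration of those two ideas, not the authors' implementation: UAR is computed as the mean of the per-class recalls, and a recording is cut into fixed-length windows. The function names, the 16 kHz sample rate, the non-overlapping hop, and the "stress"/"calm" labels are all hypothetical; the abstract specifies only that 5 s windows gave the best result.

```python
from collections import defaultdict


def split_into_windows(n_samples, sample_rate, window_s, hop_s=None):
    """Return (start, end) sample indices of fixed-length analysis windows.

    hop_s defaults to window_s (non-overlapping windows); this is an
    assumption, since the abstract does not state the hop size.
    """
    hop_s = window_s if hop_s is None else hop_s
    win = int(round(window_s * sample_rate))
    hop = int(round(hop_s * sample_rate))
    if win <= 0 or hop <= 0:
        raise ValueError("window and hop must be positive")
    # Trailing samples that do not fill a whole window are dropped.
    return [(start, start + win) for start in range(0, n_samples - win + 1, hop)]


def unweighted_average_recall(y_true, y_pred):
    """Mean of per-class recalls, so every class counts equally."""
    if len(y_true) == 0 or len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must be non-empty and equal in length")
    hits = defaultdict(int)
    totals = defaultdict(int)
    for truth, pred in zip(y_true, y_pred):
        totals[truth] += 1
        if truth == pred:
            hits[truth] += 1
    return sum(hits[c] / totals[c] for c in totals) / len(totals)


# Hypothetical example: a 12 s recording at 16 kHz cut into 5 s windows.
windows = split_into_windows(n_samples=16000 * 12, sample_rate=16000, window_s=5.0)

# Hypothetical labels: 2 stressed windows, 8 calm windows.
y_true = ["stress"] * 2 + ["calm"] * 8
y_pred = ["stress", "calm"] + ["calm"] * 8
uar = unweighted_average_recall(y_true, y_pred)
```

In this toy example plain accuracy would be 0.9, while UAR is 0.75 because the minority class is only half recalled. That gap is why UAR is commonly preferred when classes are imbalanced.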


Pages: 529-533

Page count: 5

