Analyzing the Perceptual Salience of Audio Features for Musical Emotion Recognition

Cited by: 0
Authors
Schmidt, Erik M. [1]
Prockup, Matthew [1]
Scott, Jeffrey [1]
Dolhansky, Brian [2]
Morton, Brandon G. [1]
Kim, Youngmoo E. [1]
Affiliations
[1] Drexel Univ, Philadelphia, PA USA
[2] Univ Penn, Philadelphia, PA 19104 USA
Source
FROM SOUNDS TO MUSIC AND EMOTIONS | 2013 / Volume 7900
Keywords
emotion; music emotion recognition; features; acoustic features; machine learning; invariance; MODE; RESPONSES; TEMPO;
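DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract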
While the organization of music in terms of emotional affect is a natural process for humans, quantifying it empirically proves to be a very difficult task. Consequently, no acoustic feature (or combination thereof) has emerged as the optimal representation for musical emotion recognition. Due to the subjective nature of emotion, determining whether an acoustic feature domain is informative requires evaluation by human subjects. In this work, we seek to perceptually evaluate two of the most commonly used features in music information retrieval: mel-frequency cepstral coefficients and chroma. Furthermore, to identify emotion-informative feature domains, we explore which musical features are most relevant in determining emotion perceptually, and which acoustic feature domains are most variant or invariant to those changes. Finally, given our collected perceptual data, we conduct an extensive computational experiment for emotion prediction accuracy on a large number of acoustic feature domains, investigating pairwise prediction both in the context of a general corpus as well as in the context of a corpus that is constrained to contain only specific musical feature transformations.
Pages: 278-300
Page count: 23