Analyzing the Perceptual Salience of Audio Features for Musical Emotion Recognition

Cited by: 0
Authors
Schmidt, Erik M. [1]
Prockup, Matthew [1]
Scott, Jeffrey [1]
Dolhansky, Brian [2]
Morton, Brandon G. [1]
Kim, Youngmoo E. [1]
Affiliations
[1] Drexel Univ, Philadelphia, PA USA
[2] Univ Penn, Philadelphia, PA 19104 USA
Source
FROM SOUNDS TO MUSIC AND EMOTIONS | 2013 / Vol. 7900
Keywords
emotion; music emotion recognition; features; acoustic features; machine learning; invariance; MODE; RESPONSES; TEMPO;
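DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract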
While the organization of music in terms of emotional affect is a natural process for humans, quantifying it empirically proves to be a very difficult task. Consequently, no acoustic feature (or combination thereof) has emerged as the optimal representation for musical emotion recognition. Due to the subjective nature of emotion, determining whether an acoustic feature domain is informative requires evaluation by human subjects. In this work, we seek to perceptually evaluate two of the most commonly used features in music information retrieval: mel-frequency cepstral coefficients and chroma. Furthermore, to identify emotion-informative feature domains, we explore which musical features are most relevant in determining emotion perceptually, and which acoustic feature domains are most variant or invariant to those changes. Finally, given our collected perceptual data, we conduct an extensive computational experiment for emotion prediction accuracy on a large number of acoustic feature domains, investigating pairwise prediction both in the context of a general corpus and in the context of a corpus that is constrained to contain only specific musical feature transformations.
Pages: 278-300
Page count: 23
Related papers
50 in total
[21]   The Role of Time in Music Emotion Recognition: Modeling Musical Emotions from Time-Varying Music Features [J].
Caetano, Marcelo ;
Mouchtaris, Athanasios ;
Wiering, Frans .
FROM SOUNDS TO MUSIC AND EMOTIONS, 2013, 7900 :171-196
[22]   Analyzing recognition of EEG based human attention and emotion using Machine learning [J].
Alam, Mohammad Shabbir ;
Jalil, Siti Zura A. ;
Upreti, Kamal .
MATERIALS TODAY-PROCEEDINGS, 2022, 56 :3349-3354
[23]   Emotion Recognition and Emotion Based Classification of Audio using Genetic Algorithm - An Optimized Approach [J].
Bargaje, Mahesh .
2015 INTERNATIONAL CONFERENCE ON INDUSTRIAL INSTRUMENTATION AND CONTROL (ICIC), 2015, :562-567
[24]   Construction of Japanese Audio-Visual Emotion Database and Its Application in Emotion Recognition [J].
Lubis, Nurul ;
Gomez, Randy ;
Sakti, Sakriani ;
Nakamura, Keisuke ;
Yoshino, Koichiro ;
Nakamura, Satoshi ;
Nakadai, Kazuhiro .
LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, :2180-2184
[25]   Optimizing Speech Emotion Recognition with Machine Learning Based Advanced Audio Cue Analysis [J].
Pallewela, Nuwan ;
Alahakoon, Damminda ;
Adikari, Achini ;
Pierce, John E. ;
Rose, Miranda L. .
TECHNOLOGIES, 2024, 12 (07)
[26]   Perceptual Biases in Facial Emotion Recognition in Borderline Personality Disorder [J].
Daros, Alexander R. ;
Uliaszek, Amanda A. ;
Ruocco, Anthony C. .
PERSONALITY DISORDERS-THEORY RESEARCH AND TREATMENT, 2014, 5 (01) :79-87
[27]   Significance of Phonological Features in Speech Emotion Recognition [J].
Wei Wang ;
Paul A. Watters ;
Xinyi Cao ;
Lingjie Shen ;
Bo Li .
International Journal of Speech Technology, 2020, 23 :633-642
[28]   SPEECH EMOTION RECOGNITION WITH ACOUSTIC AND LEXICAL FEATURES [J].
Jin, Qin ;
Li, Chengxin ;
Chen, Shizhe ;
Wu, Huimin .
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, :4749-4753
[29]   Significance of Phonological Features in Speech Emotion Recognition [J].
Wang, Wei ;
Watters, Paul A. ;
Cao, Xinyi ;
Shen, Lingjie ;
Li, Bo .
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (03) :633-642
[30]   The 3D Emotion Recognition Using SVM and HoG Features [J].
Savakar, Dayanand G. ;
Hosur, Ravi .
INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2020, 20 (03)