Analyzing the Perceptual Salience of Audio Features for Musical Emotion Recognition

被引:0
|
作者
Schmidt, Erik M. [1 ]
Prockup, Matthew [1 ]
Scott, Jeffrey [1 ]
Dolhansky, Brian [2 ]
Morton, Brandon G. [1 ]
Kim, Youngmoo E. [1 ]
机构
[1] Drexel Univ, Philadelphia, PA USA
[2] Univ Penn, Philadelphia, PA 19104 USA
来源
FROM SOUNDS TO MUSIC AND EMOTIONS | 2013年 / 7900卷
关键词
emotion; music emotion recognition; features; acoustic features; machine learning; invariance; MODE; RESPONSES; TEMPO;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While the organization of music in terms of emotional affect is a natural process for humans, quantifying it empirically proves to be a very difficult task. Consequently, no acoustic feature (or combination thereof) has emerged as the optimal representation for musical emotion recognition. Due to the subjective nature of emotion, determining whether an acoustic feature domain is informative requires evaluation by human subjects. In this work, we seek to perceptually evaluate two of the most commonly used features in music information retrieval: mel-frequency cepstral coefficients and chroma. Furthermore, to identify emotion-informative feature domains, we explore which musical features are most relevant in determining emotion perceptually, and which acoustic feature domains are most variant or invariant to those changes. Finally, given our collected perceptual data, we conduct an extensive computational experiment for emotion prediction accuracy on a large number of acoustic feature domains, investigating pairwise prediction both in the context of a general corpus as well as in the context of a corpus that is constrained to contain only specific musical feature transformations.
引用
收藏
页码:278 / 300
页数:23
相关论文
共 50 条
  • [1] Audio Features for Music Emotion Recognition: A Survey
    Panda, Renato
    Malheiro, Ricardo
    Paiva, Rui Pedro
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (01) : 68 - 88
  • [2] Continuous Music Emotion Recognition Using Selected Audio Features
    Chmulik, Michal
    Jarina, Roman
    Kuba, Michal
    Lieskovska, Eva
    2019 42ND INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2019, : 589 - 592
  • [3] The source dilemma hypothesis: Perceptual uncertainty contributes to musical emotion
    Bonin, Tanor L.
    Trainor, Laurel J.
    Belyk, Michel
    Andrews, Paul W.
    COGNITION, 2016, 154 : 174 - 181
  • [4] Tonic and scale recognition in Persian Audio Musical Signals
    Heydarian, Peyman
    Jones, Lewis
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 18 - 21
  • [5] Felt Emotion Elicited by Music: Are Sensitivities to Various Musical Features Different for Young Children and Young Adults?
    Chen, Xuqian
    Huang, Shengqiao
    Hei, Xueting
    Zeng, Hongyuan
    SPANISH JOURNAL OF PSYCHOLOGY, 2020, 23
  • [6] The Role of Emotion in Musical Improvisation: An Analysis of Structural Features
    McPherson, Malinda J.
    Lopez-Gonzalez, Monica
    Rankin, Summer K.
    Limb, Charles J.
    PLOS ONE, 2014, 9 (08):
  • [7] CHALLENGES AND OPPORTUNITIES OF PREDICTING MUSICAL EMOTIONS WITH PERCEPTUAL AND AUTOMATIZED FEATURES
    Lange, Elke B.
    Frieler, Klaus
    MUSIC PERCEPTION, 2018, 36 (02): : 217 - 242
  • [8] Audio-Visual Emotion Recognition System Using Multi-Modal Features
    Handa, Anand
    Agarwal, Rashi
    Kohli, Narendra
    INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE, 2021, 15 (04)
  • [9] NOVEL AUDIO FEATURES FOR CAPTURING TEMPO SALIENCE IN MUSIC RECORDINGS
    Thoshkahna, Balaji
    Mueller, Meinard
    Kulkarni, Venkatesh
    Jiang, Nanzhu
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 181 - 185
  • [10] Deep Learning for Audio Visual Emotion Recognition
    Hussain, T.
    Wang, W.
    Bouaynaya, N.
    Fathallah-Shaykh, H.
    Mihaylova, L.
    2022 25TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2022), 2022,