Continuous Music Emotion Recognition Using Selected Audio Features

被引:1
作者
Chmulik, Michal [1 ]
Jarina, Roman [1 ]
Kuba, Michal [1 ]
Lieskovska, Eva [1 ]
机构
[1] Univ Zilina, Fac Elect Engn & Informat Technol, Dept Multimedia & Informat & Commun Technol, AudioLab, Univ 1, Zilina 01026, Slovakia
来源
2019 42ND INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP) | 2019年
关键词
music emotion recognition; arousal-valence dimensions; audio features; support vector regression;
D O I
10.1109/tsp.2019.8768806
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this article, we present system for dynamic dimensional music emotion recognition (MER) and comparison of the system efficiency depending on different audio features. Experiments were performed on the database provided within the Emotion in Music task of MediaEval2015 benchmark (EiMME2015). In addition to the baseline features that were defined by the EiMME2015 organizers and included in the development data, we selected two-dimensional cepstral coefficients (TDC), linear prediction (LP) based coefficients, and the group of musical descriptors to evaluate their discriminative properties for recognition of music emotion. The experiments show that a combination of the EiMME2015 baseline features, LP coefficients and proposed set of music features significantly increases system performance for both arousal and valence emotional dimensions.
引用
收藏
页码:589 / 592
页数:4
相关论文
共 18 条
  • [1] Aljanaki Anna, 2016, THESIS
  • [2] [Anonymous], P MEDIAEVAL 2014 WOR
  • [3] [Anonymous], 6 EUR C SPEECH COMM
  • [4] [Anonymous], 2014, P MEDIAEVAL 2014 WOR
  • [5] [Anonymous], MEDIAEVAL 2015 WORKS
  • [6] [Anonymous], 2013, INT J COMPUT APPL, DOI DOI 10.5120/13364-0958
  • [7] [Anonymous], P MEDIAEVAL 2014 WOR
  • [8] [Anonymous], LECT NOTES COMPUTER
  • [9] [Anonymous], 2011, Music emotion recognition
  • [10] SPOKEN-WORD RECOGNITION USING DYNAMIC FEATURES ANALYZED BY TWO-DIMENSIONAL CEPSTRUM
    ARIKI, Y
    MIZUTA, S
    NAGATA, M
    SAKAI, T
    [J]. IEE PROCEEDINGS-I COMMUNICATIONS SPEECH AND VISION, 1989, 136 (02): : 133 - 140