Audio Features for Music Emotion Recognition: A Survey

Cited by: 42
Authors
Panda, Renato [1, 2]
Malheiro, Ricardo [1, 3]
Paiva, Rui Pedro [1]
Affiliations
[1] Univ Coimbra, Ctr Informat & Syst, Dept Informat Engn, P-3030290 Coimbra, Portugal
[2] Polytech Inst Tomar, Ci2, P-2300313 Tomar, Portugal
[3] Miguel Torga Higher Inst, P-3000132 Coimbra, Portugal
Keywords
Rhythm; Feature extraction; Emotion recognition; Psychology; Indexes; Machine learning; Affective computing; music emotion recognition; audio feature design; music information retrieval; PERCEPTION; EXPRESSION; PITCH; EXTRACTION; SPEECH; TIMBRE; REPRESENTATIONS; CLASSIFICATION; REGRESSION; RESPONSES
DOI
10.1109/TAFFC.2020.3032373
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The design of meaningful audio features is a key need to advance the state-of-the-art in music emotion recognition (MER). This article presents a survey on the existing emotionally-relevant computational audio features, supported by the music psychology literature on the relations between eight musical dimensions (melody, harmony, rhythm, dynamics, tone color, expressivity, texture and form) and specific emotions. Based on this review, current gaps and needs are identified and strategies for future research on feature engineering for MER are proposed, namely ideas for computational audio features that capture elements of musical form, texture and expressivity that should be further researched. Previous MER surveys offered broad reviews, covering topics such as emotion paradigms, approaches for the collection of ground-truth data, types of MER problems and overviewing different MER systems. On the contrary, our approach is to offer a deep and specific review on one key MER problem: the design of emotionally-relevant audio features.
Pages: 68-88
Number of pages: 21
Related papers
50 in total
  • [31] Using Circular Models to Improve Music Emotion Recognition
    Dufour, Isabelle
    Tzanetakis, George
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2021, 12 (03) : 666 - 681
  • [32] A New Multilabel System for Automatic Music Emotion Recognition
    Paolizzo, Fabio
    Pichierri, Natalia
    Giardino, Daniele
    Matta, Marco
    Casali, Daniele
    Costantini, Giovanni
    2021 IEEE INTERNATIONAL WORKSHOP ON METROLOGY FOR INDUSTRY 4.0 & IOT (IEEE METROIND4.0 & IOT), 2021, : 625 - 629
  • [33] Music Emotion Recognition Using Deep Gaussian Process
    Chen, Sih-Huei
    Lee, Yuan-Shan
    Hsieh, Wen-Chi
    Wang, Jia-Ching
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 495 - 498
  • [34] Music Emotion Recognition Using Two Level Classification
    Pouyanfar, Samira
    Sameti, Hossein
    2014 IRANIAN CONFERENCE ON INTELLIGENT SYSTEMS (ICIS), 2014,
  • [35] Music emotion recognition through data and digital technologies
    Lujan Villar, Roberto Carlos
    Lujan Villar, Juan David
    COMUNICACION Y HOMBRE, 2020, (16) : 59 - 82
  • [36] Deep Learning for Audio Visual Emotion Recognition
    Hussain, T.
    Wang, W.
    Bouaynaya, N.
    Fathallah-Shaykh, H.
    Mihaylova, L.
    2022 25TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2022), 2022,
  • [37] Audio-visual spontaneous emotion recognition
    Zeng, Zhihong
    Hu, Yuxiao
    Roisman, Glenn I.
    Wen, Zhen
    Fu, Yun
    Huang, Thomas S.
    ARTIFICIAL INTELLIGENCE FOR HUMAN COMPUTING, 2007, 4451 : 72 - +
  • [38] Speech Databases, Speech Features, and Classifiers in Speech Emotion Recognition: A Review
    Dar, G. H. Mohmad
    Delhibabu, Radhakrishnan
    IEEE ACCESS, 2024, 12 : 151122 - 151152
  • [39] EFFECTIVE EMOTION RECOGNITION IN MOVIE AUDIO TRACKS
    Kotti, Margarita
    Stylianou, Yannis
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5120 - 5124
  • [40] LATE INTEGRATION OF FEATURES FOR ACOUSTIC EMOTION RECOGNITION
    Cullen, Ailbhe
    Harte, Naomi
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,