Audio Features for Music Emotion Recognition: A Survey

被引:42
|
作者
Panda, Renato [1 ,2 ]
Malheiro, Ricardo [1 ,3 ]
Paiva, Rui Pedro [1 ]
机构
[1] Univ Coimbra, Ctr Informat & Syst, Dept Informat Engn, P-3030290 Coimbra, Portugal
[2] Polytech Inst Tomar, Ci2, P-2300313 Tomar, Portugal
[3] Miguel Torga Higher Inst, P-3000132 Coimbra, Portugal
关键词
Rhythm; Feature extraction; Emotion recognition; Psychology; Indexes; Machine learning; Affective computing; music emotion recognition; audio feature design; music information retrieval; PERCEPTION; EXPRESSION; PITCH; EXTRACTION; SPEECH; TIMBRE; REPRESENTATIONS; CLASSIFICATION; REGRESSION; RESPONSES;
D O I
10.1109/TAFFC.2020.3032373
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The design of meaningful audio features is a key need to advance the state-of-the-art in music emotion recognition (MER). This article presents a survey on the existing emotionally-relevant computational audio features, supported by the music psychology literature on the relations between eight musical dimensions (melody, harmony, rhythm, dynamics, tone color, expressivity, texture and form) and specific emotions. Based on this review, current gaps and needs are identified and strategies for future research on feature engineering for MER are proposed, namely ideas for computational audio features that capture elements of musical form, texture and expressivity that should be further researched. Previous MER surveys offered broad reviews, covering topics such as emotion paradigms, approaches for the collection of ground-truth data, types of MER problems and overviewing different MER systems. On the contrary, our approach is to offer a deep and specific review on one key MER problem: the design of emotionally-relevant audio features.
引用
收藏
页码:68 / 88
页数:21
相关论文
共 50 条
  • [1] Novel Audio Features for Music Emotion Recognition
    Panda, Renato
    Malheiro, Ricardo
    Paiva, Rui Pedro
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2020, 11 (04) : 614 - 626
  • [2] A survey of music emotion recognition
    Han, Donghong
    Kong, Yanru
    Han, Jiayi
    Wang, Guoren
    FRONTIERS OF COMPUTER SCIENCE, 2022, 16 (06)
  • [3] Continuous Music Emotion Recognition Using Selected Audio Features
    Chmulik, Michal
    Jarina, Roman
    Kuba, Michal
    Lieskovska, Eva
    2019 42ND INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2019, : 589 - 592
  • [4] Human emotion recognition and analysis in response to audio music using brain signals
    Bhatti, Adnan Mehmood
    Majid, Muhammad
    Anwar, Syed Muhammad
    Khan, Bilal
    COMPUTERS IN HUMAN BEHAVIOR, 2016, 65 : 267 - 275
  • [5] Analyzing the Perceptual Salience of Audio Features for Musical Emotion Recognition
    Schmidt, Erik M.
    Prockup, Matthew
    Scott, Jeffrey
    Dolhansky, Brian
    Morton, Brandon G.
    Kim, Youngmoo E.
    FROM SOUNDS TO MUSIC AND EMOTIONS, 2013, 7900 : 278 - 300
  • [6] Music Emotion Recognition with the Extraction of Audio Features Using Machine Learning Approaches
    Juthi, Jannatul Humayra
    Gomes, Anthony
    Bhuiyan, Touhid
    Mahmud, Imran
    PROCEEDINGS OF ICETIT 2019: EMERGING TRENDS IN INFORMATION TECHNOLOGY, 2020, 605 : 318 - 329
  • [7] A survey on music emotion recognition using learning modelsA survey on music emotion recognition using learning modelsY. Wang et al.
    Yixin Wang
    Xujian Zhao
    Chuanpeng Deng
    Yao Xiao
    Haoxin Ruan
    Peiquan Jin
    Xuebo Cai
    Multimedia Systems, 2025, 31 (4)
  • [8] A survey of music emotion recognition
    Donghong Han
    Yanru Kong
    Jiayi Han
    Guoren Wang
    Frontiers of Computer Science, 2022, 16
  • [9] A survey of music emotion recognition
    HAN Donghong
    KONG Yanru
    HAN Jiayi
    WANG Guoren
    Frontiers of Computer Science, 2022, 16 (06)
  • [10] Exploring Emotion Features and Fusion Strategies for Audio-Video Emotion Recognition
    Zhou, Hengshun
    Meng, Debin
    Zhang, Yuanyuan
    Peng, Xiaojiang
    Du, Jun
    Wang, Kai
    Qiao, Yu
    ICMI'19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2019, : 562 - 566