A survey of music emotion recognition

被引:52
作者
Han, Donghong [1 ]
Kong, Yanru [1 ]
Han, Jiayi [2 ]
Wang, Guoren [3 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110000, Peoples R China
[2] Fudan Univ, Inst Sci & Technol Brain Inspired Intelligence, Shanghai 200082, Peoples R China
[3] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100089, Peoples R China
基金
国家重点研发计划;
关键词
artificial intelligence; deep learning; music emotion recognition; CIRCUMPLEX MODEL; CLASSIFICATION; FEATURES; REGRESSION; EXPRESSION; DISCRETE;
D O I
10.1007/s11704-021-0569-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Music is the language of emotions. In recent years, music emotion recognition has attracted widespread attention in the academic and industrial community since it can be widely used in fields like recommendation systems, automatic music composing, psychotherapy, music visualization, and so on. Especially with the rapid development of artificial intelligence, deep learning-based music emotion recognition is gradually becoming mainstream. This paper gives a detailed survey of music emotion recognition. Starting with some preliminary knowledge of music emotion recognition, this paper first introduces some commonly used evaluation metrics. Then a three-part research framework is put forward. Based on this three-part research framework, the knowledge and algorithms involved in each part are introduced with detailed analysis, including some commonly used datasets, emotion models, feature extraction, and emotion recognition algorithms. After that, the challenging problems and development trends of music emotion recognition technology are proposed, and finally, the whole paper is summarized.
引用
收藏
页数:11
相关论文
共 74 条
[31]   Bi-Modal Deep Boltzmann Machine Based Musical Emotion Classification [J].
Huang, Moyuan ;
Rong, Wenge ;
Arjannikov, Tom ;
Jiang, Nan ;
Xiong, Zhang .
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT II, 2016, 9887 :199-207
[32]   Expression, perception, and induction of musical emotions: A review and a questionnaire study of everyday listening [J].
Juslin, PN ;
Laukka, P .
JOURNAL OF NEW MUSIC RESEARCH, 2004, 33 (03) :217-238
[33]  
Keelawat P, 2019, 2019 IEEE 15TH INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & ITS APPLICATIONS (CSPA 2019), P21, DOI [10.1109/cspa.2019.8696054, 10.1109/CSPA.2019.8696054]
[34]  
Lartillot O., 2007, ISMIR, P237
[35]   Multimodal Music Mood Classification using Audio and Lyrics [J].
Laurier, Cyril ;
Grivolla, Jens ;
Herrera, Perfecto .
SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, :688-+
[36]  
Li T, 2003, Proceedings of the International Conference on Music Information Retrieval, P239
[37]  
Li XX, 2016, IEEE INT CON MULTI
[38]  
Li XX, 2016, INT CONF ACOUST SPEE, P544, DOI 10.1109/ICASSP.2016.7471734
[39]  
Liu H P, 2018, P 2018 INT C MATH MO, P15
[40]   What Strikes the Strings of Your Heart?-Feature Mining for Music Emotion Analysis [J].
Liu, Yang ;
Liu, Yan ;
Zhao, Yu ;
Hua, Kien A. .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2015, 6 (03) :247-260