A survey of music emotion recognition

被引:52
作者
Han, Donghong [1 ]
Kong, Yanru [1 ]
Han, Jiayi [2 ]
Wang, Guoren [3 ]
机构
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110000, Peoples R China
[2] Fudan Univ, Inst Sci & Technol Brain Inspired Intelligence, Shanghai 200082, Peoples R China
[3] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing 100089, Peoples R China
基金
国家重点研发计划;
关键词
artificial intelligence; deep learning; music emotion recognition; CIRCUMPLEX MODEL; CLASSIFICATION; FEATURES; REGRESSION; EXPRESSION; DISCRETE;
D O I
10.1007/s11704-021-0569-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Music is the language of emotions. In recent years, music emotion recognition has attracted widespread attention in the academic and industrial community since it can be widely used in fields like recommendation systems, automatic music composing, psychotherapy, music visualization, and so on. Especially with the rapid development of artificial intelligence, deep learning-based music emotion recognition is gradually becoming mainstream. This paper gives a detailed survey of music emotion recognition. Starting with some preliminary knowledge of music emotion recognition, this paper first introduces some commonly used evaluation metrics. Then a three-part research framework is put forward. Based on this three-part research framework, the knowledge and algorithms involved in each part are introduced with detailed analysis, including some commonly used datasets, emotion models, feature extraction, and emotion recognition algorithms. After that, the challenging problems and development trends of music emotion recognition technology are proposed, and finally, the whole paper is summarized.
引用
收藏
页数:11
相关论文
共 74 条
[21]   Bidirectional Convolutional Recurrent Sparse Network (BCRSN): An Efficient Model for Music Emotion Recognition [J].
Dong, Yizhuo ;
Yang, Xinyu ;
Zhao, Xi ;
Li, Juan .
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (12) :3150-3163
[22]   A comparison of the discrete and dimensional models of emotion in music [J].
Eerola, Tuomas ;
Vuoskoski, Jonna K. .
PSYCHOLOGY OF MUSIC, 2011, 39 (01) :18-49
[23]  
Ferreira L., 2019, ISMIR, P384, DOI 10 . 5281 / zenodo.3527824
[24]  
Fukayama S, 2016, INT CONF ACOUST SPEE, P71, DOI 10.1109/ICASSP.2016.7471639
[25]  
Han Wen-Jing, 2014, Journal of Software, V25, P37
[26]  
He H, 2008, LECT NOTES COMPUT SC, V5370, P426, DOI 10.1007/978-3-540-92137-0_47
[27]   Experimental studies of the elements of expression in music [J].
Hevner, K .
AMERICAN JOURNAL OF PSYCHOLOGY, 1936, 48 :246-268
[28]  
Hu X., 2009, ISMIR, P411
[29]  
Hu X., 2018, INT SOC MUSIC INFORM, P362
[30]  
Hu Yajie., 2009, ISMIR, P123