Static Music Emotion Recognition Using Recurrent Neural Networks

Cited by: 5
Author
Grekow, Jacek [1]
Affiliation
[1] Bialystok Tech Univ, Fac Comp Sci, Wiejska 45A, PL-15351 Bialystok, Poland
Source
FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2020) | 2020 / Vol. 12117
Keywords
Emotion detection; Audio features; Sequential data; Recurrent Neural Networks
DOI
10.1007/978-3-030-59491-6_14
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The article presents experiments using recurrent neural networks to detect emotion in musical segments based on Russell's circumplex model. It describes the process of extracting features and building sequential data for training networks with long short-term memory (LSTM) units. Models were implemented with the WekaDeeplearning4j package, and a number of experiments were carried out on data with different feature sets and varying segmentation. The experiments demonstrated that dividing the data into sequences is useful and that recurrent networks are well suited to recognizing emotions in music: their results even exceeded those of the SVM algorithm for regression. The author also analyzed how the network structure and the feature set affect the results of regressors predicting values on the two axes of the emotion model, arousal and valence.
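
The step of turning extracted features into sequential data can be illustrated with a minimal sketch. It is not the author's implementation (the paper used WekaDeeplearning4j), and the abstract does not state the segment lengths or feature sets, so the function name `split_into_sequences`, the parameters `seq_len` and `hop`, and the toy feature values are all assumptions made for illustration. The idea is that a segment's per-frame feature vectors are grouped into fixed-length sequences, each of which could then be paired with one arousal value and one valence value as a regression target.

```python
from typing import List, Optional, Sequence


def split_into_sequences(
    frames: Sequence[Sequence[float]],
    seq_len: int,
    hop: Optional[int] = None,
) -> List[List[List[float]]]:
    """Group per-frame feature vectors into fixed-length sequences.

    frames  : one feature vector per analysis frame (hypothetical layout)
    seq_len : number of frames per sequence (assumed parameter)
    hop     : step between sequence starts; defaults to seq_len (no overlap)
    """
    if seq_len <= 0:
        raise ValueError("seq_len must be positive")
    step = seq_len if hop is None else hop
    if step <= 0:
        raise ValueError("hop must be positive")

    sequences = []
    # Trailing frames that do not fill a whole sequence are dropped.
    for start in range(0, len(frames) - seq_len + 1, step):
        sequences.append([list(f) for f in frames[start:start + seq_len]])
    return sequences


# Toy data: 6 frames with 2 made-up features each.
frames = [[float(i), float(i) * 0.5] for i in range(6)]

non_overlapping = split_into_sequences(frames, seq_len=3)
overlapping = split_into_sequences(frames, seq_len=3, hop=1)

# One (arousal, valence) pair per sequence; the values here are placeholders.
labeled = [(seq, (0.0, 0.0)) for seq in non_overlapping]
```

With a hop smaller than `seq_len` the sequences overlap, which yields more training examples from the same segment; whether the paper used overlapping sequences is not stated in the abstract.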
Pages: 150-160
Number of pages: 11