Static Music Emotion Recognition Using Recurrent Neural Networks

被引:5
作者
Grekow, Jacek [1 ]
机构
[1] Bialystok Tech Univ, Fac Comp Sci, Wiejska 45A, PL-15351 Bialystok, Poland
来源
FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2020) | 2020年 / 12117卷
关键词
Emotion detection; Audio features; Sequential data; Recurrent Neural Networks;
D O I
10.1007/978-3-030-59491-6_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The article presents experiments using recurrent neural networks for emotion detection for musical segments using Russell's circumplex model. A process of feature extraction and creating sequential data for learning networks with long short-term memory (LSTM) units is presented. Models were implemented using the WekaDeeplearning4j package and a number of experiments were carried out with data with different sets of features and varying segmentation. The usefulness of dividing data into sequences as well as the sense of using recurrent networks to recognize emotions in music, whose results have even exceeded the SVM algorithm for regression, were demonstrated. The author analyzed the effect of the network structure and the set of used features on the results of regressors recognizing values on two axes of the emotion model: arousal and valence.
引用
收藏
页码:150 / 160
页数:11
相关论文
共 16 条
[1]   Developing a benchmark for emotional analysis of music [J].
Aljanaki, Anna ;
Yang, Yi-Hsuan ;
Soleymani, Mohammad .
PLOS ONE, 2017, 12 (03)
[2]  
[Anonymous], 2000, Organised Sound, DOI [DOI 10.1017/S1355771800003071, 10.1017/S13557718 00003071]
[3]   EMOTION IN MOTION: INVESTIGATING THE TIME-COURSE OF EMOTIONAL JUDGMENTS OF MUSICAL STIMULI [J].
Bachorik, Justin Pierre ;
Bangert, Marc ;
Loui, Psyche ;
Larke, Kevin ;
Berger, Jeff ;
Rowe, Robert ;
Schlaug, Gottfried .
MUSIC PERCEPTION, 2009, 26 (04) :355-364
[4]  
Bogdanov D., 2013, P 14 INT SOC MUS INF, DOI [DOI 10.5281/ZENODO.1415016, DOI 10.1145/2502081.2502229]
[5]  
Chowdhury Shreyan, 2019, P 20 INT SOC MUS INF, P237
[6]  
Delbouys R., 2018, P 19 INT SOC MUS INF, P370
[7]   Learning to forget: Continual prediction with LSTM [J].
Gers, FA ;
Schmidhuber, J ;
Cummins, F .
NEURAL COMPUTATION, 2000, 12 (10) :2451-2471
[8]  
Grekow J., 2018, CONTENT BASED MUSIC, P13, DOI DOI 10.1007/978-3-319-70609-2
[9]   Music Emotion Maps in Arousal-Valence Space [J].
Grekow, Jacek .
COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT, CISIM 2016, 2016, 9842 :697-706
[10]   Audio Features Dedicated to the Detection of Four Basic Emotions [J].
Grekow, Jacek .
COMPUTER INFORMATION SYSTEMS AND INDUSTRIAL MANAGEMENT, 2015, 9339 :583-591