Music emotion recognition using recurrent neural networks and pretrained models

被引：0

作者：

Jacek Grekow

机构：

[1] Bialystok University of Technology,Faculty of Computer Science

来源：

Journal of Intelligent Information Systems | 2021年 / 57卷

关键词：

Emotion detection; Audio features; Sequential data; Recurrent neural networks;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

The article presents conducted experiments using recurrent neural networks for emotion detection in musical segments. Trained regression models were used to predict the continuous values of emotions on the axes of Russell’s circumplex model. A process of audio feature extraction and creating sequential data for learning networks with long short-term memory (LSTM) units is presented. Models were implemented using the WekaDeeplearning4j package and a number of experiments were carried out with data with different sets of features and varying segmentation. The usefulness of dividing the data into sequences as well as the point of using recurrent networks to recognize emotions in music, the results of which have even exceeded the SVM algorithm for regression, were demonstrated. The author analyzed the effect of the network structure and the set of used features on the results of the regressors recognizing values on two axes of the emotion model: arousal and valence. Finally, the use of a pretrained model for processing audio features and training a recurrent network with new sequences of features is presented.

引用

页码：531 / 546

页数：15

共 36 条

[1] Bachorik J(2009)Emotion in motion: Investigating the time-course of emotional judgments of musical stimuli Music Perception 26 355-364
[2] Bangert M(2000)Learning to forget: Continual prediction with LSTM Neural Computation 12 2451-2471
[3] Loui P(2018)Musical performance analysis in terms of emotions it evokes Journal of Intelligent Information Systems 51 415-437
[4] Larke K(2009)The WEKA data mining software: an update SIGKDD Explor Newsl 11 10-18
[5] Berger J(2019)WekaDeeplearning4j: A deep learning package for Weka based on Deeplearning4j Knowledge-Based Systems 178 48-50
[6] Rowe R(2006)Automatic mood detection and tracking of music audio signals Transactions on Audio, Speech, and Language Processing 14 5-18
[7] Schlaug G(2017)Labeling data and developing supervised framework for hindi music mood analysis Journal of Intelligent Information Systems 48 633-651
[8] Gers FA(2018)Multimodal mood classification of hindi and western songs Journal of Intelligent Information Systems 51 579-596
[9] Schmidhuber J(1980)A circumplex model of affect Journal of Personality and Social Psychology 39 1161-1178
[10] Cummins FA(2000)Marsyas: a framework for audio analysis Org Sound 4 169-175

← 1 2 3 4 →