Song Emotion Recognition Using Music Genre Information

被引:2
作者
Koutras, Athanasios [1 ]
机构
[1] TEI Western Greece, Informat & Mass Media Dept, Pyrgos 27100, Greece
来源
SPEECH AND COMPUTER, SPECOM 2017 | 2017年 / 10458卷
关键词
Music emotion; Music genre; ICA mixture models; CLASSIFICATION;
D O I
10.1007/978-3-319-66429-3_67
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Music Emotion Recognition (MER) is an important topic in music understanding, recommendation and retrieval that has gained great attention in the last years due to the constantly increasing number of people accessing digital musical content. In this paper we propose a new song emotion recognition system that takes into consideration the song's genre and we investigate the effect that genre has, on the recognition task of four basic music emotions of the valence-arousal (VA) plane: happy, angry, sad and peaceful. Experiments on a database consisting of 1100 songs from four different music genres (blues, country, pop and rock) using timbral, spectral, dynamical and chroma descriptors of the music, have shown that successful recognition of the song's genre as a pre-processing step, can improve the recognition of its emotion by a factor of 10-15%.
引用
收藏
页码:669 / 679
页数:11
相关论文
共 19 条
[1]  
[Anonymous], FREE MUSIC ARCHIVE H
[2]  
[Anonymous], 2013, Proceedings of the 21st ACM International Conference on Multimedia, DOI DOI 10.1145/2502081.2502224
[3]  
[Anonymous], 1989, The Biopsychology of Mood and Arousal
[4]  
Baniya Babu Kaji, 2015, 2015 17th International Conference on Advanced Communication Technology (ICACT), P434, DOI 10.1109/ICACT.2015.7224907
[5]   Blind signal separation: Statistical principles [J].
Cardoso, JF .
PROCEEDINGS OF THE IEEE, 1998, 86 (10) :2009-2025
[6]  
Eyben F, 2016, SPRINGER THESES-RECO, P1, DOI 10.1007/978-3-319-27299-3
[7]   A Survey of Audio-Based Music Classification and Annotation [J].
Fu, Zhouyu ;
Lu, Guojun ;
Ting, Kai Ming ;
Zhang, Dengsheng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (02) :303-319
[8]  
Kim YoungmooE., 2010, Proceedings of the 11th International Society for Music Information Retrieval Conference (ISMIR 2010), P255
[9]   Emotion recognition from speech: a review [J].
Koolagudi, Shashidhar G. ;
Rao, K. Sreenivasa .
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (02) :99-117
[10]   ICA mixture models for unsupervised classification of non-Gaussian classes and automatic context switching in blind signal separation [J].
Lee, TW ;
Lewicki, MS ;
Sejnowski, TJ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (10) :1078-1089