Music Genre and Emotion Recognition Using Gaussian Processes

被引:55
|
作者
Markov, Konstantin [1 ]
Matsui, Tomoko [2 ]
机构
[1] Univ Aizu, Div Informat Syst, Aizu Wakamatsu, Fukushima 9658580, Japan
[2] Inst Stat Math, Dept Stat Modeling, Tokyo 1068569, Japan
来源
IEEE ACCESS | 2014年 / 2卷
关键词
Music genre classification; music emotion estimation; Gaussian processes; PROCESS DYNAMICAL MODELS; CLASSIFICATION; REGRESSION;
D O I
10.1109/ACCESS.2014.2333095
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gaussian Processes (GPs) are Bayesian nonparametric models that are becoming more and more popular for their superior capabilities to capture highly nonlinear data relationships in various tasks, such as dimensionality reduction, time series analysis, novelty detection, as well as classical regression and classification tasks. In this paper, we investigate the feasibility and applicability of GP models for music genre classification and music emotion estimation. These are two of the main tasks in the music information retrieval (MIR) field. So far, the support vector machine (SVM) has been the dominant model used in MIR systems. Like SVM, GP models are based on kernel functions and Gram matrices; but, in contrast, they produce truly probabilistic outputs with an explicit degree of prediction uncertainty. In addition, there exist algorithms for GP hyperparameter learning-something the SVM framework lacks. In this paper, we built two systems, one for music genre classification and another for music emotion estimation using both SVM and GP models, and compared their performances on two databases of similar size. In all cases, the music audio signal was processed in the same way, and the effects of different feature extraction methods and their various combinations were also investigated. The evaluation experiments clearly showed that in both music genre classification and music emotion estimation tasks the GP performed consistently better than the SVM. The GP achieved a 13.6% relative genre classification error reduction and up to an 11% absolute increase of the coefficient of determination in the emotion estimation task.
引用
收藏
页码:688 / 697
页数:10
相关论文
共 50 条
  • [1] Song Emotion Recognition Using Music Genre Information
    Koutras, Athanasios
    SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 669 - 679
  • [2] Music Emotion Recognition Using Deep Gaussian Process
    Chen, Sih-Huei
    Lee, Yuan-Shan
    Hsieh, Wen-Chi
    Wang, Jia-Ching
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 495 - 498
  • [3] EVALUATING MUSIC EMOTION RECOGNITION: LESSONS FROM MUSIC GENRE RECOGNITION?
    Sturm, Bob L.
    ELECTRONIC PROCEEDINGS OF THE 2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS (ICMEW), 2013,
  • [4] A multi-genre model for music emotion recognition using linear regressors
    Griffiths, Darryl
    Cunningham, Stuart
    Weinel, Jonathan
    Picking, Richard
    JOURNAL OF NEW MUSIC RESEARCH, 2021, 50 (04) : 355 - 372
  • [5] Dynamic facial landmarking selection for emotion recognition using Gaussian processes
    Hernán F. García
    Mauricio A. Álvarez
    Álvaro A. Orozco
    Journal on Multimodal User Interfaces, 2017, 11 : 327 - 340
  • [6] Dynamic facial landmarking selection for emotion recognition using Gaussian processes
    Garcia, Hernan F.
    Alvarez, Mauricio A.
    Orozco, Alvaro A.
    JOURNAL ON MULTIMODAL USER INTERFACES, 2017, 11 (04) : 327 - 340
  • [7] MUSIC GENRE CLASSIFICATION USING GAUSSIAN PROCESS MODELS
    Markov, Konstantin
    Matsui, Tomoko
    2013 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2013,
  • [8] MUSIC EMOTION RECOGNITION WITH ADAPTIVE AGGREGATION OF GAUSSIAN PROCESS REGRESSORS
    Fukayama, Satoru
    Goto, Masataka
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 71 - 75
  • [9] EXPLOITING GENRE FOR MUSIC EMOTION CLASSIFICATION
    Lin, Yu-Ching
    Yang, Yi-Hsuan
    Chen, Homer H.
    Liao, I-Bin
    Ho, Yeh-Chin
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 618 - +
  • [10] Music Genre Recognition Using Residual Neural Networks
    Bisharad, Dipjyoti
    Laskar, Rabul Hussain
    PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 2063 - 2068