Music genre classification based on fusing audio and lyric information

被引:0
|
作者
You Li
Zhihai Zhang
Han Ding
Liang Chang
机构
[1] Guilin University of Electronic Technology,Guangxi Key Laboratory of Trusted Software
[2] Guilin University of Electronic Technology,School of Electronic Engineering and Automation
来源
Multimedia Tools and Applications | 2023年 / 82卷
关键词
Music genre classification; Audio information; Lyric information; Information fusion;
D O I
暂无
中图分类号
学科分类号
摘要
Music genre classification (MGC) has a wide range of application scenarios. Traditional MGC methods only consider either audio information or lyric information, resulting in an unsatisfactory recognition effect. In this paper, we propose a multimodal music genre classification framework that integrates both audio information and lyric information. By using the complementarity of multimodal information, music genres can be represented more comprehensively. First, the framework extracts the mel-spectrogram of audio, and a convolutional neural network is used to extract audio features. Simultaneously, BERT is used to obtain the distributed representation of the lyrics. Then, the two modal pieces of information are fused through different strategies, such as at the feature level and decision level. To solve the serious inconsistency between the convergence speed of the audio channel and the lyric channel, we adopt the strategy of asynchronous start training of two channels and different learning rates. A series of experiments are carried out to verify the effectiveness of the proposed model. The F1 score of the proposed model is 0.87 for music genre classification, which is approximately 4% higher than that of the best baseline in the experiment.
引用
收藏
页码:20157 / 20176
页数:19
相关论文
共 50 条
  • [21] Genre classification of symbolic pieces of music
    Armentano, Marcelo G.
    De Noni, Walter A.
    Cardoso, Hernan F.
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2017, 48 (03) : 579 - 599
  • [22] Genre classification of symbolic pieces of music
    Marcelo G. Armentano
    Walter A. De Noni
    Hernán F. Cardoso
    Journal of Intelligent Information Systems, 2017, 48 : 579 - 599
  • [23] Music genre classification based on auditory image, spectral and acoustic features
    Xin Cai
    Hongjuan Zhang
    Multimedia Systems, 2022, 28 : 779 - 791
  • [24] Music genre classification based on auditory image, spectral and acoustic features
    Cai, Xin
    Zhang, Hongjuan
    MULTIMEDIA SYSTEMS, 2022, 28 (03) : 779 - 791
  • [25] Music Genre and Melody Classification Using Embedding Based Topic Model
    Ramasubramanian, Vivek
    Ramasangu, Hariharan
    10TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTING AND COMMUNICATION TECHNOLOGIES, CONECCT 2024, 2024,
  • [26] COMPARISON OF DIFFERENT REPRESENTATIONS BASED ON NONLINEAR FEATURES FOR MUSIC GENRE CLASSIFICATION
    Zlatintsi, Athanasia
    Maragos, Petros
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 1547 - 1551
  • [27] Brain and Music: Music Genre Classification using Brain Signals
    Ghaemmaghami, Pouya
    Sebe, Nicu
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 708 - 712
  • [28] Music Genre Classification by Analyzing the Subband Spectrogram
    Chou, Chih-Hsun
    Liao, Bo-Jun
    2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, ELECTRONICS AND ELECTRICAL ENGINEERING (ISEEE), VOLS 1-3, 2014, : 1676 - +
  • [29] ON MUSIC GENRE CLASSIFICATION VIA COMPRESSIVE SAMPLING
    Sturm, Bob L.
    2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013), 2013,
  • [30] Music Genre Classification Using Transfer Learning
    Liang, Beici
    Gu, Minwei
    THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2020), 2020, : 392 - 393