Music genre classification based on fusing audio and lyric information

被引:0
|
作者
You Li
Zhihai Zhang
Han Ding
Liang Chang
机构
[1] Guilin University of Electronic Technology,Guangxi Key Laboratory of Trusted Software
[2] Guilin University of Electronic Technology,School of Electronic Engineering and Automation
来源
Multimedia Tools and Applications | 2023年 / 82卷
关键词
Music genre classification; Audio information; Lyric information; Information fusion;
D O I
暂无
中图分类号
学科分类号
摘要
Music genre classification (MGC) has a wide range of application scenarios. Traditional MGC methods only consider either audio information or lyric information, resulting in an unsatisfactory recognition effect. In this paper, we propose a multimodal music genre classification framework that integrates both audio information and lyric information. By using the complementarity of multimodal information, music genres can be represented more comprehensively. First, the framework extracts the mel-spectrogram of audio, and a convolutional neural network is used to extract audio features. Simultaneously, BERT is used to obtain the distributed representation of the lyrics. Then, the two modal pieces of information are fused through different strategies, such as at the feature level and decision level. To solve the serious inconsistency between the convergence speed of the audio channel and the lyric channel, we adopt the strategy of asynchronous start training of two channels and different learning rates. A series of experiments are carried out to verify the effectiveness of the proposed model. The F1 score of the proposed model is 0.87 for music genre classification, which is approximately 4% higher than that of the best baseline in the experiment.
引用
收藏
页码:20157 / 20176
页数:19
相关论文
共 50 条
  • [31] Robust handcrafted features for music genre classification
    Victor Hugo da Silva Muniz
    João Baptista de Oliveira e Souza Filho
    Neural Computing and Applications, 2023, 35 : 9335 - 9348
  • [32] A New Hierarchical Method for Music Genre Classification
    Du, Wei
    Lin, Hu
    Sun, Jianwei
    Yu, Bo
    Yang, Haibo
    2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, : 1033 - 1037
  • [33] Texture selection for automatic music genre classification
    Foleis, Juliano Henrique
    Tavares, Tiago Fernandes
    APPLIED SOFT COMPUTING, 2020, 89
  • [34] Music Genre Classification from Turkish Lyrics
    Coban, Onder
    Ozyer, Gulsah Tumuklu
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 101 - 104
  • [35] Feature Vector Design for Music Genre Classification
    da Silva Muniz, Victor Hugo
    de Oliveira e Souza Filho, Joao Baptista
    2021 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2021,
  • [36] Music Genre Classification with Word and Document Vectors
    Coban, Onder
    Karabey, Isil
    2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [37] On the Use of Feature Selection for Music Genre Classification
    Al-Tamimi, Abdel-Karim
    Salem, Maher
    Al-Alami, Ahmad
    2020 SEVENTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY TRENDS (ITT 2020), 2020, : 1 - 6
  • [38] A Study on Broadcast Networks for Music Genre Classification
    Heakl, Ahmed
    Abdelgawad, Abdelrahman
    Parque, Victor
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [39] Feature Mapping and Fusion for Music Genre Classification
    Balti, Haythem
    Frigui, Hichem
    2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 306 - 310
  • [40] Temporal feature integration for music genre classification
    Meng, Anders
    Ahrendt, Peter
    Larsen, Jan
    Hansen, Lars Kai
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (05): : 1654 - 1664