Transfer Learning with Deep Neural Embeddings for Music Classification Tasks

被引:0
作者
Modrzejewski, Mateusz [1 ]
Szachewicz, Piotr [1 ]
Rokita, Przemyslaw [1 ]
机构
[1] Warsaw Univ Technol, Fac Elect & Informat Technol, Div Comp Graph, Inst Comp Sci, Nowowiejska 15-19, PL-00665 Warsaw, Poland
来源
ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2022, PT I | 2023年 / 13588卷
关键词
Artificial intelligence; Music information retrieval; Music classification; Deep embeddings; Neural networks; Transfer learning;
D O I
10.1007/978-3-031-23492-7_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present an approach for transfer learning with deep neural embeddings applied to a selection of music information retrieval (MIR) classification tasks with several datasets. The tasks include genre recognition, speech/music distinguishing, predominant instrument recognition and performer identification. We propose the usage of pre-trained L-3 neural networks for feature extraction and apply several supervised classification algorithms to the obtained feature representations in order to compare their performance. The deep neural embedding representations are compared with traditional, hand-crafted features and are shown to outperform the baselines.
引用
收藏
页码:72 / 81
页数:10
相关论文
共 46 条
  • [41] Musical genre classification of audio signals
    Tzanetakis, G
    Cook, P
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (05): : 293 - 302
  • [42] Tzanetakis G, 1999, GTZAN MUSIC SPEECH C
  • [43] Wilkinghoff K, 2021, EUR SIGNAL PR CONF, P800, DOI 10.23919/Eusipco47968.2020.9287705
  • [44] Wilkins J., 2018, Proceedings of the 19th international society for music information retrieval conference, ISMIR 2018, paris, france, september 23-27, 2018, P468
  • [45] Won M., 2021, Music Classification: Beyond Supervised Learning, Towards Real-world Applications
  • [46] Won M, 2021, Arxiv, DOI arXiv:2111.13457