Transfer Learning with Deep Neural Embeddings for Music Classification Tasks

被引:0
作者
Modrzejewski, Mateusz [1 ]
Szachewicz, Piotr [1 ]
Rokita, Przemyslaw [1 ]
机构
[1] Warsaw Univ Technol, Fac Elect & Informat Technol, Div Comp Graph, Inst Comp Sci, Nowowiejska 15-19, PL-00665 Warsaw, Poland
来源
ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2022, PT I | 2023年 / 13588卷
关键词
Artificial intelligence; Music information retrieval; Music classification; Deep embeddings; Neural networks; Transfer learning;
D O I
10.1007/978-3-031-23492-7_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present an approach for transfer learning with deep neural embeddings applied to a selection of music information retrieval (MIR) classification tasks with several datasets. The tasks include genre recognition, speech/music distinguishing, predominant instrument recognition and performer identification. We propose the usage of pre-trained L-3 neural networks for feature extraction and apply several supervised classification algorithms to the obtained feature representations in order to compare their performance. The deep neural embedding representations are compared with traditional, hand-crafted features and are shown to outperform the baselines.
引用
收藏
页码:72 / 81
页数:10
相关论文
共 46 条
  • [1] Abu-El-Haija S., 2016, arXiv
  • [2] Huang CZA, 2018, Arxiv, DOI arXiv:1809.04281
  • [3] [Anonymous], 2017, 18 INT SOC MUS INF R
  • [4] Look, Listen and Learn
    Arandjelovic, Relja
    Zisserman, Andrew
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 609 - 617
  • [5] Bock S., 2019, Proceedings of the 20th International Society for Music Information Retrieval Conference (ISMIR), P486
  • [6] Bock S., 2020, Proceedings of the 21st International Society for Music Information Retrieval Conference (ISMIR), P574
  • [7] Bosch J. J., 2012, P 13 INT SOC MUS INF, P559
  • [8] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [9] Breiman L., 1984, INT GROUP, V432, P151
  • [10] Briot JP, 2019, Arxiv, DOI arXiv:1709.01620