Audio-Based Music Classification with DenseNet and Data Augmentation

被引:14
|
作者
Bian, Wenhao [1 ,2 ]
Wang, Jie [2 ]
Zhuang, Bojin [2 ]
Yang, Jiankui [1 ]
Wang, Shaojun [2 ]
Xiao, Jing [2 ]
机构
[1] Beijing Univ Posts & Telecommn, Beijing, Peoples R China
[2] Ping An Technol Shenzhen Co Ltd, Shenzhen, Peoples R China
来源
PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III | 2019年 / 11672卷
关键词
Music classification; Spectrogram; CNN; ResNet; DenseNet; Deep learning;
D O I
10.1007/978-3-030-29894-4_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, deep learning technique has received intense attention owing to its great success in image recognition. A tendency of adaption of deep learning in various information processing fields has formed, including music information retrieval (MIR). In this paper, we conduct a comprehensive study on music audio classification with improved convolutional neural networks (CNNs). To the best of our knowledge, this the first work to apply Densely Connected Convolutional Networks (DenseNet) to music audio tagging, which has been demonstrated to perform better than Residual neural network (ResNet). Additionally, two specific data augmentation approaches of time overlapping and pitch shifting have been proposed to address the deficiency of labelled data in the MIR. Moreover, an ensemble learning of stacking is employed based on SVM. We believe that the proposed combination of strong representation of DenseNet and data augmentation can be adapted to other audio processing tasks.
引用
收藏
页码:56 / 65
页数:10
相关论文
共 50 条
  • [1] Augmentation Methods on Monophonic Audio for Instrument Classification in Polyphonic Music
    Kratimenos, Agelos
    Avramidis, Kleanthis
    Garoufis, Christos
    Zlatintsi, Athanasia
    Maragos, Petros
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 156 - 160
  • [2] An Audio-Based Deep Learning Framework For BBC Television Programme Classification
    Lam Pham
    Baume, Chris
    Kong, Qiuqiang
    Hussain, Tassadaq
    Wang, Wenwu
    Plumbley, Mark
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 56 - 60
  • [3] Audio Songs Classification Based on Music Patterns
    Sharma, Rahul
    Murthy, Y. V. Srinivasa
    Koolagudi, Shashidhar G.
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGIES, IC3T 2015, VOL 3, 2016, 381 : 157 - 166
  • [4] Multi-view Neural Networks for Raw Audio-based Music Emotion Recognition
    He, Na
    Ferguson, Sam
    2020 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2020), 2020, : 168 - 172
  • [5] MULTI-VIEW AUDIO AND MUSIC CLASSIFICATION
    Phan, Huy
    Le Nguyen, Huy
    Chen, Oliver Y.
    Pham, Lam
    Koch, Philipp
    McLoughlin, Ian
    Mertins, Alfred
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 611 - 615
  • [6] Exploring Data Augmentation to Improve Music Genre Classification with ConvNets
    Aguiar, Rafael L.
    Costa, Yandre M. G.
    Silla Jr, Carlos N.
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [7] A NOVEL LIDAR DATA CLASSIFICATION ALGORITHM COMBINED DENSENET WITH STN
    Wang, Aili
    Wang, Minhui
    Jiang, Kaiyuan
    Zhao, Lanfei
    Iwahori, Yuji
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 2483 - 2486
  • [8] A survey on deep reinforcement learning for audio-based applications
    Latif, Siddique
    Cuayahuitl, Heriberto
    Pervez, Farrukh
    Shamshad, Fahad
    Ali, Hafiz Shehbaz
    Cambria, Erik
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (03) : 2193 - 2240
  • [9] Music genre classification using deep neural networks and data augmentation
    Ba, Thanh Chu
    Le, Thuy Dao Thi
    Trinh, Van Loan
    ENTERTAINMENT COMPUTING, 2025, 53
  • [10] Audio-based Near-Duplicate Video Retrieval with Audio Similarity Learning
    Avgoustinakis, Pavlos
    Kordopatis-Zilos, Giorgos
    Papadopoulos, Symeon
    Symeonidis, Andreas L.
    Kompatsiaris, Ioannis
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 5828 - 5835