PRE-TRAINED DEEP NEURAL NETWORK USING SPARSE AUTOENCODERS AND SCATTERING WAVELET TRANSFORM FOR MUSICAL GENRE RECOGNITION

Cited by: 6
Authors
Klec, Mariusz [1]
Korzinek, Danijel [1]
Affiliations
[1] Polish Japanese Acad Informat Technol, Warsaw, Poland
Source
COMPUTER SCIENCE-AGH | 2015 / Vol. 16 / No. 02
Keywords
Sparse Autoencoders; deep learning; genre recognition; Scattering Wavelet Transform;
DOI
10.7494/csci.2015.16.2.133
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
The research described in this paper combines Deep Neural Networks (DNN) with novel audio features extracted using the Scattering Wavelet Transform (SWT) to classify musical genres. The SWT uses a sequence of wavelet transforms to compute modulation spectrum coefficients of multiple orders, which have already been shown to be promising for this task. The layers of the DNN in this work are pre-trained using Sparse Autoencoders (SAE). Data obtained from the Creative Commons website jamendo.com is used to supplement the well-known GTZAN database, which is a standard benchmark for this task. The final classifier is tested using 10-fold cross-validation and achieves results similar to other state-of-the-art approaches.
Pages: 133 - 144
Number of pages: 12
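
The abstract states that the final classifier is tested using 10-fold cross-validation. The sketch below illustrates that evaluation protocol in general terms and is not the authors' code: the function names `k_fold_indices` and `cross_validate`, the `seed` parameter, and the plain shuffled (non-stratified) split are all assumptions made for illustration. The record does not say whether the folds were stratified by genre.

```python
import random


def k_fold_indices(n_samples, k=10, seed=0):
    """Yield (train_idx, test_idx) pairs for shuffled k-fold cross-validation.

    Hypothetical helper: a plain shuffled split, not necessarily the
    splitting scheme used in the paper.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Spread any remainder over the first folds so sizes differ by at most one.
    base, extra = divmod(n_samples, k)
    start = 0
    for fold in range(k):
        size = base + (1 if fold < extra else 0)
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, test_idx
        start += size


def cross_validate(n_samples, fit_and_score, k=10, seed=0):
    """Average the score from fit_and_score(train_idx, test_idx) over k folds."""
    scores = [fit_and_score(train, test)
              for train, test in k_fold_indices(n_samples, k, seed)]
    return sum(scores) / len(scores)


# GTZAN is commonly distributed as 1000 clips; the size is used here
# only to illustrate the fold sizes.
folds = list(k_fold_indices(1000, k=10))
```

In use, `fit_and_score` would train a classifier on the training indices and return its accuracy on the held-out indices, so each clip is used for testing exactly once across the ten folds.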