PRE-TRAINED DEEP NEURAL NETWORK USING SPARSE AUTOENCODERS AND SCATTERING WAVELET TRANSFORM FOR MUSICAL GENRE RECOGNITION

被引:6
作者
Klec, Mariusz [1 ]
Korzinek, Danijel [1 ]
机构
[1] Polish Japanese Acad Informat Technol, Warsaw, Poland
来源
COMPUTER SCIENCE-AGH | 2015年 / 16卷 / 02期
关键词
Sparse Autoencoders; deep learning; genre recognition; Scattering Wavelet Transform;
D O I
10.7494/csci.2015.16.2.133
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Research described in this paper tries to combine the approach of Deep Neural Networks (DNN) with the novel audio features extracted using the Scattering Wavelet Transform (SWT) for classifying musical genres. The SWT uses a sequence of Wavelet Transforms to compute the modulation spectrum coefficients of multiple orders, which has already shown to be promising for this task. The DNN in this work uses pre-trained layers using Sparse Autoencoders (SAE). Data obtained from the Creative Commons website jamendo.com is used to boost the well-known GTZAN database, which is a standard bench-mark for this task. The final classifier is tested using a 10-fold cross validation to achieve results similar to other state-of-the-art approaches.
引用
收藏
页码:133 / 144
页数:12
相关论文
共 50 条
  • [41] Learning time-frequency mask for noisy speech enhancement using gaussian-bernoulli pre-trained deep neural networks
    Saleem, Nasir
    Khattak, Muhammad Irfan
    Al-Hasan, Mu'ath
    Jan, Atif
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (01) : 849 - 864
  • [42] Three-dimensional prostate CT segmentation through fine-tuning of a pre-trained neural network using no reference labeling
    Caughlin, Kayla
    Shahedi, Maysam
    Shoag, Jonathan E.
    Barbieri, Christopher
    Margolis, Daniel
    Fei, Baowei
    MEDICAL IMAGING 2021: IMAGE-GUIDED PROCEDURES, ROBOTIC INTERVENTIONS, AND MODELING, 2021, 11598
  • [43] Compound damage detection using wavelet transform and deep neural network trained on healthy and single damage states: Validation on a laboratory-scale offshore jacket model
    Feng, Wei-Qiang
    Mousavi, Zohreh
    Lin, Jian-Fu
    Bayat, Meysam
    Ettefagh, Mir Mohammad
    Varahram, Sina
    STRUCTURAL HEALTH MONITORING-AN INTERNATIONAL JOURNAL, 2025,
  • [44] Maize leaf disease detection using convolutional neural network: a mobile application based on pre-trained VGG16 architecture
    Paul, Hansamali
    Udayangani, Hirunika
    Umesha, Kalani
    Lankasena, Nalaka
    Liyanage, Chamara
    Thambugala, Kasun
    NEW ZEALAND JOURNAL OF CROP AND HORTICULTURAL SCIENCE, 2024,
  • [45] Transfer learning by fine-tuning pre-trained convolutional neural network architectures for switchgear fault detection using thermal imaging
    Mahmoud, Karim A. A.
    Badr, Mohamed M.
    Elmalhy, Noha A.
    Hamdy, Ragi A.
    Ahmed, Shehab
    Mordi, Ahmed A.
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 103 : 327 - 342
  • [46] Threshold Active Learning Approach for Physical Violence Detection on Images Obtained from Video (Frame-Level) Using Pre-Trained Deep Learning Neural Network Models
    Abundez, Itzel M.
    Alejo, Roberto
    Primero, Francisco Primero
    Granda-Gutierrez, Everardo E.
    Portillo-Rodriguez, Otniel
    Velazquez, Juan Alberto Antonio
    ALGORITHMS, 2024, 17 (07)
  • [47] Automated ultrasonography of hepatocellular carcinoma using discrete wavelet transform based deep-learning neural network
    Rhyou, Se-Yeol
    Yoo, Jae-Chern
    MEDICAL IMAGE ANALYSIS, 2025, 101
  • [48] Wind speed forecasting method based on deep learning strategy using empirical wavelet transform, long short term memory neural network and Elman neural network
    Liu, Hui
    Mi, Xi-Wei
    Li, Yan-Fei
    ENERGY CONVERSION AND MANAGEMENT, 2018, 156 : 498 - 514
  • [49] Lamb wave-based damage detection of composite structures using deep convolutional neural network and continuous wavelet transform
    Wu, Jun
    Xu, Xuebing
    Liu, Cheng
    Deng, Chao
    Shao, Xinyu
    COMPOSITE STRUCTURES, 2021, 276
  • [50] Early prediction of pathological complete response to neoadjuvant chemotherapy in breast cancer MRI images using combined Pre-trained convolutional neural network and machine learning
    Khanna, Priyanka
    Sahu, Mridu
    Singh, Bikesh Kumar
    Bhateja, Vikrant
    MEASUREMENT, 2023, 207