PRE-TRAINED DEEP NEURAL NETWORK USING SPARSE AUTOENCODERS AND SCATTERING WAVELET TRANSFORM FOR MUSICAL GENRE RECOGNITION

被引:6
|
作者
Klec, Mariusz [1 ]
Korzinek, Danijel [1 ]
机构
[1] Polish Japanese Acad Informat Technol, Warsaw, Poland
来源
COMPUTER SCIENCE-AGH | 2015年 / 16卷 / 02期
关键词
Sparse Autoencoders; deep learning; genre recognition; Scattering Wavelet Transform;
D O I
10.7494/csci.2015.16.2.133
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Research described in this paper tries to combine the approach of Deep Neural Networks (DNN) with the novel audio features extracted using the Scattering Wavelet Transform (SWT) for classifying musical genres. The SWT uses a sequence of Wavelet Transforms to compute the modulation spectrum coefficients of multiple orders, which has already shown to be promising for this task. The DNN in this work uses pre-trained layers using Sparse Autoencoders (SAE). Data obtained from the Creative Commons website jamendo.com is used to boost the well-known GTZAN database, which is a standard bench-mark for this task. The final classifier is tested using a 10-fold cross validation to achieve results similar to other state-of-the-art approaches.
引用
收藏
页码:133 / 144
页数:12
相关论文
共 50 条
  • [21] Automated Classification of Urinary Cells: Using Convolutional Neural Network Pre-trained on Lung Cells
    Teramoto, Atsushi
    Michiba, Ayano
    Kiriyama, Yuka
    Sakurai, Eiko
    Shiroki, Ryoichi
    Tsukamoto, Tetsuya
    APPLIED SCIENCES-BASEL, 2023, 13 (03):
  • [22] Improving the accuracy using pre-trained word embeddings on deep neural networks for Turkish text classification
    Aydogan, Murat
    Karci, Ali
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2020, 541
  • [23] Research on Power Load Forecasting Using Deep Neural Network and Wavelet Transform
    Tan, Xiangyu
    Ao, Gang
    Qian, Guochao
    Zhou, Fangrong
    Power, Wenyun Li
    Liu, Chuanbin
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGIES AND SYSTEMS APPROACH, 2023, 16 (02)
  • [24] Diagnosis of Tomato Plant Diseases Using Pre-trained Architectures and A Proposed Convolutional Neural Network Model
    Koc, Dilara Gerdan
    Koc, Caner
    Vatandas, Mustafa
    JOURNAL OF AGRICULTURAL SCIENCES-TARIM BILIMLERI DERGISI, 2023, 29 (02): : 627 - 638
  • [25] Food Detection by Fine-Tuning Pre-trained Convolutional Neural Network Using Noisy Labels
    Alshomrani, Shroog
    Aljoudi, Lina
    Aljabri, Banan
    Al-Shareef, Sarah
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (07): : 182 - 190
  • [26] Identifying gross post-mortem organ images using a pre-trained convolutional neural network
    Garland, Jack
    Hu, Mindy
    Kesha, Kilak
    Glenn, Charley
    Morrow, Paul
    Stables, Simon
    Ondruschka, Benjamin
    Tse, Rexson
    JOURNAL OF FORENSIC SCIENCES, 2021, 66 (02): : 630 - 635
  • [27] Rapid seismic damage state prediction of the subway station structure using the pre-trained network and convolutional neural network
    Fan, Yifan
    Chen, Zhiyi
    Luo, Xiaowei
    SOIL DYNAMICS AND EARTHQUAKE ENGINEERING, 2024, 185
  • [28] Incident detection and classification in renewable energy news using pre-trained language models on deep neural networks
    Wang, Qiqing
    Li, Cunbin
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2022, 22 (01) : 57 - 76
  • [29] Human Monkeypox Classification from Skin Lesion Images with Deep Pre-trained Network using Mobile Application
    Veysel Harun Sahin
    Ismail Oztel
    Gozde Yolcu Oztel
    Journal of Medical Systems, 46
  • [30] Human Monkeypox Classification from Skin Lesion Images with Deep Pre-trained Network using Mobile Application
    Sahin, Veysel Harun
    Oztel, Ismail
    Yolcu Oztel, Gozde
    JOURNAL OF MEDICAL SYSTEMS, 2022, 46 (11)