Music Feature Recognition and Classification Using a Deep Learning Algorithm

被引:0
作者
Xu, Lihong [1 ]
Zhang, Shenghuan [2 ]
机构
[1] Minjiang Teachers Coll, Fuzhou 350108, Fujian, Peoples R China
[2] Jimei Univ, Mus Coll, Xiamen 361021, Fujian, Peoples R China
关键词
Deep learning; music type; feature extraction; recognition; classification; deep belief network; FEATURE-SELECTION; NETWORK;
D O I
10.1142/S1469026823500128
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studied music feature recognition and classification. First, the common signal features were analyzed, and the signal pre-processing method was introduced. Then, the Mel-Phon coefficient (MPC) was proposed as a feature for subsequent recognition and classification. The deep belief network (DBN) model was applied and improved by the gray wolf optimization (GWO) algorithm to get the GWO-DBN model. The experiments were conducted on GTZAN and free music archive (FMA) datasets. It was found that the best hidden-layer structure of DBN was 1440-960-480-300. Compared with machine learning methods such as decision trees, the DBN model had better classification performance in recognizing and classifying music types. The classification accuracy of the GWO-DBN model reached 75.67%. The experimental results demonstrate the reliability of the GWO-DBN model. The GWO-DBN model can be further promoted and applied in actual music research.
引用
收藏
页数:12
相关论文
共 21 条
  • [1] Akalp Hasan, 2021, ISEEIE 2021: 2021 International Symposium on Electrical, Electronics and Information Engineering, P408, DOI 10.1145/3459104.3459171
  • [2] Discrete space reinforcement learning algorithm based on support vector machine classification
    An, Yuexuan
    Ding, Shifei
    Shi, Songhui
    Li, Jingcan
    [J]. PATTERN RECOGNITION LETTERS, 2018, 111 : 30 - 35
  • [3] Feature selection based on MBFOA for audio signal classification under consideration of Gaussian white noise
    Arumugam, Muthumari
    Kaliappan, Mala
    [J]. IET SIGNAL PROCESSING, 2018, 12 (06) : 777 - 785
  • [4] Bhatia J. K., 2021 5 INT C INF SYS, P1
  • [5] Bouayad D., 2021, IOP C SER EARTH ENV, V696, P1
  • [6] Fusing MFCC and LPC Features Using 1D Triplet CNN for Speaker Recognition in Severely Degraded Audio Signals
    Chowdhury, Anurag
    Ross, Arun
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, 15 : 1616 - 1629
  • [7] Modelling of biodiesel blend using optimised deep belief network: blending waste cooking oil methyl ester with tyre pyrolysis oil
    Ganitha, Sujesh
    Ganesan, Subbiah
    Ramesh, Sengottuvelu
    [J]. IET RENEWABLE POWER GENERATION, 2020, 14 (16) : 3238 - 3251
  • [8] github, US
  • [9] He NY, 2019, INT CONF INFO SCI, P263, DOI [10.1109/icist.2019.8836733, 10.1109/ICIST.2019.8836733]
  • [10] Chaotic opposition-based grey-wolf optimization algorithm based on differential evolution and disruption operator for global optimization
    Ibrahim, Rehab Ali
    Abd Elaziz, Mohamed
    Lu, Songfeng
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 108 : 1 - 27