Music Feature Recognition and Classification Using a Deep Learning Algorithm

被引：0

作者：

Xu, Lihong ^{[1
]}

Zhang, Shenghuan ^{[2
]}

机构：

[1] Minjiang Teachers Coll, Fuzhou 350108, Fujian, Peoples R China

[2] Jimei Univ, Mus Coll, Xiamen 361021, Fujian, Peoples R China

来源：

INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS | 2023年 / 22卷 / 03期

关键词：

Deep learning; music type; feature extraction; recognition; classification; deep belief network; FEATURE-SELECTION; NETWORK;

D O I：

10.1142/S1469026823500128

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper studied music feature recognition and classification. First, the common signal features were analyzed, and the signal pre-processing method was introduced. Then, the Mel-Phon coefficient (MPC) was proposed as a feature for subsequent recognition and classification. The deep belief network (DBN) model was applied and improved by the gray wolf optimization (GWO) algorithm to get the GWO-DBN model. The experiments were conducted on GTZAN and free music archive (FMA) datasets. It was found that the best hidden-layer structure of DBN was 1440-960-480-300. Compared with machine learning methods such as decision trees, the DBN model had better classification performance in recognizing and classifying music types. The classification accuracy of the GWO-DBN model reached 75.67%. The experimental results demonstrate the reliability of the GWO-DBN model. The GWO-DBN model can be further promoted and applied in actual music research.

引用

页数：12

共 21 条

[1] Akalp Hasan, 2021, ISEEIE 2021: 2021 International Symposium on Electrical, Electronics and Information Engineering, P408, DOI 10.1145/3459104.3459171
[2] Discrete space reinforcement learning algorithm based on support vector machine classification
An, Yuexuan
Ding, Shifei
Shi, Songhui
Li, Jingcan
[J]. PATTERN RECOGNITION LETTERS, 2018, 111 : 30 - 35
[3] Feature selection based on MBFOA for audio signal classification under consideration of Gaussian white noise
Arumugam, Muthumari
Kaliappan, Mala
[J]. IET SIGNAL PROCESSING, 2018, 12 (06) : 777 - 785
[4] Bhatia J. K., 2021 5 INT C INF SYS, P1
[5] Bouayad D., 2021, IOP C SER EARTH ENV, V696, P1
[6] Fusing MFCC and LPC Features Using 1D Triplet CNN for Speaker Recognition in Severely Degraded Audio Signals
Chowdhury, Anurag
Ross, Arun
[J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, 15 : 1616 - 1629
[7] Modelling of biodiesel blend using optimised deep belief network: blending waste cooking oil methyl ester with tyre pyrolysis oil
Ganitha, Sujesh
Ganesan, Subbiah
Ramesh, Sengottuvelu
[J]. IET RENEWABLE POWER GENERATION, 2020, 14 (16) : 3238 - 3251
[8] github, US
[9] He NY, 2019, INT CONF INFO SCI, P263, DOI [10.1109/icist.2019.8836733, 10.1109/ICIST.2019.8836733]
[10] Chaotic opposition-based grey-wolf optimization algorithm based on differential evolution and disruption operator for global optimization
Ibrahim, Rehab Ali
Abd Elaziz, Mohamed
Lu, Songfeng
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 108 : 1 - 27

← 1 2 3 →