CNNbased musical instrument identification using time-frequency localized features

被引:7
作者
Dutta, Arindam [1 ]
Sil, Dibakar [2 ]
Chandra, Aniruddha [2 ]
Palit, Sarbani [3 ]
机构
[1] Natl Inst Technol, EE Dept, Durgapur, India
[2] Natl Inst Technol, ECE Dept, Durgapur 713209, India
[3] Indian Stat Inst, CVPR Unit, Kolkata, India
关键词
audio signal classification; convolutional neural network; instrument recognition; scalogram; SEPARATION;
D O I
10.1002/itl2.191
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
In this paper, the authors make an attempt to solve the convoluted problem of identifying musical instruments based on their audio excerpts, using a deep convolutional neural network. Continuous wavelet transform of audio signals are realized through Morse wavelet and two-dimensional feature maps are formed, which are then fed to a simple yet robust convolutional neural network. The outcome is appreciable in the sense that training the model with just 20% of the data and testing on the rest gives a classification accuracy of 85%.
引用
收藏
页数:6
相关论文
共 19 条
[1]   Automatic musical instrument classification using fractional fourier transform based- MFCC features and counter propagation neural network [J].
Bhalke, D. G. ;
Rao, C. B. Rama ;
Bormane, D. S. .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2016, 46 (03) :425-446
[2]   Musical Source Separation An introduction [J].
Cano, Estefania ;
FitzGerald, Derry ;
Liutkus, Antoine ;
Plumbley, Mark D. ;
Stoeter, Fabian-Robert .
IEEE SIGNAL PROCESSING MAGAZINE, 2019, 36 (01) :31-40
[3]  
Chon SH, 2017, EMPIR MUSICOL REV, V12, P116
[4]  
Ghosh A, 2018, 2018 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT - 2018), P509, DOI 10.1109/ICEECCOT43722.2018.9001486
[5]   Deep Convolutional Neural Networks for Predominant Instrument Recognition in Polyphonic Music [J].
Han, Yoonchang ;
Kim, Jaehun ;
Lee, Kyogu .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (01) :208-221
[6]  
Heittola T., 2009, P INT SOC MUS INF RE, P327
[7]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[8]  
Martin K.D., 1998, The Journal of the Acoustical Society of America, V104, P1768
[9]  
MASOOD S, 2015, ANNU IEEE IND CONF, P1
[10]  
Pons J, 2017, EUR SIGNAL PR CONF, P2744, DOI 10.23919/EUSIPCO.2017.8081710