A Comparative Study of DenseNets for Vietnamese Traditional Music Genre Classification

被引:0
作者
Huy Nhat Nguyen [1 ]
Hung Thanh Le [1 ]
Quan Anh Mai [1 ]
Dung Anh Huvnh [1 ]
Thanh Nhat Tieu [1 ]
Hung Tung Bui [1 ]
Huy Quang [1 ]
机构
[1] FPT Univ, Ho Chi Minh, Vietnam
来源
2024 21ST INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING, JCSSE 2024 | 2024年
关键词
Music genre classification; Convolutional Neural Networks; Densely Connected Convolutional Neural Networks; Vietnamese traditional music; Music;
D O I
10.1109/JCSSE61278.2024.10613709
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The rapid progress of artificial intelligence has led to a corresponding increase in the demand for classification of music genres. Numerous firms have enlisted architects with exceptional expertise for the execution of this project. However, it is crucial to recognize that Vietnamese traditional music, along with other types of Asian traditional music, has not attained the same degree of outstanding performance qualities as other musical genres. This study aimed to assess the effectiveness of two architectural designs, namely Late Fusion Convolutional Neural Networks and Densely Connected Convolutional Networks, through the utilization of diverse visual transformations of musical patterns. The ongoing inquiry involved doing a comparative study on a meticulously preserved dataset of Vietnamese traditional music. The results obtained from our research inquiry have shown the architectural design that is best suited for this particular undertaking. The results of our analysis indicate that Late Fusion Convolutional Neural Networks are a better suitable option for achieving this specific g oal. T his s tudy m akes a substantial contribution to the field of music information retrieval ( MIR) by investigating the effectiveness and precision of Densely Connected Convolutional Neural Networks (DenseNet)-based approaches in categorizing Vietnamese traditional music genres.
引用
收藏
页码:16 / 21
页数:6
相关论文
共 19 条
[1]   SHORT-TERM SPECTRAL ANALYSIS, SYNTHESIS, AND MODIFICATION BY DISCRETE FOURIER-TRANSFORM [J].
ALLEN, JB .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1977, 25 (03) :235-238
[2]   Music Information Retrieval: Recent Developments and Applications [J].
不详 .
FOUNDATIONS AND TRENDS IN INFORMATION RETRIEVAL, 2014, 8 (2-3) :128-+
[3]   Convolutional Neural Networks Approach for Music Genre Classification [J].
Cheng, Yu-Huei ;
Chang, Pang-Ching ;
Kuo, Che-Nan .
2020 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2020), 2021, :399-403
[4]  
eCampusOntario, 2024, Open source computer vision library
[5]   Array programming with NumPy [J].
Harris, Charles R. ;
Millman, K. Jarrod ;
van der Walt, Stefan J. ;
Gommers, Ralf ;
Virtanen, Pauli ;
Cournapeau, David ;
Wieser, Eric ;
Taylor, Julian ;
Berg, Sebastian ;
Smith, Nathaniel J. ;
Kern, Robert ;
Picus, Matti ;
Hoyer, Stephan ;
van Kerkwijk, Marten H. ;
Brett, Matthew ;
Haldane, Allan ;
del Rio, Jaime Fernandez ;
Wiebe, Mark ;
Peterson, Pearu ;
Gerard-Marchant, Pierre ;
Sheppard, Kevin ;
Reddy, Tyler ;
Weckesser, Warren ;
Abbasi, Hameer ;
Gohlke, Christoph ;
Oliphant, Travis E. .
NATURE, 2020, 585 (7825) :357-362
[6]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[7]  
Huang Gao, 2018, Vis. Pattern Recognit. (CVPR)
[8]  
King DB, 2015, ACS SYM SER, V1214, P1, DOI 10.1021/bk-2015-1214.ch001
[9]  
Logan B., 2000, ISMIR PLYMOUTH, P11
[10]  
Mao A., 2023, INT C MACHINE LEARNI