A Comparative Study of DenseNets for Vietnamese Traditional Music Genre Classification

被引:0
作者
Huy Nhat Nguyen [1 ]
Hung Thanh Le [1 ]
Quan Anh Mai [1 ]
Dung Anh Huvnh [1 ]
Thanh Nhat Tieu [1 ]
Hung Tung Bui [1 ]
Huy Quang [1 ]
机构
[1] FPT Univ, Ho Chi Minh, Vietnam
来源
2024 21ST INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING, JCSSE 2024 | 2024年
关键词
Music genre classification; Convolutional Neural Networks; Densely Connected Convolutional Neural Networks; Vietnamese traditional music; Music;
D O I
10.1109/JCSSE61278.2024.10613709
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The rapid progress of artificial intelligence has led to a corresponding increase in the demand for classification of music genres. Numerous firms have enlisted architects with exceptional expertise for the execution of this project. However, it is crucial to recognize that Vietnamese traditional music, along with other types of Asian traditional music, has not attained the same degree of outstanding performance qualities as other musical genres. This study aimed to assess the effectiveness of two architectural designs, namely Late Fusion Convolutional Neural Networks and Densely Connected Convolutional Networks, through the utilization of diverse visual transformations of musical patterns. The ongoing inquiry involved doing a comparative study on a meticulously preserved dataset of Vietnamese traditional music. The results obtained from our research inquiry have shown the architectural design that is best suited for this particular undertaking. The results of our analysis indicate that Late Fusion Convolutional Neural Networks are a better suitable option for achieving this specific g oal. T his s tudy m akes a substantial contribution to the field of music information retrieval ( MIR) by investigating the effectiveness and precision of Densely Connected Convolutional Neural Networks (DenseNet)-based approaches in categorizing Vietnamese traditional music genres.
引用
收藏
页码:16 / 21
页数:6
相关论文
共 19 条
  • [1] SHORT-TERM SPECTRAL ANALYSIS, SYNTHESIS, AND MODIFICATION BY DISCRETE FOURIER-TRANSFORM
    ALLEN, JB
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1977, 25 (03): : 235 - 238
  • [2] Music Information Retrieval: Recent Developments and Applications
    不详
    [J]. FOUNDATIONS AND TRENDS IN INFORMATION RETRIEVAL, 2014, 8 (2-3): : 128 - +
  • [3] Convolutional Neural Networks Approach for Music Genre Classification
    Cheng, Yu-Huei
    Chang, Pang-Ching
    Kuo, Che-Nan
    [J]. 2020 INTERNATIONAL SYMPOSIUM ON COMPUTER, CONSUMER AND CONTROL (IS3C 2020), 2021, : 399 - 403
  • [4] Array programming with NumPy
    Harris, Charles R.
    Millman, K. Jarrod
    van der Walt, Stefan J.
    Gommers, Ralf
    Virtanen, Pauli
    Cournapeau, David
    Wieser, Eric
    Taylor, Julian
    Berg, Sebastian
    Smith, Nathaniel J.
    Kern, Robert
    Picus, Matti
    Hoyer, Stephan
    van Kerkwijk, Marten H.
    Brett, Matthew
    Haldane, Allan
    del Rio, Jaime Fernandez
    Wiebe, Mark
    Peterson, Pearu
    Gerard-Marchant, Pierre
    Sheppard, Kevin
    Reddy, Tyler
    Weckesser, Warren
    Abbasi, Hameer
    Gohlke, Christoph
    Oliphant, Travis E.
    [J]. NATURE, 2020, 585 (7825) : 357 - 362
  • [5] He K., 2015, P IEEE C COMP VIS PA, DOI [10.1109/CVPR.2016.90, DOI 10.1109/CVPR.2016.90]
  • [6] Huang Gao, 2018, CVPR
  • [7] Itseez, 2015, Open Source Computer Vision 3.4.20
  • [8] Kingma D.P., 2014, arXiv, DOI [DOI 10.48550/ARXIV.1412.6980, 10.48550/arXiv.1412.6980]
  • [9] Logan Beth, 2000, Ismir, V270, P11
  • [10] Mao A., 2023, Cross-Entropy Loss Functions: Theoretical Analysis and Applications