Music Feature Classification Based on Recurrent Neural Networks with Channel Attention Mechanism

被引:14
作者
Gan, Jie [1 ]
机构
[1] Huanghuai Univ, Zhumadian 463000, Peoples R China
关键词
Bidirectional recurrent neural networks - Classification accuracy - Classification tasks - Classification technology - Convolution structure - Feature classification - Overall characteristics - Timing characteristics;
D O I
10.1155/2021/7629994
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the advancement of multimedia and digital technologies, music resources are rapidly increasing over the Internet, which changed listeners' habits from hard drives to online music platforms. It has allowed the researchers to use classification technologies for efficient storage, organization, retrieval, and recommendation of music resources. The traditional music classification methods use many artificially designed acoustic features, which require knowledge in the music field. The features of different classification tasks are often not universal. This paper provides a solution to this problem by proposing a novel recurrent neural network method with a channel attention mechanism for music feature classification. The music classification method based on a convolutional neural network ignores the timing characteristics of the audio itself. Therefore, this paper combines convolution structure with the bidirectional recurrent neural network and uses the attention mechanism to assign different attention weights to the output of the recurrent neural network at different times; the weights are assigned for getting a better representation of the overall characteristics of the music. The classification accuracy of the model on the GTZAN data set has increased to 93.1%. The AUC on the multilabel labeling data set MagnaTagATune has reached 92.3%, surpassing other comparison methods. The labeling of different music labels has been analyzed. This method has good labeling ability for most of the labels of music genres. Also, it has good performance on some labels of musical instruments, singing, and emotion categories.
引用
收藏
页数:10
相关论文
共 25 条
[1]  
Barbieri F., 2018, Trans Int Soc Music Inf Retr, V1, P21, DOI DOI 10.5334/TISMIR.10
[2]   Remote Sensing Image Classification Based on a Cross-Attention Mechanism and Graph Convolution [J].
Cai, Weiwei ;
Wei, Zhanguo .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[3]   TARDB-Net: triple-attention guided residual dense and BiLSTM networks for hyperspectral image classification [J].
Cai, Weiwei ;
Liu, Botao ;
Wei, Zhanguo ;
Li, Meilin ;
Kan, Jiangming .
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (07) :11291-11312
[4]   Music of the 7Ts: Predicting and Decoding Multivoxel fMRI Responses with Acoustic, Schematic, and Categorical Music Features [J].
Casey, Michael A. .
FRONTIERS IN PSYCHOLOGY, 2017, 8
[5]   The Relationship Between Age and Mental Health Among Adults in Iran During the COVID-19 Pandemic [J].
Chen, Jiyao ;
Zhang, Stephen X. ;
Wang, Yifei ;
Afshar Jahanshahi, Asghar ;
Mokhtari Dinani, Maryam ;
Nazarian Madavani, Abbas ;
Nawaser, Khaled .
INTERNATIONAL JOURNAL OF MENTAL HEALTH AND ADDICTION, 2022, 20 (05) :3162-3177
[6]  
Choi K., 2017, A tutorial on deep learning for music information retrieval
[7]  
Holzapfel A, 2018, Transactions of the International Society for Music Information Retrieval, V1, P44, DOI DOI 10.5334/TISMIR.13
[8]   A Resource-Efficient Hybrid Proxy Mobile IPv6 Extension for Next-Generation IoT Networks [J].
Hussain, Anwar ;
Nazir, Shah ;
Khan, Fazlullah ;
Nkenyereye, Lewis ;
Ullah, Ayaz ;
Khan, Sulaiman ;
Verma, Sahil ;
Kavita .
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (03) :2095-2103
[9]   A Secured and Reliable Continuous Transmission Scheme in Cognitive HARQ-Aided Internet of Things [J].
Khan, Fazlullah ;
Rehman, Ateeq Ur ;
Zhang, Yanliang ;
Mastorakis, Spyridon ;
Song, Houbing ;
Jan, Mian Ahmad ;
Dev, Kapal .
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (19) :14835-14844
[10]   Deep Reinforcement Learning for Communication Flow Control in Wireless Mesh Networks [J].
Liu, Qingzhi ;
Cheng, Long ;
Jia, Adele Lu ;
Liu, Cong .
IEEE NETWORK, 2021, 35 (02) :112-119