SINGING MELODY EXTRACTION FROM POLYPHONIC MUSIC BASED ON SPECTRAL CORRELATION MODELING

被引:2
作者
Du, Xingjian [1 ]
Zhu, Bilei [1 ]
Kong, Qiuglang [1 ]
Ma, Zejun [1 ]
机构
[1] Bytedance AI Lab, Beijing, Peoples R China
来源
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021年
关键词
Melody extraction; convolutional neural network (CNN); global spectral information; spectral correlation modeling; center frequency encoding;
D O I
10.1109/ICASSP39728.2021.9414190
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Convolutional neural network (CNN) based methods have achieved state-of-the-art performance for singing melody extraction from polyphonic music. However, most of these methods focus on the learning of local features, while relationships among spectral components locating far apart are often neglected. In this paper, we explore the idea of modeling spectral correlation explicitly for melody extraction. Specifically, we present a spectral correlation module (SCM) that can learn to model the relationships among all frequency bands in a time-frequency representation, thus allowing the encoding of global spectral information into a conventional CNN. Furthermore, we propose to integrate center frequencies with the input feature map of SCM to improve the performance. We implement a light-weight model comprised of SCM blocks to verify the efficacy of our system. Our system achieves a state-of-the-art overall accuracy of 83.5% on the MedleyDB dataset.
引用
收藏
页码:241 / 245
页数:5
相关论文
共 15 条
[1]  
Basaran D., 2018, 19 INT SOC MUS INF R, P82, DOI 10.5281/zenodo.1492349
[2]  
Bittner R. M., 2014, P INT SOC MUS INF RE, P155
[3]  
Goto S, 2003, IEEE MTT-S, P229, DOI 10.1109/MWSYM.2003.1210922
[4]  
Hsieh TH, 2019, INT CONF ACOUST SPEE, P156, DOI [10.1109/icassp.2019.8682389, 10.1109/ICASSP.2019.8682389]
[5]  
Huang G., 2017, P IEEE C COMP VIS PA, VVolume 1, P4700, DOI DOI 10.1109/CVPR.2017.243
[6]  
Kum S., 2020, INT SOC MUS INF RETR
[7]   Joint Detection and Classification of Singing Voice Melody Using Convolutional Recurrent Neural Networks [J].
Kum, Sangeun ;
Nam, Juhan .
APPLIED SCIENCES-BASEL, 2019, 9 (07)
[8]  
Liu Lu, 2020, INT C LEARN REPR ICL
[9]  
Liu R, 2018, ADV NEUR IN, V31
[10]  
Luo P., 2019, P ICLR, P1