F0 ESTIMATION USING BLIND SOURCE SEPARATION FOR ANALYZING NOH SINGING

被引:0
作者
Tamoto, Atsuki [1 ]
Itou, Katunobu [2 ]
机构
[1] Hosei Univ, Grad Sch Comp & Informat Sci, Koganei, Tokyo, Japan
[2] Hosei Univ, Fac Comp & Informat Sci, Koganei, Tokyo, Japan
来源
PROCEEDINGS OF THE 2020 IEEE 30TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP) | 2020年
关键词
Noh singing; Melody; F0; source separation; CNN; U-Net; Melodia; MELODY EXTRACTION;
D O I
10.1109/mlsp49062.2020.9231812
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The purpose of this study is to extract singing melody from mixed sounds related to Noh performances. Noh sounds include singing, accompaniments, and other elements. For analyzing Noh singing, we need singing solos, but they are hard to collect since there are only a few sources of solo passages. Therefore, we focus on the extraction of singing melody from mixtures of accompaniments and singing. In this paper, we demonstrate that source separation can be introduced as an efficient preprocessing step for Noh singing melody extraction. In addition, we compare melody extraction based on a convolutional neural network (CNN) approach with Melodia, a plug-in for melody extraction which is particularly accurate in the presence of music with wide fluctuations in pitch. We also demonstrate that CNN-based melody estimation can be efficiently trained using singing after source separation.
引用
收藏
页数:6
相关论文
共 10 条
[1]  
[Anonymous], 2017, INT SOC MUSIC INFORM
[2]  
Chen MT, 2019, INT CONF ACOUST SPEE, P1005, DOI 10.1109/ICASSP.2019.8683630
[3]  
Chou H, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P381, DOI 10.1109/ICASSP.2018.8461483
[4]  
Gomez E., 2012, P INT SOC MUSIC INFO, P601
[5]  
Huang X, 2014, INT CONF NANO MICRO, P562
[6]   Joint Singing Voice Separation and F0 Estimation with Deep U-Net Architectures [J].
Jansson, Andreas ;
Bittner, Rachel M. ;
Ewert, Sebastian ;
Weyde, Tillman .
2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
[7]  
Raffel Colin, 2014, 15 INT SOC MUS INF R
[8]   Melody Extraction from Polyphonic Music Signals [J].
Salamon, Justin ;
Gomez, Emilia ;
Ellis, Daniel P. W. ;
Richard, Gael .
IEEE SIGNAL PROCESSING MAGAZINE, 2014, 31 (02) :118-134
[9]   Melody Extraction From Polyphonic Music Signals Using Pitch Contour Characteristics [J].
Salamon, Justin ;
Gomez, Emilia .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (06) :1759-1770
[10]  
Su H, 2016, INT CONF ACOUST SPEE, P579, DOI 10.1109/ICASSP.2016.7471741