Exploring vibrato-motivated acoustic features for singer identification

被引:51
作者
Nwe, Tin Lay [1 ]
Li, Haizhou [1 ]
机构
[1] Inst Infocomm Res, HCM, Speech & Dialogue Proc Lab, Singapore 119613, Singapore
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2007年 / 15卷 / 02期
关键词
music information retrieval; music knowledge; singer identification; vibrato;
D O I
10.1109/TASL.2006.876756
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Vibrato is a slightly tremulous effect imparted to vocal or instrumental tone for added warmth and expressiveness through slight variation in pitch. It corresponds to a periodic fluctuation of the fundamental frequency. It is common for a singer to develop a vibrato function to personalize his/her singing style. In this paper, we explore the acoustic features that reflect vibrato information in order to identify singers of popular music. We start with an enhanced vocal detection method that allows us to select vocal segments with high confidence. From the selected vocal segments, the cepstral coefficients which reflect the vibrato characteristics are computed. These coefficients are derived using bandpass filters, such as parabolic and cascaded bandpass filters, spread according to the octave frequency scale. The strategy of our classifier formulation is to utilize the high level musical knowledge of song structure in singer modeling. Singer identification is validated on a database containing 84 popular songs from commercially available CD recordings from 12 singers. We achieve an average error rate of 16.2% in segment level identification.
引用
收藏
页码:519 / 530
页数:12
相关论文
共 38 条
[1]   Singing voice identification using spectral envelope estimation [J].
Bartsch, MA ;
Wakefield, GH .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (02) :100-109
[2]  
BARTSCH MA, 2004, THESIS U MICHIGAN AN
[3]  
BECCHETTI C, 1998, SPEECH RECOGNITION T
[4]  
Berenzweig A., 2002, P 22 AES INT C ESP F
[5]   Locating singing voice segments within music signals [J].
Berenzweig, AL ;
Ellis, DPW .
PROCEEDINGS OF THE 2001 IEEE WORKSHOP ON THE APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2001, :119-122
[6]   Measurements of vibrato parameters in long sustained crescendo notes as sung by ten sopranos [J].
Bretos, J ;
Sundberg, J .
JOURNAL OF VOICE, 2003, 17 (03) :343-352
[7]  
CHOU W, 2000, P IEEEI NT C AC SPEE, V2, P865
[8]  
DAVID LJ, UNDERSTANDING VIBRAT
[9]  
DEJONCKERE PH, 1995, VIBRATO, pCH2
[10]   Vibrato rate adjustment [J].
Dromey, C ;
Carter, N ;
Hopkin, A .
JOURNAL OF VOICE, 2003, 17 (02) :168-178