A multi-layer F0 model for singing voice synthesis using a B-spline representation with intuitive controls

被引:0
作者
Ardaillon, Luc [1 ]
Degottex, Gilles [1 ]
Roebel, Axel [1 ]
机构
[1] Univ Paris 06, Sorbonne Univ, CNRS, IRCAM,UMR STMS, Paris, France
来源
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5 | 2015年
关键词
singing voice synthesis; F0; model; concatenative synthesis; VIBRATO;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In singing voice, the fundamental frequency (F0) carries not only melody, but also music style, personal expressivity and other characteristics specific to voice production mechanism. The F0 modeling is therefore critical for a natural -sounding and expressive synthesis. In addition, for artistic purposes, composers also need to have control over expressive parameters of the F0 curve, which is missing in many current approaches. This paper presents a novel parametric F0 model for singing voice synthesis with intuitive control of expressive parameters. The proposed approach considers the various F0 variations of the singing voice as separate layers using B -splines to model the melodic component. This model has been implemented in a concatenative singing voice synthesis system and its perceived naturalness has been evaluated through listening tests. The validity of each layer is first evaluated independently, and the full model is then compared to real F0 curves from professional singers. The results of these tests suggest that the model is suitable to produce natural and expressive F0 contours.
引用
收藏
页码:3375 / +
页数:2
相关论文
共 26 条
[1]  
Bogaards N., 2004, P INT COMP MUS C ICM
[2]  
Bonada J., 2008, THESIS U POMPEII FAB
[3]  
Bonan J, 2003, Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, P1
[4]   Measurements of vibrato parameters in long sustained crescendo notes as sung by ten sopranos [J].
Bretos, J ;
Sundberg, J .
JOURNAL OF VOICE, 2003, 17 (03) :343-352
[5]  
Ikemiya Yukara, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P3127, DOI 10.1109/ICASSP.2014.6854176
[6]  
Kenmochi H., 2007, PROC 8 ANN C INT SPE, P4009
[7]  
Lee SW, 2012, INT CONF ACOUST SPEE, P429, DOI 10.1109/ICASSP.2012.6287908
[8]  
Liuni M., 2013, MUSICA TECNOLOGIA, V7
[9]  
Lolive D, 2006, P 11 INT C SPEECH CO, P333
[10]  
Macon M., 1997, Audio Engineering Society Convention, V103