A proposal of vibrato feature to reflect magnitude of fluctuation of fundamental frequency for evaluation similarity in singing voice

被引:0
作者
Suzuki C. [1 ]
Banno H. [1 ]
Asahi K. [1 ]
Morise M. [2 ]
机构
[1] Graduate School of Meijo University, 1-501, Shiogamaguchi, Tenpaku-ku, Nagoya
[2] University of Yamanashi, 4-4-37, Takeda, Koufu
关键词
Fundamental frequency; Singing voice; Vibrato;
D O I
10.1541/ieejeiss.137.1607
中图分类号
C [社会科学总论];
学科分类号
03 ; 0303 ;
摘要
In this paper, we propose a new vibrato feature in order to measure temporal characteristics of vibrato in singing voice. The data to evaluate the vibrato feature includes non-imitative singing voice and imitative singing voice of target singer, whose voice and singing style have strong individuality. The data also includes the target singer's voice extracted from CD. The proposed vibrato feature has an advantage that the feature properly reflects the magnitude of fluctuations of fundamental frequency. The experimental results indicated that fluctuations of fundamental frequency of the imitative singing are larger than those of the non-imitative singing, and the magnitude of the proposed vibrato feature of the imitative singing is close to that of CD. We also performed objective evaluation by using the database that measures a distance between features calculated from imitative voice or non-imitative voice and CD's voice. The evaluation showed that the proposed vibrato feature correctly indicates smaller distance between the imitative voice and the CD's voice. © 2017 The Institute of Electrical Engineers of Japan.
引用
收藏
页码:1607 / 1614
页数:7
相关论文
共 5 条
[1]  
Sundberg J., Research on the singing voice in retrospect, TMH-QPSR, 45, 1, pp. 11-22, (2003)
[2]  
Zetterholm E., A comparative survey of phonetic features of two impersonators, Proceedings of Fonetik TMH-QPSR, 44, 1, pp. 129-132, (2002)
[3]  
Kawahara H., STRAIGHT, exploitation of the other aspect of VOCODER:Perceptually isomorphic decomposition of speech sounds, Acoustic Science and Technology, 27, 6, pp. 349-353, (2006)
[4]  
Johnston J.D., Transform coding of audio signals using perceptual noise criteria, IEEE Journal on Selected Areas in Communications, 6, 2, pp. 314-323, (1988)
[5]  
Madhu N., Note on measures for spectral flatness, Electronics Letters, 45, 23, pp. 1195-1196, (2009)