CUTE: A CONCATENATIVE METHOD FOR VOICE CONVERSION USING EXEMPLAR-BASED UNIT SELECTION

被引:0
作者
Jin, Zeyu [1 ,2 ]
Finkelstein, Adam [1 ]
DiVerdi, Stephen [2 ]
Lu, Jingwan [2 ]
Mysore, Gautham J. [2 ]
机构
[1] Princeton Univ, Princeton, NJ 08540 USA
[2] Adobe Res, San Francisco, CA 94103 USA
来源
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS | 2016年
关键词
Voice conversion; unit selection; concatenative synthesis; exemplar-based;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
State-of-the art voice conversion methods re-synthesize voice from spectral representations such as MFCCs and STRAIGHT, thereby introducing muffled artifacts. We propose a method that circumvents this concern using concatenative synthesis coupled with exemplarbased unit selection. Given parallel speech from source and target speakers as well as a new query from the source, our method stitches together pieces of the target voice. It optimizes for three goals: matching the query, using long consecutive segments, and smooth transitions between the segments. To achieve these goals, we perform unit selection at the frame level and introduce triphonebased preselection that greatly reduces computation and enforces selection of long, contiguous pieces. Our experiments show that the proposed method has better quality than baseline methods, while preserving high individuality.
引用
收藏
页码:5660 / 5664
页数:5
相关论文
共 21 条
[1]  
Aihara R., 2014, ICASSP 2014
[2]  
[Anonymous], 2010, THEORY APPL DIGITAL
[3]  
[Anonymous], P IEEE INT C AC SPEE
[4]  
[Anonymous], 2009, Text-to-speech synthesis
[5]   YIN, a fundamental frequency estimator for speech and music [J].
de Cheveigné, A ;
Kawahara, H .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 111 (04) :1917-1930
[6]  
Desai S., 2009, ICASSP 2009
[7]  
Dutoit T., 2007, ICASSP 2007
[8]   VITERBI ALGORITHM [J].
FORNEY, GD .
PROCEEDINGS OF THE IEEE, 1973, 61 (03) :268-278
[9]  
Fujii K., 2007, INT J ELECT COMPUTER, V1, P1617
[10]  
HON HW, 1991, INT CONF ACOUST SPEE, P889, DOI 10.1109/ICASSP.1991.150482