High quality voice conversion through phoneme-based linear mapping functions with STRAIGHT for mandarin

被引:49
作者
Liu, Kun [1 ]
Zhang, Jianping [1 ]
Yan, Yonghong [1 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Beijing 100083, Peoples R China
来源
FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 4, PROCEEDINGS | 2007年
关键词
voice conversion; formant transitions; main vowel; phoneme-based mapping functions;
D O I
10.1109/FSKD.2007.347
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A novel voice conversion system using, phoneme-based linear mapping functions on main vowel phonemes is proposed in this paper. Our voice conversion algorithm has the, following three improvements. First, instead of has no all the Vocal Tract Resonance (VTR) vectors in the portion of a phoneme, we use the VTR vector at the steady-state of each phoneme to train phoneme-based GMM. Second, different linear mapping functions have been trained to describe the mapping relationships for corresponding phonemes. Third, in the transformation procedure. the transformed formant frequencies at the main vowel phonemes are obtained using the corresponding GMM. Besides, prosody parameters are also transformed. Finally the converted speech is re-synthesized with the transformed parameters by high quality speech manipulation framework STRAIGHT (Speech Transformation and Representation based on Adaptive Interpolation of weiGHTed spectrogram). Perceptual results for F-M and M-F conversion show that our MOS score of the converted voice is improved from 3.8 to 4.1 and ABX score front 3.3 to 3.8 compared with IBM's system. Comparisons with other systems are also given in this paper.
引用
收藏
页码:410 / 414
页数:5
相关论文
共 22 条
  • [1] ABE M, 1998, P IEEE INT C AC SPEE, P665
  • [2] [Anonymous], P ICSLP
  • [3] Speaker Transformation Algorithm using Segmental Codebooks (STASC)
    Arslan, LM
    [J]. SPEECH COMMUNICATION, 1999, 28 (03) : 211 - 226
  • [4] BOCCARDI F, 2001, P 4 COST G 6 WORKSH
  • [5] HUI Y, ICASSP 2004 MONTR CA
  • [6] KAIN A, 2001, P ICASSP SALT LAK CI
  • [7] KAIN A, P IEEE ICASSP 1998, V1, P285
  • [8] KAWAHARA H, 2006, ICASSP, P1303
  • [9] Kim E. K., 1997, P EUR, P2519
  • [10] KUN L, ICASSP 07