A ANN BASED HIGH QUALITY METHOD FOR VOICE CONVERSION

被引:0
|
作者
Chen, Z. [1 ]
Zhang, L. H. [1 ]
机构
[1] Nanjing Univ Post & Telecommun, Coll Telecommun & Informat Engn, Nanjing, Jiangsu, Peoples R China
来源
2010 6TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS NETWORKING AND MOBILE COMPUTING (WICOM) | 2010年
关键词
voice conversion; ANN; GMM; pitch conversion; TRANSFORMATION;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we describe a novel conversion method for voice conversion (VC). Artificial Neural Network (ANN) model is employed for performing joint spectrum and pitch conversion between speakers. The conventional method converts spectral parameters and pitch independently. Those separate transformations lead to an unsatisfactory speech quality. The main reason maybe that F-0 sequences are usually converted by a simply linear function. To overcome this problem, we apply joint parameters for train and conversion. A comparative study of voice conversion with ANN and Gaussian Mixture Model (GMM) is conducted. Experimental results indicate that the performance of VC can be dramatically improved by the proposed method in view of both subjective evaluation and objective measurement.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] An Improved ANN Method Based on Clustering Optimization for Voice Conversion
    Chen Xiantong
    Zhang Linghua
    2014 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), VOLS 1-2, 2014, : 464 - 469
  • [2] Design and Implementation of Voice Conversion System Based on GMM and ANN
    Yang, Man
    Que, Dashun
    Li, Bei
    MULTIMEDIA AND SIGNAL PROCESSING, 2012, 346 : 624 - 631
  • [3] High Quality Voice Conversion based on ISODATA Clustering Algorithm
    Li, Yanping
    Zuo, Yutao
    Yang, Zhen
    Shao, Xi
    2017 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (IEEE ISKE), 2017,
  • [4] IMPROVING VOICE QUALITY OF HMM-BASED SPEECH SYNTHESIS USING VOICE CONVERSION METHOD
    Jiao, Yishan
    Xie, Xiang
    Na, Xingyu
    Tu, Ming
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [5] Comparing ANN and GMM in a voice conversion framework
    Laskar, R. H.
    Chakrabarty, D.
    Talukdar, F. A.
    Rao, K. Sreenivasa
    Banerjee, K.
    APPLIED SOFT COMPUTING, 2012, 12 (11) : 3332 - 3342
  • [6] Runtime and Speech Quality Survey of a Voice Conversion Method
    Jokisch, Oliver
    Birhanu, Yitagessu
    Hoffmann, Ruediger
    2013 IEEE EUROCON, 2013, : 1684 - 1688
  • [7] Modeling glottal source for high quality voice conversion
    Sun, Jun
    Dai, Beiqian
    Zhang, Jian
    Xie, Yanlu
    WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 319 - 319
  • [8] High-quality Voice Conversion Using Spectrogram-Based WaveNet Vocoder
    Chen, Kuan
    Chen, Bo
    Lai, Jiahao
    Yu, Kai
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1993 - 1997
  • [9] A novel method for voice conversion based on non-parallel corpus
    Sayadian A.
    Mozaffari F.
    International Journal of Speech Technology, 2017, 20 (3) : 587 - 592
  • [10] Voice Conversion Using Dynamic Features for High Quality Transformation
    Wang, Wei
    Yang, Zhen
    SECOND INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING, 2010, 7546