A ANN BASED HIGH QUALITY METHOD FOR VOICE CONVERSION

被引：0

作者：

Chen, Z. ^{[1
]}

Zhang, L. H. ^{[1
]}

机构：

[1] Nanjing Univ Post & Telecommun, Coll Telecommun & Informat Engn, Nanjing, Jiangsu, Peoples R China

来源：

2010 6TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS NETWORKING AND MOBILE COMPUTING (WICOM) | 2010年

关键词：

voice conversion; ANN; GMM; pitch conversion; TRANSFORMATION;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we describe a novel conversion method for voice conversion (VC). Artificial Neural Network (ANN) model is employed for performing joint spectrum and pitch conversion between speakers. The conventional method converts spectral parameters and pitch independently. Those separate transformations lead to an unsatisfactory speech quality. The main reason maybe that F-0 sequences are usually converted by a simply linear function. To overcome this problem, we apply joint parameters for train and conversion. A comparative study of voice conversion with ANN and Gaussian Mixture Model (GMM) is conducted. Experimental results indicate that the performance of VC can be dramatically improved by the proposed method in view of both subjective evaluation and objective measurement.

引用

页数：4

共 50 条

[1] An Improved ANN Method Based on Clustering Optimization for Voice Conversion
Chen Xiantong
Zhang Linghua
2014 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), VOLS 1-2, 2014, : 464 - 469
[2] Design and Implementation of Voice Conversion System Based on GMM and ANN
Yang, Man
Que, Dashun
Li, Bei
MULTIMEDIA AND SIGNAL PROCESSING, 2012, 346 : 624 - 631
[3] High Quality Voice Conversion based on ISODATA Clustering Algorithm
Li, Yanping
Zuo, Yutao
Yang, Zhen
Shao, Xi
2017 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND KNOWLEDGE ENGINEERING (IEEE ISKE), 2017,
[4] IMPROVING VOICE QUALITY OF HMM-BASED SPEECH SYNTHESIS USING VOICE CONVERSION METHOD
Jiao, Yishan
Xie, Xiang
Na, Xingyu
Tu, Ming
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[5] Comparing ANN and GMM in a voice conversion framework
Laskar, R. H.
Chakrabarty, D.
Talukdar, F. A.
Rao, K. Sreenivasa
Banerjee, K.
APPLIED SOFT COMPUTING, 2012, 12 (11) : 3332 - 3342
[6] Runtime and Speech Quality Survey of a Voice Conversion Method
Jokisch, Oliver
Birhanu, Yitagessu
Hoffmann, Ruediger
2013 IEEE EUROCON, 2013, : 1684 - 1688
[7] Modeling glottal source for high quality voice conversion
Sun, Jun
Dai, Beiqian
Zhang, Jian
Xie, Yanlu
WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 319 - 319
[8] High-quality Voice Conversion Using Spectrogram-Based WaveNet Vocoder
Chen, Kuan
Chen, Bo
Lai, Jiahao
Yu, Kai
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1993 - 1997
[9] A novel method for voice conversion based on non-parallel corpus
Sayadian A.
Mozaffari F.
International Journal of Speech Technology, 2017, 20 (3) : 587 - 592
[10] Voice Conversion Using Dynamic Features for High Quality Transformation
Wang, Wei
Yang, Zhen
SECOND INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING, 2010, 7546

← 1 2 3 4 5 →