Enhancement of Spectral Tilt in Synthesized Speech

被引：6

作者：

Sharma, Bidisha ^{[1
]}

Prasanna, S. R. Mahadeva ^{[1
]}

机构：

[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, India

来源：

IEEE SIGNAL PROCESSING LETTERS | 2017年 / 24卷 / 04期

关键词：

Spectral tilt; speech enhancement; SPSS; TTS;

D O I：

10.1109/LSP.2017.2662805

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The research in statistical parametric speech synthesis is towards improving naturalness and intelligibility. In this work, the deviation in spectral tilt of the natural and synthesized speech is analyzed and observed a large gap between the two. Furthermore, the same is analyzed for different classes of sounds, namely low-vowels, mid-vowels, high-vowels, semi-vowels, nasals, and found to be varying with category of sound units. Based on variation, a novel method for spectral tilt enhancement is proposed, where the amount of enhancement introduced is different for different classes of sound units. The proposed method yields improvement in terms of intelligibility, naturalness, and speaker similarity of the synthesized speech.

引用

页码：382 / 386

页数：5

共 25 条

[1] Spectral tilt change in stop consonant perception
Alexander, Joshua M.
Kluender, Keith R.
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 123 (01) : 386 - 396
[2] [Anonymous], 2004, 5 ISCA WORKSH SPEECH
[3] [Anonymous], 1999, P EUROSPEECH
[4] [Anonymous], 2011, P INT
[5] [Anonymous], 2006, P BLIZZ CHALL WORKSH
[6] A Deep Generative Architecture for Postfiltering in Statistical Parametric Speech Synthesis
Chen, Ling-Hui
Raitio, Tuomo
Valentini-Botinhao, Cassia
Ling, Zhen-Hua
Yamagishi, Junichi
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 2003 - 2014
[7] VOCAL QUALITY FACTORS - ANALYSIS, SYNTHESIS, AND PERCEPTION
CHILDERS, DG
LEE, CK
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1991, 90 (05) : 2394 - 2410
[8] Voiced/Nonvoiced Detection Based on Robustness of Voiced Epochs
Dhananjaya, N.
Yegnanarayana, B.
[J]. IEEE SIGNAL PROCESSING LETTERS, 2010, 17 (03) : 273 - 276
[9] Grancharov V., 2008, Springer Handbook of Speech Processing, P83, DOI DOI 10.1007/978-3-540-49127-9_5
[10] Jokinen E, 2014, 2014 14TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), P164, DOI 10.1109/IWAENC.2014.6953999

← 1 2 3 →