Enhancement of Spectral Tilt in Synthesized Speech

Cited by: 6
Authors
Sharma, Bidisha [1]
Prasanna, S. R. Mahadeva [1]
Affiliations
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, India
Keywords
Spectral tilt; speech enhancement; SPSS; TTS
DOI
10.1109/LSP.2017.2662805
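Chinese Library Classification number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809
Abstract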
Research in statistical parametric speech synthesis aims to improve naturalness and intelligibility. In this work, the deviation in spectral tilt between natural and synthesized speech is analyzed, and a large gap is observed between the two. The same analysis is then carried out for different classes of sounds, namely low vowels, mid vowels, high vowels, semivowels, and nasals, and the deviation is found to vary with the category of sound unit. Based on this variation, a novel method for spectral tilt enhancement is proposed, in which the amount of enhancement differs across classes of sound units. The proposed method improves the intelligibility, naturalness, and speaker similarity of the synthesized speech.
Pages: 382-386
Page count: 5