Formant speech synthesis

被引:0
|
作者
Pinto, N.B.
Childers, D.G.
机构
来源
Journal of the Institution of Electronics and Telecommunication Engineers | 1988年 / 34卷 / 01期
关键词
Vocoders;
D O I
暂无
中图分类号
学科分类号
摘要
This paper describes analysis and synthesis methods for a digital formant synthesizer. It is shown that synthetic speech generated using excitation pulses which resemble the true glottal volume-velocity excitation waveform is preferred over speech synthesized using a two pole glottal filter and impulse excitation. A series of algorithms for voice/unvoiced/mixed/silent interval clasification, pitch detection, and formant estimation and racking are described. We have also initiated an investigation into the feasibility of using the digital formant synthesizer to study the acoustic correlates of voice quality. A number of experiments involving male/female voice conversion, and the stimulation of various vocal characteristics, such as breathiness, roughness, and vocal fry, were undertaken. The results have helped to establish the importance of various acoustic features as descriptors of specific voice qualities.
引用
收藏
页码:5 / 20
相关论文
共 42 条
  • [1] New adaptive formant vocoder
    Yi, Hu
    He, Dehuan
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 1991, 2 (01): : 89 - 96
  • [2] VOCAINE THE VOCODER AND APPLICATIONS IN SPEECH SYNTHESIS
    Agiomyrgiannakis, Yannis
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4230 - 4234
  • [3] VOCBENCH: A NEURAL VOCODER BENCHMARK FOR SPEECH SYNTHESIS
    AlBadawy, Ehab A.
    Gibiansky, Andrew
    He, Qing
    Wu, Jilong
    Chang, Ming-Ching
    Lyu, Siwei
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 881 - 885
  • [4] Speech Enhancement With Integration of Neural Homomorphic Synthesis and Spectral Masking
    Jiang, Wenbin
    Yu, Kai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1758 - 1770
  • [5] SPEECH ANALYSIS AND SYNTHESIS BECOME PRACTICAL ON MU-C CHIP
    SECREST, B
    ARJMAND, M
    NI, M
    ELECTRONIC DESIGN, 1982, 30 (11) : 129 - &
  • [6] Lhasa-Tibetan Speech Synthesis Using End-to-End Model
    Zhao, Yue
    Hu, Panhua
    Xu, Xiaona
    Wu, Licheng
    Li, Xiali
    IEEE ACCESS, 2019, 7 (140305-140311) : 140305 - 140311
  • [7] Denoising-and-Dereverberation Hierarchical Neural Vocoder for Statistical Parametric Speech Synthesis
    Ai, Yang
    Ling, Zhen-Hua
    Wu, Wei-Lu
    Li, Ang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2036 - 2048
  • [8] A Neural Vocoder With Hierarchical Generation of Amplitude and Phase Spectra for Statistical Parametric Speech Synthesis
    Ai, Yang
    Ling, Zhen-Hua
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 (839-851) : 839 - 851
  • [9] Japanese Neural Incremental Text-to-Speech Synthesis Framework With an Accent Phrase Input
    Yanagita, Tomoya
    Sakti, Sakriani
    Nakamura, Satoshi
    IEEE ACCESS, 2023, 11 : 22355 - 22363
  • [10] NaturalSpeech: End-to-End Text-to-Speech Synthesis With Human-Level Quality
    Tan, Xu
    Chen, Jiawei
    Liu, Haohe
    Cong, Jian
    Zhang, Chen
    Liu, Yanqing
    Wang, Xi
    Leng, Yichong
    Yi, Yuanhao
    He, Lei
    Zhao, Sheng
    Qin, Tao
    Soong, Frank
    Liu, Tie-Yan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (06) : 4234 - 4245