Formant speech synthesis

被引:0
作者
Pinto, N.B.
Childers, D.G.
机构
来源
Journal of the Institution of Electronics and Telecommunication Engineers | 1988年 / 34卷 / 01期
关键词
Vocoders;
D O I
暂无
中图分类号
学科分类号
摘要
This paper describes analysis and synthesis methods for a digital formant synthesizer. It is shown that synthetic speech generated using excitation pulses which resemble the true glottal volume-velocity excitation waveform is preferred over speech synthesized using a two pole glottal filter and impulse excitation. A series of algorithms for voice/unvoiced/mixed/silent interval clasification, pitch detection, and formant estimation and racking are described. We have also initiated an investigation into the feasibility of using the digital formant synthesizer to study the acoustic correlates of voice quality. A number of experiments involving male/female voice conversion, and the stimulation of various vocal characteristics, such as breathiness, roughness, and vocal fry, were undertaken. The results have helped to establish the importance of various acoustic features as descriptors of specific voice qualities.
引用
收藏
页码:5 / 20
相关论文
共 42 条
  • [41] ANALYSIS AND SYNTHESIS OF STRONG VOCAL EXPRESSIONS: EXTENSION AND APPLICATION OF AUDIO TEXTURE FEATURES TO SINGING VOICE
    Kawahara, Hideki
    Morise, Masanori
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5389 - 5392
  • [42] FGP-GAN: Fine-Grained Perception Integrated Generative Adversarial Network for Expressive Mandarin Singing Voice Synthesis
    Liu, Xin
    Zhang, Weiwei
    Zheng, Zhaohui
    Pan, Mingyang
    Wang, Rong
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (03) : 6054 - 6063