Formant speech synthesis

被引：0

作者：

Pinto, N.B.

Childers, D.G.

机构：

来源：

Journal of the Institution of Electronics and Telecommunication Engineers | 1988年 / 34卷 / 01期

关键词：

Vocoders;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This paper describes analysis and synthesis methods for a digital formant synthesizer. It is shown that synthetic speech generated using excitation pulses which resemble the true glottal volume-velocity excitation waveform is preferred over speech synthesized using a two pole glottal filter and impulse excitation. A series of algorithms for voice/unvoiced/mixed/silent interval clasification, pitch detection, and formant estimation and racking are described. We have also initiated an investigation into the feasibility of using the digital formant synthesizer to study the acoustic correlates of voice quality. A number of experiments involving male/female voice conversion, and the stimulation of various vocal characteristics, such as breathiness, roughness, and vocal fry, were undertaken. The results have helped to establish the importance of various acoustic features as descriptors of specific voice qualities.

引用

页码：5 / 20

共 42 条

[41] ANALYSIS AND SYNTHESIS OF STRONG VOCAL EXPRESSIONS: EXTENSION AND APPLICATION OF AUDIO TEXTURE FEATURES TO SINGING VOICE
Kawahara, Hideki
Morise, Masanori
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5389 - 5392
[42] FGP-GAN: Fine-Grained Perception Integrated Generative Adversarial Network for Expressive Mandarin Singing Voice Synthesis
Liu, Xin
Zhang, Weiwei
Zheng, Zhaohui
Pan, Mingyang
Wang, Rong
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (03) : 6054 - 6063

← 1 2 3 4 5 →