Formant speech synthesis

被引：0

作者：

Pinto, N.B.

Childers, D.G.

机构：

来源：

Journal of the Institution of Electronics and Telecommunication Engineers | 1988年 / 34卷 / 01期

关键词：

Vocoders;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This paper describes analysis and synthesis methods for a digital formant synthesizer. It is shown that synthetic speech generated using excitation pulses which resemble the true glottal volume-velocity excitation waveform is preferred over speech synthesized using a two pole glottal filter and impulse excitation. A series of algorithms for voice/unvoiced/mixed/silent interval clasification, pitch detection, and formant estimation and racking are described. We have also initiated an investigation into the feasibility of using the digital formant synthesizer to study the acoustic correlates of voice quality. A number of experiments involving male/female voice conversion, and the stimulation of various vocal characteristics, such as breathiness, roughness, and vocal fry, were undertaken. The results have helped to establish the importance of various acoustic features as descriptors of specific voice qualities.

引用

页码：5 / 20

共 42 条

[21] SelfRemaster: Self-Supervised Speech Restoration for Historical Audio Resources
Saeki, Takaaki
Takamichi, Shinnosuke
Nakamura, Tomohiko
Tanji, Naoko
Saruwatari, Hiroshi
IEEE ACCESS, 2023, 11 : 144831 - 144843
[22] A Joint Framework of Denoising Autoencoder and Generative Vocoder for Monaural Speech Enhancement
Du, Zhihao
Zhang, Xueliang
Han, Jiqing
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1493 - 1505
[23] Emotional Text-To-Speech in Japanese Using Artificially Augmented Dataset
Khalifah, Mujahid Jamal A.
Ptaszynski, Michal
Masui, Fumito
IEEE ACCESS, 2024, 12 : 167724 - 167777
[24] The perceptual quality of MELP speech over error tolerant IP networks
Gavula, Ben
Scheets, George
Teague, Keith
Weber, Justin
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 1633 - 1636
[25] MIXED FOURIER WALSH TRANSFORM SCHEME FOR SPEECH CODING AT 4.0 KBIT/S
SPANIAS, AS
LOIZOU, PC
IEE PROCEEDINGS-I COMMUNICATIONS SPEECH AND VISION, 1992, 139 (05): : 473 - 481
[26] Neural Speech and Audio Coding: Modern AI technology meets traditional codecs
Kim, Minje
Skoglund, Jan
IEEE SIGNAL PROCESSING MAGAZINE, 2024, 41 (06) : 85 - 93
[27] Improved Alias-and-Separate Speech Coding Framework With Minimal Algorithmic Delay
Lee, Eunkyun
Beack, Seungkwon
Shin, Jong Won
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2024, 18 (08) : 1414 - 1426
[28] Subjective SNR measure for quality assessment of speech coders - a cross language study
Nakatsui, Mamoru
Noda, Hideki
Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi), 1994, 15 (06): : 377 - 381
[29] Hands-Free Devices for Displaying Speech and Language in the Tactile Modality - Methods and Approaches
Kappers, Astrid M. L.
Plaisier, Myrthe A.
IEEE TRANSACTIONS ON HAPTICS, 2021, 14 (03) : 465 - 478
[30] Combination and Comparison of Sound Coding Strategies Using Cochlear Implant Simulation With Mandarin Speech
Huang, Enoch Hsin-Ho
Wu, Chao-Min
Lin, Hung-Ching
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2021, 29 : 2407 - 2416

← 1 2 3 4 5 →