Formant speech synthesis

被引：0

作者：

Pinto, N.B.

Childers, D.G.

机构：

来源：

Journal of the Institution of Electronics and Telecommunication Engineers | 1988年 / 34卷 / 01期

关键词：

Vocoders;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This paper describes analysis and synthesis methods for a digital formant synthesizer. It is shown that synthetic speech generated using excitation pulses which resemble the true glottal volume-velocity excitation waveform is preferred over speech synthesized using a two pole glottal filter and impulse excitation. A series of algorithms for voice/unvoiced/mixed/silent interval clasification, pitch detection, and formant estimation and racking are described. We have also initiated an investigation into the feasibility of using the digital formant synthesizer to study the acoustic correlates of voice quality. A number of experiments involving male/female voice conversion, and the stimulation of various vocal characteristics, such as breathiness, roughness, and vocal fry, were undertaken. The results have helped to establish the importance of various acoustic features as descriptors of specific voice qualities.

引用

页码：5 / 20

共 42 条

[1] New adaptive formant vocoder
Yi, Hu
He, Dehuan
Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 1991, 2 (01): : 89 - 96
[2] VOCAINE THE VOCODER AND APPLICATIONS IN SPEECH SYNTHESIS
Agiomyrgiannakis, Yannis
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4230 - 4234
[3] VOCBENCH: A NEURAL VOCODER BENCHMARK FOR SPEECH SYNTHESIS
AlBadawy, Ehab A.
Gibiansky, Andrew
He, Qing
Wu, Jilong
Chang, Ming-Ching
Lyu, Siwei
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 881 - 885
[4] Speech Enhancement With Integration of Neural Homomorphic Synthesis and Spectral Masking
Jiang, Wenbin
Yu, Kai
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1758 - 1770
[5] SPEECH ANALYSIS AND SYNTHESIS BECOME PRACTICAL ON MU-C CHIP
SECREST, B
ARJMAND, M
NI, M
ELECTRONIC DESIGN, 1982, 30 (11) : 129 - &
[6] Lhasa-Tibetan Speech Synthesis Using End-to-End Model
Zhao, Yue
Hu, Panhua
Xu, Xiaona
Wu, Licheng
Li, Xiali
IEEE ACCESS, 2019, 7 (140305-140311) : 140305 - 140311
[7] Denoising-and-Dereverberation Hierarchical Neural Vocoder for Statistical Parametric Speech Synthesis
Ai, Yang
Ling, Zhen-Hua
Wu, Wei-Lu
Li, Ang
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2036 - 2048
[8] A Neural Vocoder With Hierarchical Generation of Amplitude and Phase Spectra for Statistical Parametric Speech Synthesis
Ai, Yang
Ling, Zhen-Hua
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 (839-851) : 839 - 851
[9] Japanese Neural Incremental Text-to-Speech Synthesis Framework With an Accent Phrase Input
Yanagita, Tomoya
Sakti, Sakriani
Nakamura, Satoshi
IEEE ACCESS, 2023, 11 : 22355 - 22363
[10] NaturalSpeech: End-to-End Text-to-Speech Synthesis With Human-Level Quality
Tan, Xu
Chen, Jiawei
Liu, Haohe
Cong, Jian
Zhang, Chen
Liu, Yanqing
Wang, Xi
Leng, Yichong
Yi, Yuanhao
He, Lei
Zhao, Sheng
Qin, Tao
Soong, Frank
Liu, Tie-Yan
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (06) : 4234 - 4245

← 1 2 3 4 5 →