Formant speech synthesis

Authors
Pinto, N.B.
Childers, D.G.
Source
Journal of the Institution of Electronics and Telecommunication Engineers | 1988, Vol. 34, No. 1
Keywords
Vocoders
DOI
Not available
Abstract
This paper describes analysis and synthesis methods for a digital formant synthesizer. It is shown that synthetic speech generated using excitation pulses which resemble the true glottal volume-velocity excitation waveform is preferred over speech synthesized using a two-pole glottal filter and impulse excitation. A series of algorithms for voiced/unvoiced/mixed/silent interval classification, pitch detection, and formant estimation and tracking is described. We have also initiated an investigation into the feasibility of using the digital formant synthesizer to study the acoustic correlates of voice quality. A number of experiments involving male/female voice conversion and the simulation of various vocal characteristics, such as breathiness, roughness, and vocal fry, were undertaken. The results have helped to establish the importance of various acoustic features as descriptors of specific voice qualities.
Pages: 5-20
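Illustrative sketch
The abstract contrasts two excitation sources for a formant synthesizer: an impulse train shaped by a two-pole glottal filter, and pulses that resemble the glottal volume-velocity waveform. The Python sketch below is not taken from the paper; it only illustrates the general idea under stated assumptions. The second-order resonator form, the Rosenberg-style trigonometric pulse, the pole value, the pulse timing fractions, and the formant values in `EXAMPLE_FORMANTS` are all assumptions chosen for illustration, and lip radiation is omitted.

```python
import math

def resonator_coeffs(freq_hz, bandwidth_hz, fs):
    """Coefficients of y[n] = a*x[n] + b*y[n-1] + c*y[n-2] with unit gain at DC."""
    c = -math.exp(-2.0 * math.pi * bandwidth_hz / fs)
    b = 2.0 * math.exp(-math.pi * bandwidth_hz / fs) * math.cos(2.0 * math.pi * freq_hz / fs)
    a = 1.0 - b - c
    return a, b, c

def resonate(x, freq_hz, bandwidth_hz, fs):
    """Pass signal x through one second-order formant resonator."""
    a, b, c = resonator_coeffs(freq_hz, bandwidth_hz, fs)
    y1 = y2 = 0.0
    out = []
    for s in x:
        y = a * s + b * y1 + c * y2
        out.append(y)
        y2, y1 = y1, y
    return out

def impulse_train(n_samples, f0_hz, fs):
    """One unit impulse per pitch period."""
    period = int(round(fs / f0_hz))
    return [1.0 if n % period == 0 else 0.0 for n in range(n_samples)]

def two_pole_glottal_filter(x, pole=0.95):
    """Low-pass shaping with a double real pole (pole value is an assumption)."""
    g = (1.0 - pole) ** 2  # normalizes the gain at DC to 1
    y1 = y2 = 0.0
    out = []
    for s in x:
        y = g * s + 2.0 * pole * y1 - pole * pole * y2
        out.append(y)
        y2, y1 = y1, y
    return out

def glottal_pulse_train(n_samples, f0_hz, fs, rise_frac=0.4, fall_frac=0.16):
    """Rosenberg-style trigonometric pulse repeated every pitch period.

    rise_frac and fall_frac are assumed fractions of the period spent in the
    opening and closing phases; the rest of the period is the closed phase.
    """
    period = int(round(fs / f0_hz))
    tp = max(1, int(round(rise_frac * period)))
    tn = max(1, int(round(fall_frac * period)))
    out = []
    for n in range(n_samples):
        k = n % period
        if k < tp:
            out.append(0.5 * (1.0 - math.cos(math.pi * k / tp)))
        elif k < tp + tn:
            out.append(math.cos(math.pi * (k - tp) / (2.0 * tn)))
        else:
            out.append(0.0)
    return out

# Hypothetical (frequency, bandwidth) pairs in Hz for a vowel-like sound.
EXAMPLE_FORMANTS = [(700.0, 130.0), (1220.0, 70.0), (2600.0, 160.0)]

def synthesize(excitation, formants, fs):
    """Cascade formant synthesis: feed the excitation through each resonator in turn."""
    signal = list(excitation)
    for freq_hz, bandwidth_hz in formants:
        signal = resonate(signal, freq_hz, bandwidth_hz, fs)
    return signal

if __name__ == "__main__":
    fs, f0, n = 8000, 100.0, 800
    filtered_impulses = two_pole_glottal_filter(impulse_train(n, f0, fs))
    glottal_pulses = glottal_pulse_train(n, f0, fs)
    speech_a = synthesize(filtered_impulses, EXAMPLE_FORMANTS, fs)
    speech_b = synthesize(glottal_pulses, EXAMPLE_FORMANTS, fs)
    print(len(speech_a), len(speech_b))
```

Both sources drive the same resonator cascade, so any audible difference between the two outputs comes from the excitation shape alone, which is the comparison the abstract reports on.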