Formant estimation from speech signal using the magnitude spectrum modified with group delay spectrum

被引:3
|
作者
Chowdhury, Husne Ara [1 ]
Rahman, Mohammad Shahidur [1 ]
机构
[1] Shahjalal Univ Sci & Technol, Dept Comp Sci & Engn, Sylhet 3114, Bangladesh
关键词
Modified spectrum; Phase domain analysis; Spectrogram; GD spectrum; Glottal formant effect; SYSTEM;
D O I
10.1250/ast.42.93
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The magnitude spectrum is a popular mathematical tool for speech signal analysis. In this paper, we propose a new technique for improving the performance of the magnitude spectrum by utilizing the benefits of the group delay (GD) spectrum to estimate the characteristics of a vocal tract accurately. The traditional magnitude spectrum suffers from difficulties when estimating vocal tract characteristics, particularly for high-pitched speech owing to its low resolution and high spectral leakage. After phase domain analysis, it is observed that the GD spectrum has low spectral leakage and high resolution for its additive property. Thus, the magnitude spectrum modified with its GD spectrum, referred to as the modified spectrum, is found to significantly improve the estimation of formant frequency over traditional methods. The accuracy is tested on synthetic vowels for a wide range of fundamental frequencies up to the high-pitched female speaker range. The validity of the proposed method is also verified by inspecting the formant contour of an utterance from the Texas Instruments and Massachusetts Institute of Technology (TIMIT) database and standard F2-F1 plot of natural vowel speech spoken by male and female speakers. The result is compared with two state-of-the-art methods. Our proposed method performs better than both of these two methods.
引用
收藏
页码:93 / 102
页数:10
相关论文
共 50 条
  • [1] Multipath delay estimation using the magnitude spectrum
    Hickman, Granger
    Krolik, Jeffrey
    2005 IEEE/SP 13TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), VOLS 1 AND 2, 2005, : 615 - 620
  • [2] Improved estimation of evolutionary spectrum based on short time Fourier transforms and modified magnitude group delay by signal decomposition
    Lakshminarayana, H.K.
    Bhat, J.S.
    Mahesh, H.M.
    World Academy of Science, Engineering and Technology, 2009, 35 : 800 - 811
  • [3] Formant estimation of high-pitched noisy speech using homomorphic deconvolution of higher-order group delay spectrum
    Chowdhury, Husne Ara
    Rahman, Mohammad Shahidur
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2023, 44 (02) : 84 - 92
  • [4] Estimation of Glottal Closure Instants from Telephone Speech using a Group Delay-Based Approach that Considers Speech Signal as a Spectrum
    Rachel, G. Anushiya
    Vijayalakshmi, P.
    Nagarajan, T.
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1181 - 1185
  • [5] Time Delay Estimation for Speech Signal Based on FOC-Spectrum
    Liu, Hong
    Li, Xiaofei
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1730 - 1733
  • [6] Estimation of Evolutionary spectrum based on STFT and modified group delay
    Narasimhan, SV
    Pavanalatha, S
    IEEE TENCON 2003: CONFERENCE ON CONVERGENT TECHNOLOGIES FOR THE ASIA-PACIFIC REGION, VOLS 1-4, 2003, : 1199 - 1203
  • [7] Blind Channel Magnitude Response Estimation in Speech Using Spectrum Classification
    Gaubitch, Nikolay D.
    Brookes, Mike
    Naylor, Patrick A.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (10): : 2162 - 2171
  • [8] SIGNIFICANCE OF GROUP DELAY FUNCTIONS IN SPECTRUM ESTIMATION
    YEGNANARAYANA, B
    MURTHY, HA
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1992, 40 (09) : 2281 - 2289
  • [9] MAGNITUDE SPECTRUM SPEECH HIDING
    Rabie, Tamer
    Guerchi, Driss
    ICSPC: 2007 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1-3, PROCEEDINGS, 2007, : 1147 - 1150
  • [10] Simultaneous Speech Detection and Magnitude Squared Spectrum Estimation Approach for Speech Enhancement
    Han, Ruirui
    Ou, Shifeng
    Liu, Wei
    Chen, Chen
    Zhang, Shuo
    PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 281 - 285