Formant estimation from speech signal using the magnitude spectrum modified with group delay spectrum

被引：3

作者：

Chowdhury, Husne Ara ^{[1
]}

Rahman, Mohammad Shahidur ^{[1
]}

机构：

[1] Shahjalal Univ Sci & Technol, Dept Comp Sci & Engn, Sylhet 3114, Bangladesh

来源：

ACOUSTICAL SCIENCE AND TECHNOLOGY | 2021年 / 42卷 / 02期

关键词：

Modified spectrum; Phase domain analysis; Spectrogram; GD spectrum; Glottal formant effect; SYSTEM;

D O I：

10.1250/ast.42.93

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The magnitude spectrum is a popular mathematical tool for speech signal analysis. In this paper, we propose a new technique for improving the performance of the magnitude spectrum by utilizing the benefits of the group delay (GD) spectrum to estimate the characteristics of a vocal tract accurately. The traditional magnitude spectrum suffers from difficulties when estimating vocal tract characteristics, particularly for high-pitched speech owing to its low resolution and high spectral leakage. After phase domain analysis, it is observed that the GD spectrum has low spectral leakage and high resolution for its additive property. Thus, the magnitude spectrum modified with its GD spectrum, referred to as the modified spectrum, is found to significantly improve the estimation of formant frequency over traditional methods. The accuracy is tested on synthetic vowels for a wide range of fundamental frequencies up to the high-pitched female speaker range. The validity of the proposed method is also verified by inspecting the formant contour of an utterance from the Texas Instruments and Massachusetts Institute of Technology (TIMIT) database and standard F2-F1 plot of natural vowel speech spoken by male and female speakers. The result is compared with two state-of-the-art methods. Our proposed method performs better than both of these two methods.

引用

页码：93 / 102

页数：10

共 50 条

[1] Multipath delay estimation using the magnitude spectrum
Hickman, Granger
Krolik, Jeffrey
2005 IEEE/SP 13TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), VOLS 1 AND 2, 2005, : 615 - 620
[2] Improved estimation of evolutionary spectrum based on short time Fourier transforms and modified magnitude group delay by signal decomposition
Lakshminarayana, H.K.
Bhat, J.S.
Mahesh, H.M.
World Academy of Science, Engineering and Technology, 2009, 35 : 800 - 811
[3] Formant estimation of high-pitched noisy speech using homomorphic deconvolution of higher-order group delay spectrum
Chowdhury, Husne Ara
Rahman, Mohammad Shahidur
ACOUSTICAL SCIENCE AND TECHNOLOGY, 2023, 44 (02) : 84 - 92
[4] Estimation of Glottal Closure Instants from Telephone Speech using a Group Delay-Based Approach that Considers Speech Signal as a Spectrum
Rachel, G. Anushiya
Vijayalakshmi, P.
Nagarajan, T.
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1181 - 1185
[5] Time Delay Estimation for Speech Signal Based on FOC-Spectrum
Liu, Hong
Li, Xiaofei
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1730 - 1733
[6] Estimation of Evolutionary spectrum based on STFT and modified group delay
Narasimhan, SV
Pavanalatha, S
IEEE TENCON 2003: CONFERENCE ON CONVERGENT TECHNOLOGIES FOR THE ASIA-PACIFIC REGION, VOLS 1-4, 2003, : 1199 - 1203
[7] Blind Channel Magnitude Response Estimation in Speech Using Spectrum Classification
Gaubitch, Nikolay D.
Brookes, Mike
Naylor, Patrick A.
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (10): : 2162 - 2171
[8] SIGNIFICANCE OF GROUP DELAY FUNCTIONS IN SPECTRUM ESTIMATION
YEGNANARAYANA, B
MURTHY, HA
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1992, 40 (09) : 2281 - 2289
[9] MAGNITUDE SPECTRUM SPEECH HIDING
Rabie, Tamer
Guerchi, Driss
ICSPC: 2007 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1-3, PROCEEDINGS, 2007, : 1147 - 1150
[10] Simultaneous Speech Detection and Magnitude Squared Spectrum Estimation Approach for Speech Enhancement
Han, Ruirui
Ou, Shifeng
Liu, Wei
Chen, Chen
Zhang, Shuo
PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 281 - 285

← 1 2 3 4 5 →