Spectral Multi-scale Product Analysis for Pitch Estimation from Noisy Speech Signal

被引:0
作者
Ben Messaoud, Mohamed Anouar [1 ]
Bouzid, Aicha [1 ]
Ellouze, Noureddine [1 ]
机构
[1] Natl Sch Engineers Tunis, Dept Elect Engn, Tunis 1002, Tunisia
来源
ADVANCES IN NONLINEAR SPEECH PROCESSING | 2010年 / 5933卷
关键词
Speech; wavelet transform; multi-scale product; spectral analysis; fundamental frequency; WAVELET TRANSFORM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we present an algorithm for estimating the fundamental frequency in speech signals. Our approach is based on the spectral multi-scale product analysis. It consists of operating a short Fourier transform on the speech multi-scale product. The multi-scale product is based on making the product of the speech wavelet transform coefficients at three successive dyadic scales. The wavelet used is the quadratic spline function with a support of 0.8 ms. We estimate the pitch for each time frame based on its multi-scale product harmonic structure. We evaluate our approach on the Keele database. Experimental results show the effectiveness of our method presenting a good performance surpassing other algorithms. Besides, the proposed approach is robust for noisy speech.
引用
收藏
页码:95 / 102
页数:8
相关论文
共 17 条