Voiced/unvoiced classification and pitch period detection algorithm based on wavelet transform

被引:0
|
作者
College of Information Science and Engineering, Central South University, Changsha 410083, China [1 ]
机构
[1] College of Information Science and Engineering, Central South University
来源
Dianzi Yu Xinxi Xuebao | 2008年 / 2卷 / 353-356期
关键词
Pitch detection; Spatial correlation function; Teager Energy Operator (TEO); Wavelet transform;
D O I
10.3724/sp.j.1146.2006.01066
中图分类号
学科分类号
摘要
This paper proposes a robust pitch period detection method based on wavelet transformation. A Voiced Regions Detection (VRD) algorithm based on wavelet transform and Teager energy operator is proposed firstly. Then an algorithm based on spatial correlation function for estimating pitch frequency only in voiced regions is presented. Experiments show that this algorithm has a better robustness and more precision compared with the classical wavelet-based methods and auto-correlated function (ACF).
引用
收藏
页码:353 / 356
页数:3
相关论文
共 9 条
  • [1] Rabiner L.R., Cheng M.J., Rosenberg A.E., A comparative performance study of several pitch detection algorithms, IEEE Trans. on Acoustics, Speech, and Signal Processing, ASSP-24, 5, pp. 399-418, (1976)
  • [2] Mallat S., Zhong S., Characterization of signals from multiscale edges, IEEE Trans. on PAMI, 14, 7, pp. 710-732, (1992)
  • [3] Kadambe S., Boudreaux-Bartels G.F., Application of the wavelet transform for pitch detection of speech signal, IEEE Trans. on Information Theory, 38, 2, pp. 917-924, (1992)
  • [4] Wang C.-F., Dai B., Pitch period extraction of speech signals based on estimation of glottal closure intants, Journal of China University of Science and Technology, 28, 3, pp. 362-367, (1998)
  • [5] Obaidat M.S., Lee C., Sadoun B., Nelson D., Estimation of pitch period of speech signal using a new dyadic wavelet algorithm, Information Sciences, 119, 122, pp. 21-39, (1999)
  • [6] Kaiser J.F., On a simple algorithm to calculate the energy of a signal, Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP'90), pp. 381-384, (1990)
  • [7] Bahoura M., Rouat J., Wavelet speech enhancement based on the Teager energy operator, IEEE. Signal Processing Letters, 8, 1, pp. 10-12, (2001)
  • [8] Xu Y., Wavelet transform domain filters: A spatially selective noise filtration technique, IEEE Trans. on Image Processing, 3, 6, pp. 747-758, (1994)
  • [9] Wang Y., Zhao L., Pitch frequency estimation of Chinese multi-syllable words based on wavelet transform, Proceedings of the Third International Conference on Machine Learning and Cybernetics, pp. 26-29, (2004)