A multifeature voiced/unvoiced decision algorithm for noisy speech

被引:10
作者
Shahnaz, C. [1 ]
Zhu, W. -P. [1 ]
Ahmad, M. O. [1 ]
机构
[1] Concordia Univ, Ctr Commun & Signal Proc, Dept Elect & Comp Engn, Montreal, PQ H3G 1M8, Canada
来源
2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS | 2006年
关键词
D O I
10.1109/ISCAS.2006.1693137
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a new algorithm for the voiced/unvoiced (V/UV) decision of noise-corrupted speech. A speech periodicity-harmonic function (SPHF) is proposed to manifest distinctive characteristics between voiced and unvoiced regions. A composite feature vector is developed by combining a periodicity measure obtained from the SPHF with some energy measures such as zero-crossing rate-weighted RMS energy, Kaiser-Teager frame energy and the normalized low-frequency energy ratio. Unlike the conventional hard threshold, a signal-dependent initial-threshold (SDIT) for each feature is determined based on its statistical properties. The SDIT is exploited to develop a logical expression that returns an objective score regarding V/UV region. Additional voicing criteria are introduced to remove the artifacts that may exist due to the overlapping between decision regions. Simulation results of the proposed multifeature classification scheme, using the Keele reference database, show superior efficacy at a low SNR relative to some of the existing V/UV decision algorithms.
引用
收藏
页码:2525 / +
页数:2
相关论文
共 9 条
  • [1] [Anonymous], 1983, PITCH DETERMINATION, DOI DOI 10.1007/978-3-642-81926-1
  • [2] Arifianto D, 2003, 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS, P812
  • [3] MEYER G, 1995, EUROSPEECH 95, P827
  • [4] MOUSSET E, 1996, P INT C SPOK LANG PR, V2, P1273
  • [5] CEPSTRUM PITCH DETERMINATION
    NOLL, AM
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1967, 41 (02) : 293 - &
  • [6] COMPARATIVE PERFORMANCE STUDY OF SEVERAL PITCH DETECTION ALGORITHMS
    RABINER, LR
    CHENG, MJ
    ROSENBERG, AE
    MCGONEGAL, CA
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (05): : 399 - 418
  • [7] AVERAGE MAGNITUDE DIFFERENCE FUNCTION PITCH EXTRACTOR
    ROSS, MJ
    SHAFFER, HL
    COHEN, A
    FREUDBERG, R
    MANLEY, HJ
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1974, AS22 (05): : 353 - 362
  • [8] SHAHNAZ C, 2005, P ISCAS2005 JAP MAY, P3143
  • [9] Zhang WY, 2002, INT CONF ACOUST SPEE, P341