Robust and High-resolution Voiced/Unvoiced Classification in Noisy Speech Using A Signal Smoothness Criterion

被引:0
|
作者
Murthy, A. Sreenivasa [1 ]
Sekhar, S. Chandra [1 ]
Sreenivas, T. V. [1 ]
机构
[1] Indian Inst Sci, Dept Elect Commun Engn, Bangalore 560012, Karnataka, India
来源
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 | 2007年
关键词
voiced; unvoiced; local polynomial model; regression; signal-to-noise ratio;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel technique for robust voiced/unvoiced segment detection in noisy speech, based on local polynomial regression. The local polynomial model is well-suited for voiced segments in speech. The unvoiced segments are noise-like and do not exhibit any smooth structure. This property of smoothness is used for devising a new metric called the variance ratio metric, which, after thresholding, indicates the voiced/unvoiced boundaries with 75% accuracy for 0dB global signal-to-noise ratio (SNR). A novelty of our algorithm is that it processes the signal continuously, sample-by-sample rather than frame-by-frame. Simulation results on TIMIT speech database (downsampled to 8kHz) for various SNRs are presented to illustrate the performance of the new algorithm. Results indicate that the algorithm is robust even in high noise levels.
引用
收藏
页码:2260 / 2263
页数:4
相关论文
共 3 条
  • [1] On a Classification of Voiced/Unvoiced by using SNR for Speech Recognition
    Kim, Jongkuk
    Hahn, Hernsoo
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND ELECTRONICS INFORMATION (ICACSEI 2013), 2013, 41 : 472 - 476
  • [2] Object Extraction From Very High-Resolution Images Using a Convolutional Neural Network Based on a Noisy Large-Scale Dataset
    Li, Panle
    He, Xiaohui
    Cheng, Xijie
    Gao, Xu
    Li, Runchuan
    Qia, Mengjia
    Li, Daidong
    Qiu, Fangbing
    Li, Zhiqiang
    IEEE ACCESS, 2019, 7 : 122784 - 122795
  • [3] Revisiting DCE-MRI Classification of Prostate Tissue Using Descriptive Signal Enhancement Features Derived From DCE-MRI Acquisition With High Spatiotemporal Resolution
    Breit, Hanns C.
    Block, Tobias K.
    Winkel, David J.
    Gehweiler, Julian E.
    Glessgen, Carl G.
    Seifert, Helge
    Wetterauer, Christian
    Boll, Daniel T.
    Heye, Tobias J.
    INVESTIGATIVE RADIOLOGY, 2021, 56 (09) : 553 - 562