Robust and High-resolution Voiced/Unvoiced Classification in Noisy Speech Using A Signal Smoothness Criterion

被引：0

作者：

Murthy, A. Sreenivasa ^{[1
]}

Sekhar, S. Chandra ^{[1
]}

Sreenivas, T. V. ^{[1
]}

机构：

[1] Indian Inst Sci, Dept Elect Commun Engn, Bangalore 560012, Karnataka, India

来源：

INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 | 2007年

关键词：

voiced; unvoiced; local polynomial model; regression; signal-to-noise ratio;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a novel technique for robust voiced/unvoiced segment detection in noisy speech, based on local polynomial regression. The local polynomial model is well-suited for voiced segments in speech. The unvoiced segments are noise-like and do not exhibit any smooth structure. This property of smoothness is used for devising a new metric called the variance ratio metric, which, after thresholding, indicates the voiced/unvoiced boundaries with 75% accuracy for 0dB global signal-to-noise ratio (SNR). A novelty of our algorithm is that it processes the signal continuously, sample-by-sample rather than frame-by-frame. Simulation results on TIMIT speech database (downsampled to 8kHz) for various SNRs are presented to illustrate the performance of the new algorithm. Results indicate that the algorithm is robust even in high noise levels.

引用

页码：2260 / 2263

页数：4

共 3 条

[1] On a Classification of Voiced/Unvoiced by using SNR for Speech Recognition
Kim, Jongkuk
Hahn, Hernsoo
PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND ELECTRONICS INFORMATION (ICACSEI 2013), 2013, 41 : 472 - 476
[2] Object Extraction From Very High-Resolution Images Using a Convolutional Neural Network Based on a Noisy Large-Scale Dataset
Li, Panle
He, Xiaohui
Cheng, Xijie
Gao, Xu
Li, Runchuan
Qia, Mengjia
Li, Daidong
Qiu, Fangbing
Li, Zhiqiang
IEEE ACCESS, 2019, 7 : 122784 - 122795
[3] Revisiting DCE-MRI Classification of Prostate Tissue Using Descriptive Signal Enhancement Features Derived From DCE-MRI Acquisition With High Spatiotemporal Resolution
Breit, Hanns C.
Block, Tobias K.
Winkel, David J.
Gehweiler, Julian E.
Glessgen, Carl G.
Seifert, Helge
Wetterauer, Christian
Boll, Daniel T.
Heye, Tobias J.
INVESTIGATIVE RADIOLOGY, 2021, 56 (09) : 553 - 562

← 1 →