Voiced/unvoiced/silence classification of speech using 2-stage neural networks with delayed decision input

Cited by: 0

Authors:

Ahn, R

Holmes, WH

Source:

ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2 | 1996

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Theory of artificial intelligence];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper proposes a two-stage feed-forward neural network classifier that classifies speech frames as voiced, unvoiced, or silence in the first stage and refines the unvoiced and silence decisions in the second stage. The delayed decision from the previous frame's classification, together with the preliminary decision of the first-stage network, the zero-crossing ratio, and the energy ratio, enables the second stage to correct mistakes made by the first stage in classifying unvoiced and silence frames. Comparisons with a single-stage classifier demonstrate the necessity of the two-stage classification technique and show that the proposed classifier performs excellently.
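
The data flow described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not specify the network architectures, the first-stage features, or how the decisions are encoded. The names `classify_frames`, `stage1`, `stage2`, and `one_hot`, the one-hot encoding of decisions, the silence initial state, and the choice to pass voiced decisions through unchanged are all assumptions made for illustration.

```python
from typing import Callable, List, Sequence

import numpy as np

# Hypothetical label encoding; the paper's actual encoding is not given.
VOICED, UNVOICED, SILENCE = 0, 1, 2


def one_hot(label: int) -> np.ndarray:
    """Encode a class label as a 3-element one-hot vector (assumed encoding)."""
    vec = np.zeros(3)
    vec[label] = 1.0
    return vec


def classify_frames(
    stage1_features: Sequence[np.ndarray],
    zc_ratio: Sequence[float],
    energy_ratio: Sequence[float],
    stage1: Callable[[np.ndarray], int],
    stage2: Callable[[np.ndarray], int],
    initial_label: int = SILENCE,
) -> List[int]:
    """Run a two-stage frame classifier with delayed decision feedback.

    stage1 and stage2 stand in for trained feed-forward networks; here they
    are arbitrary callables that return a class label.
    """
    labels: List[int] = []
    prev = initial_label  # delayed decision fed back from the previous frame
    for x, zc, en in zip(stage1_features, zc_ratio, energy_ratio):
        prelim = stage1(x)  # preliminary voiced/unvoiced/silence decision
        if prelim == VOICED:
            # Assumption: voiced decisions are kept as they are.
            final = VOICED
        else:
            # Second-stage input: previous final decision, preliminary
            # decision, zero-crossing ratio, and energy ratio.
            z = np.concatenate([one_hot(prev), one_hot(prelim), [zc, en]])
            final = stage2(z)  # refined unvoiced/silence decision
        labels.append(final)
        prev = final
    return labels
```

Because each frame's second-stage input depends on the previous frame's final decision, the frames have to be processed in order rather than as one batch.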

Pages: 389-390

Page count: 2

