Voiced/unvoiced/silence classification of speech using 2-stage neural networks with delayed decision input

被引：0

作者：

Ahn, R

Holmes, WH

机构：

来源：

ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2 | 1996年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes a two stage feed-forward neural network classifier capable of determining voiced, unvoiced and silence in the first stage and refining unvoiced and silence decisions in the second stage. Delayed decision from the previous frame's classification along with preliminary decision by the first stage network, zero-crossing ratio and energy ratio enables the second stage to correct the mistakes made by the first stage in classifying unvoiced and silence frames. Comparisons with a single stage classifier demonstrates the necessity of two stage classification techniques. It also shows that the proposed classifier performs excellently.

引用

页码：389 / 390

页数：2

共 50 条

[21] Towards a robust/fast continuous speech recognition system using a voiced-unvoiced decision
O'Shaughnessy, D
Tolba, H
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 413 - 416
[22] Improved Silence-Unvoiced-Voiced (SUV) Segmentation for Dysarthric Speech Signals using Linear Prediction Error Variance
Ijitona, Tolulope
Yue, Hong
Soraghan, John
Lowit, Anja
2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2020), 2020, : 685 - 690
[23] Generalized likelihood ratio test for voiced-unvoiced decision in noisy speech using the harmonic model
Fisher, E
Tabrikian, J
Dubnov, S
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (02): : 502 - 510
[24] Robust and High-resolution Voiced/Unvoiced Classification in Noisy Speech Using A Signal Smoothness Criterion
Murthy, A. Sreenivasa
Sekhar, S. Chandra
Sreenivas, T. V.
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2260 - 2263
[25] Global Soft Decision Based Speech Enhancement Using Voiced-Unvoiced Uncertainty and Harmonic Phase Decomposition Technique
Samui, Suman
Chakrabarti, Indrajit
Ghosh, Soumya Kanti
2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
[26] Vocal tract resonances tracking based on voiced and unvoiced speech classification using dynamic programming and fixed interval Kalman smoother
Oezbek, I. Yuecel
Demirekler, Muebeccel
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4217 - 4220
[27] Classification of emotional speech by using neural networks
Sato, H
Mitsukura, Y
Fukumi, M
Akamatsu, N
KNOWLEDGE-BASED INTELLIGENT INFORMATION ENGINEERING SYSTEMS & ALLIED TECHNOLOGIES, PTS 1 AND 2, 2001, 69 : 1009 - 1014
[28] A 2-STAGE CLASSIFICATION SCHEME WITH BACKPROPAGATION NEURAL NETWORK CLASSIFIERS
CHO, SB
KIM, JH
PATTERN RECOGNITION LETTERS, 1992, 13 (05) : 309 - 313
[29] SPIKES FILTERING WITH NEURAL NETWORKS - A 2-STAGE DETECTION SYSTEM
MOUSSET, E
REVUE DE L INSTITUT FRANCAIS DU PETROLE, 1992, 47 (03): : 407 - 421
[30] Probabilistic decision-based neural networks for speech pattern classification
Yiu, KK
Mak, MW
Li, CK
ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 1378 - 1381

← 1 2 3 4 5 →