Voiced/unvoiced/silence classification of speech using 2-stage neural networks with delayed decision input

Cited by: 0

Authors:

Ahn, R

Holmes, WH

Affiliation:

Source:

ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2 | 1996

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper proposes a two-stage feed-forward neural network classifier that labels speech frames as voiced, unvoiced, or silence in the first stage and refines the unvoiced and silence decisions in the second stage. The delayed decision from the previous frame's classification, together with the preliminary decision of the first-stage network, the zero-crossing ratio, and the energy ratio, enables the second stage to correct mistakes made by the first stage in classifying unvoiced and silence frames. Comparisons with a single-stage classifier demonstrate the necessity of the two-stage classification technique. They also show that the proposed classifier performs very well.
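The data flow described in the abstract can be sketched roughly as follows. This is only an illustrative outline, not the authors' implementation: the abstract gives no network sizes, feature definitions, or encodings, so the names `one_hot`, `second_stage_input`, and `classify`, the one-hot encoding of decisions, the choice to pass voiced frames through unchanged, the use of the previous frame's final decision as the delayed input, and the initial decision of "silence" are all assumptions. The two networks are represented by caller-supplied functions, and the zero-crossing and energy ratios are taken as precomputed numbers.

```python
from typing import Callable, Iterable, List, Sequence, Tuple

LABELS = ("voiced", "unvoiced", "silence")


def one_hot(label: str) -> List[float]:
    """Encode a decision as a 3-element vector (assumed encoding)."""
    return [1.0 if label == name else 0.0 for name in LABELS]


def second_stage_input(prev_decision: str, prelim: str,
                       zc_ratio: float, energy_ratio: float) -> List[float]:
    """Assemble the second-stage input: delayed decision, preliminary
    first-stage decision, zero-crossing ratio, and energy ratio."""
    return one_hot(prev_decision) + one_hot(prelim) + [zc_ratio, energy_ratio]


def classify(frames: Iterable[Tuple[Sequence[float], float, float]],
             stage1: Callable[[Sequence[float]], str],
             stage2: Callable[[List[float]], str],
             initial: str = "silence") -> List[str]:
    """Run both stages over (features, zc_ratio, energy_ratio) tuples.

    stage1 and stage2 are hypothetical stand-ins for the trained networks.
    """
    decisions: List[str] = []
    prev = initial  # assumed delayed decision for the first frame
    for features, zc_ratio, energy_ratio in frames:
        prelim = stage1(features)
        if prelim == "voiced":
            # Assumption: the second stage only revisits unvoiced/silence.
            final = prelim
        else:
            final = stage2(
                second_stage_input(prev, prelim, zc_ratio, energy_ratio))
        decisions.append(final)
        prev = final  # feeds the next frame's delayed-decision input
    return decisions
```

In this sketch the second stage sees an 8-element vector per frame; a real system would replace `stage1` and `stage2` with trained feed-forward networks.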


Pages: 389 / 390

Page count: 2

