Voiced/unvoiced/silence classification of speech using 2-stage neural networks with delayed decision input

被引:0
|
作者
Ahn, R
Holmes, WH
机构
来源
ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2 | 1996年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a two stage feed-forward neural network classifier capable of determining voiced, unvoiced and silence in the first stage and refining unvoiced and silence decisions in the second stage. Delayed decision from the previous frame's classification along with preliminary decision by the first stage network, zero-crossing ratio and energy ratio enables the second stage to correct the mistakes made by the first stage in classifying unvoiced and silence frames. Comparisons with a single stage classifier demonstrates the necessity of two stage classification techniques. It also shows that the proposed classifier performs excellently.
引用
收藏
页码:389 / 390
页数:2
相关论文
共 50 条
  • [1] VOICED UNVOICED SILENCE CLASSIFICATION OF SPEECH SIGNALS BASED ON STATISTICAL APPROACHES
    ALHASHEMY, BAR
    TAHA, SMR
    APPLIED ACOUSTICS, 1988, 25 (03) : 169 - 179
  • [2] Speech enhancement using voiced/unvoiced classification
    Lachiri, Z
    Ellouze, N
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS, AND INFORMATICS, VOL XVI, PROCEEDINGS, 2004, : 345 - 349
  • [3] On a Classification of Voiced/Unvoiced by using SNR for Speech Recognition
    Kim, Jongkuk
    Hahn, Hernsoo
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND ELECTRONICS INFORMATION (ICACSEI 2013), 2013, 41 : 472 - 476
  • [4] Voiced-unvoiced-silence speech sound classification based on unsupervised learning
    Deng, Huiqun
    O'Shaughnessy, Douglas
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 176 - 179
  • [5] PATTERN-RECOGNITION APPROACH TO VOICED UNVOICED SILENCE CLASSIFICATION WITH APPLICATIONS TO SPEECH RECOGNITION
    ATAL, BS
    RABINER, LR
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (03): : 201 - 212
  • [6] Robust voiced/unvoiced speech classification using fuzzy rules
    Beritelli, F
    Casale, S
    1997 IEEE WORKSHOP ON SPEECH CODING FOR TELECOMMUNICATIONS, PROCEEDINGS: BACK TO BASICS: ATTACKING FUNDAMENTAL PROBLEMS IN SPEECH CODING, 1997, : 5 - 6
  • [7] Speech De-noising using Wavelet based Methods with Focus on Classification of Speech into Voiced, Unvoiced and Silence Regions
    Baishya, Anamika
    Kumar, Priyatam
    2018 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2018, : 419 - 424
  • [8] A novel two-step SVM classifier for voiced/unvoiced/silence classification of speech
    Qi, FY
    Bao, CC
    Liu, Y
    2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 77 - 80
  • [9] DIRECTLY MODELING VOICED AND UNVOICED COMPONENTS IN SPEECH WAVEFORMS BY NEURAL NETWORKS
    Tokuda, Keiichi
    Zen, Heiga
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5640 - 5644
  • [10] Voiced-Unvoiced Classification of Speech Using a Neural Network Trained with LPC Coefficients
    Struwe, Kevin
    2017 INTERNATIONAL CONFERENCE ON CONTROL, ARTIFICIAL INTELLIGENCE, ROBOTICS & OPTIMIZATION (ICCAIRO), 2017, : 56 - 59