Voiced/unvoiced/silence classification of speech using 2-stage neural networks with delayed decision input

被引:0
|
作者
Ahn, R
Holmes, WH
机构
来源
ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2 | 1996年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a two stage feed-forward neural network classifier capable of determining voiced, unvoiced and silence in the first stage and refining unvoiced and silence decisions in the second stage. Delayed decision from the previous frame's classification along with preliminary decision by the first stage network, zero-crossing ratio and energy ratio enables the second stage to correct the mistakes made by the first stage in classifying unvoiced and silence frames. Comparisons with a single stage classifier demonstrates the necessity of two stage classification techniques. It also shows that the proposed classifier performs excellently.
引用
收藏
页码:389 / 390
页数:2
相关论文
共 50 条
  • [21] Towards a robust/fast continuous speech recognition system using a voiced-unvoiced decision
    O'Shaughnessy, D
    Tolba, H
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 413 - 416
  • [22] Improved Silence-Unvoiced-Voiced (SUV) Segmentation for Dysarthric Speech Signals using Linear Prediction Error Variance
    Ijitona, Tolulope
    Yue, Hong
    Soraghan, John
    Lowit, Anja
    2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2020), 2020, : 685 - 690
  • [23] Generalized likelihood ratio test for voiced-unvoiced decision in noisy speech using the harmonic model
    Fisher, E
    Tabrikian, J
    Dubnov, S
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (02): : 502 - 510
  • [24] Robust and High-resolution Voiced/Unvoiced Classification in Noisy Speech Using A Signal Smoothness Criterion
    Murthy, A. Sreenivasa
    Sekhar, S. Chandra
    Sreenivas, T. V.
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2260 - 2263
  • [25] Global Soft Decision Based Speech Enhancement Using Voiced-Unvoiced Uncertainty and Harmonic Phase Decomposition Technique
    Samui, Suman
    Chakrabarti, Indrajit
    Ghosh, Soumya Kanti
    2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
  • [26] Vocal tract resonances tracking based on voiced and unvoiced speech classification using dynamic programming and fixed interval Kalman smoother
    Oezbek, I. Yuecel
    Demirekler, Muebeccel
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4217 - 4220
  • [27] Classification of emotional speech by using neural networks
    Sato, H
    Mitsukura, Y
    Fukumi, M
    Akamatsu, N
    KNOWLEDGE-BASED INTELLIGENT INFORMATION ENGINEERING SYSTEMS & ALLIED TECHNOLOGIES, PTS 1 AND 2, 2001, 69 : 1009 - 1014
  • [28] A 2-STAGE CLASSIFICATION SCHEME WITH BACKPROPAGATION NEURAL NETWORK CLASSIFIERS
    CHO, SB
    KIM, JH
    PATTERN RECOGNITION LETTERS, 1992, 13 (05) : 309 - 313
  • [29] SPIKES FILTERING WITH NEURAL NETWORKS - A 2-STAGE DETECTION SYSTEM
    MOUSSET, E
    REVUE DE L INSTITUT FRANCAIS DU PETROLE, 1992, 47 (03): : 407 - 421
  • [30] Probabilistic decision-based neural networks for speech pattern classification
    Yiu, KK
    Mak, MW
    Li, CK
    ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 1378 - 1381