Voiced/unvoiced/silence classification of speech using 2-stage neural networks with delayed decision input

Cited by: 0
Authors
Ahn, R
Holmes, WH
Affiliations
Source
ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2 | 1996
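Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes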
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper proposes a two-stage feed-forward neural network classifier that labels speech frames as voiced, unvoiced, or silence in the first stage and refines the unvoiced and silence decisions in the second stage. The delayed decision from the previous frame's classification, together with the preliminary decision of the first-stage network, the zero-crossing ratio, and the energy ratio, enables the second stage to correct mistakes made by the first stage in classifying unvoiced and silence frames. Comparisons with a single-stage classifier demonstrate the necessity of the two-stage classification technique. They also show that the proposed classifier performs excellently.
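The data flow described in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the abstract does not give the network sizes, the first-stage input features, or the exact definitions of the zero-crossing ratio and energy ratio, so both stages are passed in as plain callables and the `toy_stage1` / `toy_stage2` rules and their thresholds are hypothetical stand-ins for trained networks. It also assumes that voiced frames pass through the second stage unchanged and that the "delayed decision" is the previous frame's final label; the abstract does not confirm either point.

```python
from typing import Callable, List, Sequence

VOICED, UNVOICED, SILENCE = "V", "U", "S"

# Stage 1 maps a frame's feature vector to a preliminary V/U/S label.
Stage1 = Callable[[Sequence[float]], str]
# Stage 2 sees (previous decision, preliminary decision, zero-crossing ratio,
# energy ratio) and returns a refined U/S label.
Stage2 = Callable[[str, str, float, float], str]


def classify_frames(
    features: Sequence[Sequence[float]],
    zc_ratios: Sequence[float],
    energy_ratios: Sequence[float],
    stage1: Stage1,
    stage2: Stage2,
    initial_decision: str = SILENCE,  # assumed label before the first frame
) -> List[str]:
    """Run the two stages frame by frame, feeding back the delayed decision."""
    decisions: List[str] = []
    previous = initial_decision
    for feats, zc, en in zip(features, zc_ratios, energy_ratios):
        preliminary = stage1(feats)
        if preliminary == VOICED:
            # Assumption: only unvoiced/silence decisions are refined.
            final = VOICED
        else:
            final = stage2(previous, preliminary, zc, en)
        decisions.append(final)
        previous = final  # becomes the delayed decision for the next frame
    return decisions


def toy_stage1(feats: Sequence[float]) -> str:
    """Hypothetical stand-in for the first network: (energy, zero-crossing) rule."""
    energy, zero_crossing = feats
    if energy > 0.5:
        return VOICED
    return UNVOICED if zero_crossing > 0.3 else SILENCE


def toy_stage2(previous: str, preliminary: str, zc_ratio: float, energy_ratio: float) -> str:
    """Hypothetical stand-in for the second network, using made-up thresholds."""
    if energy_ratio > 1.5 and zc_ratio > 1.0:
        return UNVOICED
    if previous == SILENCE and energy_ratio < 1.1:
        return SILENCE
    return preliminary


if __name__ == "__main__":
    frames = [(0.9, 0.1), (0.1, 0.2), (0.05, 0.4)]
    print(classify_frames(frames, [1.0, 2.0, 0.5], [3.0, 2.0, 1.0], toy_stage1, toy_stage2))
```

In this toy run the second frame is first labelled silence by the stand-in first stage and then changed to unvoiced by the stand-in second stage, which is the kind of correction the abstract attributes to the second stage.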
Pages: 389-390
Number of pages: 2