Voiced/unvoiced/silence classification of speech using 2-stage neural networks with delayed decision input

Cited by: 0
Authors
Ahn, R
Holmes, WH
Source
ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2 | 1996
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper proposes a two-stage feed-forward neural network classifier that labels each speech frame as voiced, unvoiced, or silence in the first stage and refines the unvoiced and silence decisions in the second stage. The second stage takes as input the delayed decision from the previous frame's classification, the preliminary decision of the first-stage network, the zero-crossing ratio, and the energy ratio, which enables it to correct first-stage mistakes in classifying unvoiced and silence frames. Comparisons with a single-stage classifier demonstrate the necessity of the two-stage technique and show that the proposed classifier performs very well.
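The data flow described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not define the zero-crossing ratio or the energy ratio, so the definitions below are assumptions, as are the choice to pass voiced decisions through unchanged, the use of the previous frame's final decision as the delayed input, and the initial "silence" state. The functions `toy_stage1` and `toy_stage2` are hypothetical threshold rules standing in for the trained networks.

```python
from typing import Callable, List, Sequence

Frame = Sequence[float]


def zero_crossing_ratio(frame: Frame) -> float:
    """Fraction of adjacent sample pairs whose signs differ (assumed definition)."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def energy_ratio(frame: Frame, reference_energy: float) -> float:
    """Frame energy divided by a reference energy (assumed definition)."""
    energy = sum(x * x for x in frame)
    return energy / reference_energy if reference_energy > 0 else 0.0


def classify_frames(
    frames: Sequence[Frame],
    stage1: Callable[[Frame], str],
    stage2: Callable[[str, str, float, float], str],
    reference_energy: float,
) -> List[str]:
    """Run the two-stage decision flow over a sequence of frames."""
    decisions: List[str] = []
    previous = "silence"  # assumed initial delayed decision
    for frame in frames:
        preliminary = stage1(frame)
        if preliminary == "voiced":
            # Assumption: the second stage only refines unvoiced/silence.
            final = preliminary
        else:
            final = stage2(
                previous,
                preliminary,
                zero_crossing_ratio(frame),
                energy_ratio(frame, reference_energy),
            )
        decisions.append(final)
        previous = final  # delayed decision fed to the next frame
    return decisions


def toy_stage1(frame: Frame) -> str:
    """Hypothetical placeholder for the first-stage network."""
    return "voiced" if sum(x * x for x in frame) > 4.0 else "silence"


def toy_stage2(previous: str, preliminary: str, zcr: float, er: float) -> str:
    """Hypothetical placeholder for the second-stage network (ignores `previous`)."""
    return "unvoiced" if zcr > 0.5 and er > 0.01 else "silence"
```

In a real system, `stage1` and `stage2` would be trained feed-forward networks; the placeholders only make the control flow runnable.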
Pages: 389-390
Page count: 2