Power Spectrum Difference Teager Energy Features for Speech Recognition in Noisy Environment

被引:0
|
作者
Nehe, N. S. [1 ]
Holambe, R. S. [1 ]
机构
[1] SGGS Inst Engn & Technol, Dept Instrumentat Engn, Nanded, MS, India
来源
IEEE REGION 10 COLLOQUIUM AND THIRD INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS, VOLS 1 AND 2 | 2008年
关键词
Isolated word recognition; Teager Energy Operator; Power Spectrum;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature extraction in noisy condition is one of the most important issues in the speech recognition system. There are two dominant approaches of acoustic measurement. First is in Temporal domain called parametric approach like Linear Prediction (LP) and second is in Frequency domain called nonparametric approach like Mel Frequency Cepstral Coefficients (MFCC) based on human auditory perception system. It is widely accepted that incorporating perceptual information in the feature extraction process leads to improve accuracy and robustness. MFCC is widely used due to low complexity, good performance for Automatic Speech Recognition (ASR) under clean environment. In this paper features derived from the Power Spectrum Difference (PSD) and Teager Energy Operator (TEO) abbreviated as PSDTE-MFCC have been proposed to improve the robustness of speech recognizer in presence of white noise. Noise filtering capability of TEO and noise reduction due to PSD improves the performance of proposed features in noisy environment. We demonstrate the effectiveness of the newly derived feature set for Isolated Word Recognition (IWR) in noisy environment. The results are compared using Hidden Markov Model (HMM) and found superior than MFCC.
引用
收藏
页码:223 / 227
页数:5
相关论文
共 50 条
  • [1] Mel Frequency Teager Energy Features for Isolate Word Recognition in Noisy Environment
    Nehe, N. S.
    Holambe, R. S.
    2009 SECOND INTERNATIONAL CONFERENCE ON EMERGING TRENDS IN ENGINEERING AND TECHNOLOGY (ICETET 2009), 2009, : 300 - +
  • [2] Speech Emotion Recognition Using Non-Linear Teager Energy Based Features in Noisy Environments
    Georgogiannis, Alexandros
    Digalakis, Vassilis
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2045 - 2049
  • [3] Robust speech recognition in noisy backgrounds based on teager energy operator and auditory process
    Zhao, JH
    Kuang, JM
    Dai, QH
    CONFERENCE RECORD OF THE THIRTY-SEVENTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 2003, : 550 - 554
  • [4] Emotion Recognition of Stressed Speech using Teager Energy and Linear Prediction Features
    Reddy, Surekha B.
    Kumar, T. Kishore
    2018 IEEE 18TH INTERNATIONAL CONFERENCE ON ADVANCED LEARNING TECHNOLOGIES (ICALT 2018), 2018, : 422 - 425
  • [5] AUTOMATIC EMOTION RECOGNITION IN SPEECH SIGNAL USING TEAGER ENERGY OPERATOR AND MFCC FEATURES
    He, Ling
    Lech, Margaret
    Allen, Nicholas
    2011 3RD INTERNATIONAL CONFERENCE ON COMPUTER TECHNOLOGY AND DEVELOPMENT (ICCTD 2011), VOL 3, 2012, : 695 - 699
  • [6] Speech emotion recognition in noisy environment
    Chenchah, Farah
    Lachiri, Zied
    2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016, : 788 - 792
  • [7] Study of speech recognition in noisy environment
    Kreisinger, T
    Pollak, P
    Sovka, P
    Uhlir, J
    SIGNAL ANALYSIS & PREDICTION I, 1997, : 334 - 337
  • [8] SPEECH RECOGNITION IN THE NOISY CAR ENVIRONMENT
    RUEHL, HW
    DOBLER, S
    WEITH, J
    MEYER, P
    NOLL, A
    HAMER, HH
    PIOTROWSKI, H
    SPEECH COMMUNICATION, 1991, 10 (01) : 11 - 22
  • [9] Voice Activity Detection Algorithm Based on the Power Spectral Deviation of Teager Energy in Noisy Environment
    Park, Yun Sik
    An, Hong Sub
    Lee, Sangmin
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2011, 30 (07): : 396 - 401
  • [10] Teager Energy Subband Filtered Features for Near and Far-Field Automatic Speech Recognition
    Kamble, Madhu R.
    Nayak, Shekhar
    Shaik, M. Ali Basha
    Rath, Shakti P.
    Vij, Vikram
    Patil, Hemant A.
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 491 - 496