Power Spectrum Difference Teager Energy Features for Speech Recognition in Noisy Environment

Cited by: 0

Authors:

Nehe, N. S. [1]

Holambe, R. S. [1]

Affiliations:

[1] SGGS Inst Engn & Technol, Dept Instrumentat Engn, Nanded, MS, India

Source:

IEEE REGION 10 COLLOQUIUM AND THIRD INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS, VOLS 1 AND 2 | 2008

Keywords:

Isolated word recognition; Teager Energy Operator; Power Spectrum;

DOI:

Not available

Chinese Library Classification:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Feature extraction in noisy conditions is one of the most important issues in speech recognition systems. There are two dominant approaches to acoustic measurement. The first is the parametric approach in the temporal domain, such as Linear Prediction (LP); the second is the nonparametric approach in the frequency domain, such as Mel Frequency Cepstral Coefficients (MFCC), which is based on the human auditory perception system. It is widely accepted that incorporating perceptual information in the feature extraction process leads to improved accuracy and robustness. MFCC is widely used because of its low complexity and good performance for Automatic Speech Recognition (ASR) in clean environments. In this paper, features derived from the Power Spectrum Difference (PSD) and the Teager Energy Operator (TEO), abbreviated as PSDTE-MFCC, are proposed to improve the robustness of a speech recognizer in the presence of white noise. The noise filtering capability of the TEO and the noise reduction due to the PSD improve the performance of the proposed features in noisy environments. We demonstrate the effectiveness of the newly derived feature set for Isolated Word Recognition (IWR) in a noisy environment. The results are compared using a Hidden Markov Model (HMM) and are found to be superior to those of MFCC.


Pages: 223-227

Number of pages: 5

