A PITCH BASED NOISE ESTIMATION TECHNIQUE FOR ROBUST SPEECH RECOGNITION WITH MISSING DATA

Cited by: 0

Authors:

Morales-Cordovilla, Juan A. [1]

Ma, Ning [2]

Sanchez, Victoria [1]

Carmona, Jose L. [1]

Peinado, Antonio M. [1]

Barker, Jon [2]

Affiliations:

[1] Univ Granada, Dept Teoria Senal Telemat & Comunicac, E-18071 Granada, Spain

[2] Univ Sheffield, Dept Comp Sci, Sheffield, South Yorkshire, England

Source:

2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2011

Funding:

Engineering and Physical Sciences Research Council (EPSRC), UK;

Keywords:

Robust speech recognition; missing data; noise estimation; VAD; harmonic tunnelling;

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

This paper presents a noise estimation technique based on knowledge of pitch information for robust speech recognition. In the first stage, the noise is estimated by extrapolating it from frames where speech is believed to be absent. These frames are detected with a proposed pitch-based VAD (Voice Activity Detector). In the second stage, the noise estimate is revised in voiced frames using a harmonic tunnelling technique. At high SNRs, the tunnelling noise estimate is used as an upper bound on the noise rather than as a suitable estimate. A spectrogram MD (Missing Data) recognition system is chosen to evaluate the proposed noise estimation. The proposed system is compared on Aurora-2 with other similar techniques such as cepstral SS (Spectral Subtraction).


Pages: 4808-4811

Page count: 4
