Noise robust estimate of speech dynamics for speaker recognition

被引：0

作者：

Openshaw, JP

Mason, JS

机构：

来源：

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper investigates the robustness of cepstral based features with respect to additive noise, and details two methods of increasing the robustness with minimal need for o-priori knowledge of the noise statistics. The first approach is a form of noise masking which adds a fixed offset to the linear spectral estimate. The second is a form of sub-band filtering, again in the linear domain, which estimates the dynamic content of the speech using Fourier transforms. This avoids negative values normally inherent in such filtering and which presents difficulties in deriving log estimates. Both methods are shown to provide useful levels of robustness to additive noise, for example, speaker identification error rates in SNR mis-matched conditions of 15 dB are reduced from 60.5% for standard mel cepstra to 13.8% and 24.1% for the two approaches respectively.

引用

页码：925 / 928

页数：4

共 50 条

[11] Noise robust speaker identification for spontaneous Arabic speech
Graciarena, Martin
Kajarekar, Sachin
Stolcke, Andreas
Shriberg, Elizabeth
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 245 - +
[12] Channel and speaker adaptation techniques for robust speech recognition
Chen, Jingdong
Yao, Lei
Huang, Taiyi
Shengxue Xuebao/Acta Acustica, 1998, 23 (06): : 537 - 544
[13] Robust Digital Speech Watermarking For Online Speaker Recognition
Nematollahi, Mohammad Ali
Gamboa-Rosales, Hamurabi
Akhaee, Mohammad Ali
Al-Haddad, S. A. R.
MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
[14] Channel Robust MFCCs for Continuous Speech Speaker Recognition
Chougule, Sharada Vikram
Chavan, Mahesh S.
ADVANCES IN SIGNAL PROCESSING AND INTELLIGENT RECOGNITION SYSTEMS, 2014, 264 : 557 - 568
[15] Robust speech recognition with speaker localization by a microphone array
Yamada, T
Nakamura, S
Shikano, K
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1317 - 1320
[16] An Integrated Approach to Robust Speaker Identification and Speech Recognition
Kwan, C.
Yin, J.
Ayhan, B.
Chu, S.
Liu, X.
Puckett, K.
Zhao, Y.
Ho, K. C.
Kruger, M.
Sityar, I.
2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 1635 - +
[17] Adaptive wavelet shrinkage for noise robust speaker recognition
Govindan, Sumithra Manimegalai
Duraisamy, Prakash
Yuan, Xiaohui
DIGITAL SIGNAL PROCESSING, 2014, 33 : 180 - 190
[18] Noise Robust Speaker Recognition with Convolutive Sparse Coding
Hurmalainen, Antti
Saeidi, Rahim
Virtanen, Tuomas
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 244 - 248
[19] SHORT-TIMED SPEECH DYNAMICS FOR SPEAKER RECOGNITION
LI, H
HATON, JP
SU, J
ELECTRONICS LETTERS, 1995, 31 (17) : 1416 - 1418
[20] Unsupervised speaker adaptation for robust speech recognition in real environments
Yamade, S
Baba, A
Yoshikawa, S
Lee, A
Saruwatari, H
Shikano, K
ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2005, 88 (08): : 30 - 41

← 1 2 3 4 5 →