Overview of speech enhancement techniques for automatic speaker recognition

Cited by: 0

Authors:

Ortega-Garcia, J

Gonzalez-Rodriguez, J

Affiliation:

Source:

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

Real-world conditions differ from ideal or laboratory conditions, causing a mismatch between the training and testing phases and, consequently, degrading the performance of automatic speaker recognition systems [1]. Many strategies have been adopted to cope with acoustical degradation; some applications of speaker identification require a clean speech sample prior to the recognition stage. This has motivated procedures that reduce the impact of acoustical noise on the desired signal, giving rise to techniques for the enhancement of noisy speech [2, 9]. This paper presents a comparative performance analysis of single-channel (based on classical spectral subtraction and some derived alternatives), dual-channel (based on adaptive noise cancelling), and multi-channel (using microphone arrays) speech enhancement techniques, evaluated with different types of noise at different SNRs, as a pre-processing stage to an ergodic HMM-based speaker recognizer.


Pages: 929-932

Number of pages: 4
