Spectral subtraction and RASTA-filtering in text-dependent HMM-based speaker verification

被引：0

作者：

Hardt, D

Fellbaum, K

机构：

来源：

1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS | 1997年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In real text-dependent telephone-based speaker verification systems, both, additive and convolutional noise influence the error rate considerably. In this paper, different procedures which make a speaker verification system more robust against noise are compared. We either use the spectral subtraction in addition to the MFCC-feature extraction or only the PLP and RASTA-PLP (without spectral subtraction). Considering spectral subtraction two modifications were examined: one version which was preconnected to the system and a second one being integrated into the MFCC computation. The first version has the advantage that the window length can be chosen independently on those of the MFCC procedure. This led to better results. However, the most effective procedure for telephone speech data is the J-RASTA-PLP, but the estimation of the optimal J factor is difficult. At first we used a fixed J factor based on the off-line measurement of the noise power. Finally, we performed some experiments to optimize the system with the adaptive estimation of the J factor during the utterance. This procedure is based an the method of spectral mapping which has been shown to be very effective in automatic speech recognition.

引用

页码：867 / 870

页数：4

共 50 条

[41] Cohort Selection for Text-dependent Speaker Verification Score Normalization
Khemiri, Houssemeddine
Petrovska-Delacretaz, Dijana
2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016, : 689 - 692
[42] Tandem Features for Text-dependent Speaker Verification on the RedDots Corpus
Alam, Md Jahangir
Kenny, Patrick
Gupta, Vishwa
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 420 - 424
[43] Multi-Task Learning for Text-dependent Speaker Verification
Chen, Nanxin
Qian, Yanmin
Yu, Kai
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 185 - 189
[44] EXPLORING SEQUENTIAL CHARACTERISTICS IN SPEAKER BOTTLENECK FEATURE FOR TEXT-DEPENDENT SPEAKER VERIFICATION
Chen, Liping
Zhao, Yong
Zhang, Shi-Xiong
Li, Jie
Ye, Guoli
Soong, Frank
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5364 - 5368
[45] Addressing Text-Dependent Speaker Verification Using Singing Speech
Shi, Yan
Zhou, Juanjuan
Long, Yanhua
Li, Yijie
Mao, Hongwei
APPLIED SCIENCES-BASEL, 2019, 9 (13):
[46] Integrating DNN–HMM Technique with Hierarchical Multi-layer Acoustic Model for Text-Dependent Speaker Verification
Mohammad Azharuddin Laskar
Rabul Hussain Laskar
Circuits, Systems, and Signal Processing, 2019, 38 : 3548 - 3572
[47] EFFECTS OF GENDER INFORMATION IN TEXT-INDEPENDENT AND TEXT-DEPENDENT SPEAKER VERIFICATION
Kanervisto, Anssi
Vestman, Ville
Sahidullah, Md
Hautamaki, Ville
Kinnunen, Tomi
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5360 - 5364
[48] Weighting scores to improve speaker-dependent threshold estimation in text-dependent speaker verification
Saeta, JR
Hernando, J
NONLINEAR ANALYSES AND ALGORITHMS FOR SPEECH PROCESSING, 2005, 3817 : 81 - 91
[49] Highly Noise Robust Text-dependent Speaker Recognition based on Hypothesized Wiener Filtering
Ramasubramanian, V.
Vijaywargiay, Deepak
Kumar, V. Praveen
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1455 - 1458
[50] Integrating DNN-HMM Technique with Hierarchical Multi-layer Acoustic Model for Text-Dependent Speaker Verification
Laskar, Mohammad Azharuddin
Laskar, Rabul Hussain
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (08) : 3548 - 3572

← 1 2 3 4 5 →