Spectral subtraction and RASTA-filtering in text-dependent HMM-based speaker verification

Cited by: 0

Authors:

Hardt, D

Fellbaum, K

Affiliation:

Source:

1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS | 1997

Keywords:

DOI:

Not available

Chinese Library Classification:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

In real text-dependent, telephone-based speaker verification systems, both additive and convolutional noise considerably influence the error rate. In this paper, different procedures that make a speaker verification system more robust against noise are compared. We use either spectral subtraction in addition to MFCC feature extraction, or PLP and RASTA-PLP alone (without spectral subtraction). For spectral subtraction, two variants were examined: one placed in front of the system as a preprocessing stage, and a second integrated into the MFCC computation. The first variant has the advantage that its window length can be chosen independently of that of the MFCC procedure, which led to better results. However, the most effective procedure for telephone speech data is J-RASTA-PLP, although estimating the optimal J factor is difficult. At first we used a fixed J factor based on an off-line measurement of the noise power. Finally, we performed some experiments to optimize the system by adaptively estimating the J factor during the utterance. This procedure is based on the method of spectral mapping, which has been shown to be very effective in automatic speech recognition.
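As an illustration of why RASTA-style processing helps against convolutional noise, the following minimal sketch filters the time trajectory of one log-spectral band. It is not the authors' implementation. The filter coefficients follow the commonly cited RASTA transfer function (numerator 0.1 * (2, 1, 0, -1, -2), single pole at 0.98), applied here in causal form, and are an assumption rather than values taken from this abstract. The function names `rasta_filter` and `j_rasta_compress` are hypothetical. The J-RASTA nonlinearity is taken to be ln(1 + J*x), which is close to linear for small J*x and close to logarithmic for large J*x.

```python
import math


def rasta_filter(trajectory, pole=0.98):
    """Band-pass filter one log-spectral trajectory (one band over time).

    Assumed coefficients: numerator 0.1 * (2, 1, 0, -1, -2), one pole.
    The numerator sums to zero, so a constant offset is suppressed.
    """
    numerator = (0.2, 0.1, 0.0, -0.1, -0.2)
    output = []
    previous = 0.0
    for n in range(len(trajectory)):
        fir = 0.0
        for k, b in enumerate(numerator):
            if n - k >= 0:
                fir += b * trajectory[n - k]
        # Recursive part: y[n] = pole * y[n-1] + FIR(x)[n]
        previous = pole * previous + fir
        output.append(previous)
    return output


def j_rasta_compress(power, j):
    """Assumed J-RASTA nonlinearity: ln(1 + J * x) for a band power x."""
    return math.log(1.0 + j * power)


if __name__ == "__main__":
    # Hypothetical 4 Hz modulation sampled at 100 frames per second.
    clean = [math.sin(2.0 * math.pi * 4.0 * n / 100.0) for n in range(600)]
    # A fixed channel gain becomes a constant offset in the log domain.
    with_channel = [value + 3.0 for value in clean]
    a = rasta_filter(clean)
    b = rasta_filter(with_channel)
    print("residual channel offset after filtering:", abs(a[-1] - b[-1]))
```

Because the filter is linear and has zero gain at DC, the constant log-domain offset introduced by a fixed channel decays after the initial transient, while the speech-rate modulation passes through. With the J-RASTA nonlinearity, the choice of J determines how much of the signal is treated in the near-linear region (suited to additive noise) versus the near-logarithmic region (suited to convolutional noise), which is why its estimation matters.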


Pages: 867-870

Number of pages: 4
