Robust speaker verification via fusion of speech and lip modalities

Cited by: 13

Authors:

Wark, T [1]

Sridharan, S [1]

Chandran, V [1]

Affiliations:

[1] Queensland Univ Technol, Sch Elect Elect & Syst Engn, Speech Res Lab, Brisbane, Qld 4001, Australia

Source:

ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI | 1999

DOI:

10.1109/ICASSP.1999.757487

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

This paper investigates the use of lip information, in conjunction with speech information, for robust speaker verification in the presence of background noise. It has been previously shown in our own work, and in the work of others, that features extracted from a speaker's moving lips hold speaker dependencies which are complementary to speech features. We demonstrate that the fusion of lip and speech information allows for a highly robust speaker verification system which outperforms either sub-system. We present a new technique for determining the weighting to be applied to each modality so as to optimize the performance of the fused system. Given a correct weighting, lip information is shown to be highly effective for reducing the false acceptance and false rejection error rates in the presence of background noise.
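
As a rough illustration of what weighting two modalities can look like, the sketch below fuses one speech score and one lip score with a single weight and compares the result to a threshold. This is not the weighting technique proposed in the paper, which the abstract does not describe. The linear combination, the function names `fuse_scores` and `verify`, and the parameters `speech_weight` and `threshold` are all assumptions made for illustration.

```python
def fuse_scores(speech_score: float, lip_score: float, speech_weight: float) -> float:
    """Linearly combine two per-modality verification scores.

    Assumption: both scores are on a comparable scale (for example,
    normalized log-likelihood ratios) and speech_weight lies in [0, 1].
    """
    if not 0.0 <= speech_weight <= 1.0:
        raise ValueError("speech_weight must lie in [0, 1]")
    # The lip modality receives the remaining weight.
    return speech_weight * speech_score + (1.0 - speech_weight) * lip_score


def verify(speech_score: float, lip_score: float,
           speech_weight: float, threshold: float) -> bool:
    """Accept the claimed identity if the fused score reaches the threshold."""
    return fuse_scores(speech_score, lip_score, speech_weight) >= threshold


if __name__ == "__main__":
    # Hypothetical scores: noisy audio gives a poor speech score,
    # while the lip score is unaffected by acoustic noise.
    print(verify(speech_score=-1.0, lip_score=2.0, speech_weight=0.25, threshold=1.0))
    print(verify(speech_score=-1.0, lip_score=2.0, speech_weight=0.9, threshold=1.0))
```

With these made-up numbers, shifting weight toward the lip score changes the decision, which is the kind of sensitivity to weighting that the abstract refers to.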

Pages: 3061-3064

Page count: 4

