Robust speaker verification via fusion of speech and lip modalities

被引:13
|
作者
Wark, T [1 ]
Sridharan, S [1 ]
Chandran, V [1 ]
机构
[1] Queensland Univ Technol, Sch Elect Elect & Syst Engn, Speech Res Lab, Brisbane, Qld 4001, Australia
来源
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI | 1999年
关键词
D O I
10.1109/ICASSP.1999.757487
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper investigates the use of lip information, in conjunction with speech information, for robust speaker verification in the presence of background noise. It has been previously shown in our own work, and in the work of others, that features extracted from a speaker's moving lips hold speaker dependencies which are complementary with speech features. We demonstrate that the fusion of lip and speech information allows for a highly robust speaker verification system which outperforms the performance of either sub-system. We present a new technique for determining the weighting to be applied to each modality so as to optimize the performance of the fused system. Given a correct weighting, Lip information is shown to be highly effective for reducing the false acceptance and false rejection error rates in the presence of background noise.
引用
收藏
页码:3061 / 3064
页数:4
相关论文
共 50 条
  • [1] Robust speaker verification via fusion of speech and lip modalities
    Wark, T.
    Sridharan, S.
    Chandran, V.
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 6 : 3061 - 3064
  • [2] The use of speech and lip modalities for robust speaker verification under adverse conditions
    Wark, TJ
    Sridharan, S
    Chandran, V
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 1, 1999, : 812 - 816
  • [3] Use of speech and lip modalities for robust speaker verification under adverse conditions
    Queensland Univ of Technology, Brisbane
    Int Conf Multimedia Comput Syst Proc, (812-816):
  • [4] Adaptive fusion of speech and lip information for robust speaker identification
    Wark, T
    Sridharan, S
    DIGITAL SIGNAL PROCESSING, 2001, 11 (03) : 169 - 186
  • [5] Audiovisual Speaker Identification Based on Lip and Speech Modalities
    Chelali, Fatma
    Djeradi, Amar
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2017, 14 (01) : 99 - 110
  • [6] Attentive Feature Fusion for Robust Speaker Verification
    Liu, Bei
    Chen, Zhengyang
    Qian, Yanmin
    INTERSPEECH 2022, 2022, : 286 - 290
  • [7] A Fused Speech Enhancement Framework for Robust Speaker Verification
    Wu, Yanfeng
    Li, Taihao
    Zhao, Junan
    Wang, Qirui
    Xu, Jing
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 883 - 887
  • [8] Robust speaker verification from GSM-transcoded speech based on decision fusion and feature transformation
    Mak, MW
    Cheung, MC
    Kung, SY
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 745 - 748
  • [9] Channel robust speaker verification via feature mapping
    Reynolds, DA
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 53 - 56
  • [10] Shouted Speech Compensation for Speaker Verification Robust to Vocal Effort Conditions
    Prieto, Santi
    Ortega, Alfonso
    Lopez-Espejo, Ivan
    Lleida, Eduardo
    INTERSPEECH 2020, 2020, : 1511 - 1515