CAM: CONTEXT-AWARE MASKING FOR ROBUST SPEAKER VERIFICATION

被引:12
|
作者
Yu, Ya-Qi [1 ]
Zheng, Siqi [2 ]
Suo, Hongbin [2 ]
Lei, Yun [2 ]
Li, Wu-Jun [1 ]
机构
[1] Nanjing Univ, Dept Comp Sci & Technol, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Alibaba Grp, Speech Lab, Hangzhou, Peoples R China
来源
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021年
关键词
Speaker verification; speech enhancement; context embedding; context-aware masking; FEATURE ENHANCEMENT;
D O I
10.1109/ICASSP39728.2021.9414704
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Performance degradation caused by noise has been a long-standing challenge for speaker verification. Previous methods usually involve applying a denoising transformation to speaker embeddings or enhancing input features. Nevertheless, these methods are lossy and inefficient for speaker embedding. In this paper, we propose context-aware masking (CAM), a novel method to extract robust speaker embedding. CAM enables the speaker embedding network to "focus" on the speaker of interest and "blur" unrelated noise. The threshold of masking is dynamically controlled by an auxiliary context embedding that captures speaker and noise characteristics. Moreover, models adopting CAM can be trained in an end-to-end manner without using synthesized noisy-clean speech pairs. Our results show that CAM improves speaker verification performance in the wild by a large margin, compared to the baselines.
引用
收藏
页码:6703 / 6707
页数:5
相关论文
共 50 条
  • [41] Context-Aware Collector
    Maciel, Carlos A. V., Jr.
    Filho, Jose Anderson S. N.
    Barros, Gabriella A. B.
    Chiu, Thun Pin T. F.
    Tedesco, Patrcia C. A. R.
    da Silva, Fabio Q. B.
    Santos, Andre L. M.
    Cavalcanti, Antonio L. O., Jr.
    Mascaro, Angelica A.
    2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 2181 - 2186
  • [42] Context-aware aspects
    Tanter, Eric
    Gybels, Kris
    Denker, Marcus
    Bergel, Alexandre
    SOFTWARE COMPOSITION, 2006, 4089 : 227 - 242
  • [43] Context-Aware IPTV
    Song, Songbo
    Moustafa, Hassnaa
    Afifi, Hossam
    WIRED-WIRELESS MULTIMEDIA NETWORKS AND SERVICES MANAGEMENT, 2009, 5842 : 189 - +
  • [44] The Context-Aware Browser
    Coppola, Paolo
    Della Mea, Vincenzo
    Di Gaspero, Luca
    Menegon, Davide
    Mischis, Danny
    Mizzaro, Stefano
    Scagnetto, Ivan
    Vassena, Luca
    IEEE INTELLIGENT SYSTEMS, 2010, 25 (01) : 38 - 47
  • [45] Towards context-aware collaborative filtering by learning context-aware latent representations
    Liu, Xin
    Zhang, Jiyong
    Yan, Chenggang
    KNOWLEDGE-BASED SYSTEMS, 2020, 199
  • [46] Context-aware regulation of context-aware mobile services in pervasive computing environments
    Syukur, Evi
    Loke, Seng Wai
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2006, PT 4, 2006, 3983 : 138 - 147
  • [47] DISENTANGLED SPEAKER EMBEDDING FOR ROBUST SPEAKER VERIFICATION
    Yi, Lu
    Mak, Man-Wai
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7662 - 7666
  • [48] Occlusion-robust workflow recognition with context-aware compositional ConvNet
    Zhang, Min
    Hu, Haiyang
    Li, Zhongjin
    Chen, Jie
    SOFT COMPUTING, 2024, 28 (06) : 5125 - 5135
  • [49] Occlusion-robust workflow recognition with context-aware compositional ConvNet
    Min Zhang
    Haiyang Hu
    Zhongjin Li
    Jie Chen
    Soft Computing, 2024, 28 : 5125 - 5135
  • [50] Discovering and exploiting causal dependencies for robust mobile context-aware recommenders
    Yap, Ghim-Eng
    Tan, Ah-Hwee
    Pang, Hwee-Hwa
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (07) : 977 - 992