Noise-Robust Speaker Recognition Based on Morphological Component Analysis

Citations: 0
Authors
He, Yongjun [1]
Chen, Chen [2]
Han, Jiqing [2]
Affiliations
[1] Harbin Univ Sci & Technol, Sch Comp Sci & Technol, Harbin, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin, Peoples R China
Keywords
Morphological component analysis; sparse representation; discriminant dictionary; speaker recognition; verification; representation
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Speaker recognition suffers severe performance degradation in noisy environments. To address this problem, we propose a novel method based on morphological component analysis. The method employs a universal background dictionary (UBD) to model the variability common to all speakers, a speaker-specific speech dictionary to model the variability particular to each speaker, and a noise dictionary to model the variability of environmental noise. These three dictionaries are concatenated into one large dictionary, over which test speech is sparsely represented and classified. To improve the discriminability of the speaker dictionaries, we optimize them by removing speaker atoms that are close to UBD atoms. To track time-varying noise, we design an algorithm that updates the noise dictionary from the noisy speech. Experiments under various noise conditions show that the proposed method clearly improves the robustness of speaker recognition in noisy environments.
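The pipeline described in the abstract (prune speaker atoms that lie close to UBD atoms, concatenate the UBD, speaker and noise dictionaries, sparsely code the test vector, then classify) can be illustrated with a small sketch. This is not the authors' implementation; the abstract does not give the sparse solver, the closeness measure, or the decision rule. The sketch assumes plain matching pursuit as the solver, absolute cosine similarity with a hypothetical threshold `max_cos` as the closeness measure, and the energy of the speaker-block coefficients as the score. All function names are illustrative, and the noise-dictionary update is omitted because the abstract gives no detail on it. Only the Python standard library is used.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def normalize(v):
    norm = math.sqrt(dot(v, v))
    return [a / norm for a in v]

def prune_speaker_atoms(speaker_atoms, ubd_atoms, max_cos=0.9):
    """Keep speaker atoms whose |cosine| to every UBD atom is <= max_cos.

    max_cos is an assumed threshold, not a value from the paper.
    """
    ubd_unit = [normalize(u) for u in ubd_atoms]
    kept = []
    for atom in speaker_atoms:
        unit = normalize(atom)
        if all(abs(dot(unit, u)) <= max_cos for u in ubd_unit):
            kept.append(atom)
    return kept

def matching_pursuit(x, atoms, n_iter=10, tol=1e-12):
    """Greedy sparse coding of x over unit-norm atoms (assumed solver)."""
    residual = list(x)
    coeffs = [0.0] * len(atoms)
    for _ in range(n_iter):
        corr = [dot(residual, a) for a in atoms]
        j = max(range(len(atoms)), key=lambda i: abs(corr[i]))
        if abs(corr[j]) < tol:
            break  # nothing left that any atom can explain
        coeffs[j] += corr[j]
        residual = [r - corr[j] * a for r, a in zip(residual, atoms[j])]
    return coeffs

def speaker_score(x, ubd_atoms, speaker_atoms, noise_atoms, n_iter=10):
    """Code x over [UBD | speaker | noise]; return the speaker-block energy."""
    atoms = [normalize(a) for a in ubd_atoms + speaker_atoms + noise_atoms]
    coeffs = matching_pursuit(x, atoms, n_iter)
    start = len(ubd_atoms)
    stop = start + len(speaker_atoms)
    return sum(c * c for c in coeffs[start:stop])

def identify(x, ubd_atoms, speaker_dicts, noise_atoms):
    """Score every enrolled speaker and return the best one with all scores."""
    scores = {
        name: speaker_score(x, ubd_atoms, atoms, noise_atoms)
        for name, atoms in speaker_dicts.items()
    }
    return max(scores, key=scores.get), scores

# Toy 4-dimensional data, invented purely for illustration.
ubd = [[1.0, 0.0, 0.0, 0.0]]
noise = [[0.0, 0.0, 0.0, 1.0]]
speakers = {
    # The second atom of speaker A is nearly parallel to the UBD atom.
    "A": prune_speaker_atoms([[0.0, 1.0, 0.0, 0.0], [0.99, 0.1, 0.0, 0.0]], ubd),
    "B": prune_speaker_atoms([[0.0, 0.0, 1.0, 0.0]], ubd),
}
x = [0.5, 1.0, 0.0, 0.3]  # common part + speaker A part + noise part
best, scores = identify(x, ubd, speakers, noise)
```

In this toy case the near-UBD atom of speaker A is pruned, the common and noise components of `x` are absorbed by the UBD and noise blocks, and only speaker A's block receives non-zero energy. A reconstruction-error score would be an equally plausible decision rule under the same assumptions.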
Pages: 3001-3005
Page count: 5
Related papers
50 in total
  • [1] Noise-robust feature based on sparse representation for speaker recognition
    Qi, Hongzhuo
    Metallurgical and Mining Industry, 2015, 7 (04): 64 - 69
  • [2] TOWARDS NOISE-ROBUST SPEAKER RECOGNITION USING PROBABILISTIC LINEAR DISCRIMINANT ANALYSIS
    Lei, Yun
    Burget, Lukas
    Ferrer, Luciana
    Graciarena, Martin
    Scheffer, Nicolas
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4253 - 4256
  • [3] A Noise-Robust System for NIST 2012 Speaker Recognition Evaluation
    Ferrer, Luciana
    McLaren, Mitchell
    Scheffer, Nicolas
    Lei, Yun
    Graciarena, Martin
    Mitra, Vikramjit
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1980 - 1984
  • [4] SIMPLIFIED VTS-BASED I-VECTOR EXTRACTION IN NOISE-ROBUST SPEAKER RECOGNITION
    Lei, Yun
    McLaren, Mitchell
    Ferrer, Luciana
    Scheffer, Nicolas
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [5] Feature recovery for noise-robust speaker verification
    Huang, Houjun
    Xu, Yunfei
    Zhou, Ruohua
    Yan, Yonghong
    ELECTRONICS LETTERS, 2015, 51 (18) : 1459 - 1461
  • [6] Gradient Regularization for Noise-Robust Speaker Verification
    Li, Jianchen
    Han, Jiqing
    Song, Hongwei
    INTERSPEECH 2021, 2021, : 1074 - 1078
  • [7] Pitch synchronous based feature extraction for noise-robust speaker verification
    Gong Wei-Guo
    Yang Li-Ping
    Chen Di
    CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 5, PROCEEDINGS, 2008, : 295 - 298
  • [8] Extraction of Noise-Robust Speaker Embedding Based on Generative Adversarial Networks
    Zhou, Jianfeng
    Jiang, Tao
    Hong, Qingyang
    Li, Lin
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1641 - 1645
  • [9] EXPLOITING LONG-RANGE TEMPORAL DYNAMICS OF SPEECH FOR NOISE-ROBUST SPEAKER RECOGNITION
    Jafari, Ayeh
    Srinivasan, Ramji
    Crookes, Danny
    Ming, Ji
    19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 2123 - 2127
  • [10] A Longest Matching Segment Approach with Baysian Adaptation - Application to Noise-Robust Speaker Recognition
    Jafari, Ayeh
    Srinivasan, Ramji
    Crookes, Danny
    Ming, Ji
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2760 - +