Auditory Sparse Representation for Robust Speaker Recognition Based on Tensor Structure

被引:0
|
作者
Qiang Wu
Liqing Zhang
机构
[1] Shanghai Jiao Tong University,Department of Computer Science and Engineering
来源
EURASIP Journal on Audio, Speech, and Music Processing | / 2008卷
关键词
Hair Cell; Speech Signal; Basilar Membrane; Tensor Structure; Sparse Code;
D O I
暂无
中图分类号
学科分类号
摘要
This paper investigates the problem of speaker recognition in noisy conditions. A new approach called nonnegative tensor principal component analysis (NTPCA) with sparse constraint is proposed for speech feature extraction. We encode speech as a general higher-order tensor in order to extract discriminative features in spectrotemporal domain. Firstly, speech signals are represented by cochlear feature based on frequency selectivity characteristics at basilar membrane and inner hair cells; then, low-dimension sparse features are extracted by NTPCA for robust speaker modeling. The useful information of each subspace in the higher-order tensor can be preserved. Alternating projection algorithm is used to obtain a stable solution. Experimental results demonstrate that our method can increase the recognition accuracy specifically in noisy environments.
引用
收藏
相关论文
共 50 条
  • [1] Auditory Sparse Representation for Robust Speaker Recognition Based on Tensor Structure
    Wu, Qiang
    Zhang, Liqing
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2008, 2008 (1)
  • [2] A robust feature based on sparse representation for speaker recognition
    Xie, Yining
    Huang, Jinjie
    Wang, Xinlei
    Journal of Computational Information Systems, 2013, 9 (09): : 3553 - 3561
  • [3] SPARSE-BASED AUDITORY MODEL FOR ROBUST SPEAKER RECOGNITION
    You, Datao
    Han, Jiqing
    Zheng, Tieran
    Zheng, Guibin
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (07)
  • [4] Noise-robust feature based on sparse representation for speaker recognition
    Qi, Hongzhuo
    Metallurgical and Mining Industry, 2015, 7 (04): : 64 - 69
  • [5] Intrinsic Variation Robust Speaker Verification based on Sparse Representation
    Nie, Yi
    Xu, Mingxing
    Xianyu, Haishu
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [6] Robust speaker verification based on max pooling of sparse representation
    Wang, Wei
    Han, Jiqing
    Zheng, Tieran
    Zheng, Guibin
    Han, J. (jqhan@hit.edu.cn), 1600, Computer Society of the Republic of China (24): : 56 - 65
  • [7] A robust sparse auditory feature for speaker verification
    Han, J. (jqhan@hit.edu.cn), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (09):
  • [8] Deep Hashing for Speaker Identification and Retrieval Based on Auditory Sparse Representation
    Tran, Dung Kim
    Akagi, Masato
    Unoki, Masashi
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 937 - 943
  • [9] Robust Face Recognition Based on Supervised Sparse Representation
    Mi, Jian-Xun
    Sun, Yueru
    Lu, Jia
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2018, PT III, 2018, 10956 : 253 - 259
  • [10] Robust sparse representation based face recognition in an adaptive weighted spatial pyramid structure
    Xiao Ma
    Fandong Zhang
    Yuelong Li
    Jufu Feng
    Science China Information Sciences, 2018, 61