Exploring Data-Independent Dimensionality Reduction in Sparse Representation-Based Speaker Identification

被引:2
作者
Haris, B. C. [1 ]
Sinha, Rohit [1 ]
机构
[1] Indian Inst Technol, Dept Elect & Elect Engn, Gauhati 781039, India
关键词
Sparse representation classification; Random projections; Speaker recognition; Supervectors; Dimensionality reduction; VERIFICATION; RECOGNITION; ALGORITHM;
D O I
10.1007/s00034-014-9757-x
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The sparse representation classification (SRC) has attracted the attention of many signal processing domains in past few years. Recently, it has been successfully explored for the speaker recognition task with Gaussian mixture model (GMM) mean supervectors which are typically of the order of tens of thousands as speaker representations. As a result of this, the complexity of such systems become very high. With the use of the state-of-the-art i-vector representations, the dimension of GMM mean supervectors can be reduced effectively. But the i-vector approach involves a high dimensional data projection matrix which is learned using the factor analysis approach over huge amount of data from a large number of speakers. Also, the estimation of i-vector for a given utterance involves a computationally complex procedure. Motivated by these facts, we explore the use of data-independent projection approaches for reducing the dimensionality of GMM mean supervectors. The data-independent projection methods studied in this work include a normal random projection and two kinds of sparse random projections. The study is performed on SRC-based speaker identification using the NIST SRE 2005 dataset which includes channel matched and mismatched conditions. We find that the use of data-independent random projections for the dimensionality reduction of the supervectors results in only 3 % absolute loss in performance compared to that of the data-dependent (i-vector) approach. It is highlighted that with the use of highly sparse random projection matrices having 1 as non-zero coefficients, a significant reduction in computational complexity is achieved in finding the projections. Further, as these matrices do not require floating point representations, their storage requirement is also very small compared to that of the data-dependent or the normal random projection matrices. These reduced complexity sparse random projections would be of interest in context of the speaker recognition applications implemented on platforms having low computational power.
引用
收藏
页码:2521 / 2538
页数:18
相关论文
共 50 条
  • [21] Comparative study of linear and nonlinear dimensionality reduction for speaker identification
    Errity, Andrew
    McKenna, John
    PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, 2007, : 587 - +
  • [22] Sparse representation-based ECG signal enhancement and QRS detection
    Zhou, Yichao
    Hu, Xiyuan
    Tang, Zhenmin
    Ahn, Andrew C.
    PHYSIOLOGICAL MEASUREMENT, 2016, 37 (12) : 2093 - 2110
  • [23] Sparse representation-based 3D model retrieval
    Cao, Qun
    An, Yang
    Shi, Yingdi
    Zhu, Xiaorong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (19) : 20069 - 20079
  • [24] Optimized Color Filter Arrays for Sparse Representation-Based Demosaicking
    Li, Jia
    Bai, Chenyan
    Lin, Zhouchen
    Yu, Jian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (05) : 2381 - 2393
  • [25] Robust L1-norm two-dimensional collaborative representation-based projection for dimensionality reduction
    He, Lulu
    Ye, Jimin
    E, Jianwei
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 81 (81)
  • [26] Evaluation of a Sparse Representation-Based Classifier For Bird Phrase Classification Under Limited Data Conditions
    Tan, Lee Ngee
    Kaewtip, Kantapon
    Cody, Martin L.
    Taylor, Charles E.
    Alwan, Abeer
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2521 - 2524
  • [27] Category Guided Sparse Preserving Projection for Biometric Data Dimensionality Reduction
    Huang, Qianying
    Wu, Yunsong
    Zhao, Chenqiu
    Zhang, Xiaohong
    Yang, Dan
    Biometric Recognition, 2016, 9967 : 539 - 546
  • [28] A Tensor-Based Approach for Big Data Representation and Dimensionality Reduction
    Kuang, Liwei
    Hao, Fei
    Yang, Laurence T.
    Lin, Man
    Luo, Changqing
    Min, Geyong
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2014, 2 (03) : 280 - 291
  • [29] Sparse Dimensionality Reduction Based on Compressed Sensing
    Tang, Yufang
    Li, Xueming
    Liu, Yan
    Wang, Jizhe
    Xu, Yan
    2014 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2014, : 3373 - 3378
  • [30] A robust feature based on sparse representation for speaker recognition
    Xie, Yining
    Huang, Jinjie
    Wang, Xinlei
    Journal of Computational Information Systems, 2013, 9 (09): : 3553 - 3561