Exploring Data-Independent Dimensionality Reduction in Sparse Representation-Based Speaker Identification

被引:2
作者
Haris, B. C. [1 ]
Sinha, Rohit [1 ]
机构
[1] Indian Inst Technol, Dept Elect & Elect Engn, Gauhati 781039, India
关键词
Sparse representation classification; Random projections; Speaker recognition; Supervectors; Dimensionality reduction; VERIFICATION; RECOGNITION; ALGORITHM;
D O I
10.1007/s00034-014-9757-x
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The sparse representation classification (SRC) has attracted the attention of many signal processing domains in past few years. Recently, it has been successfully explored for the speaker recognition task with Gaussian mixture model (GMM) mean supervectors which are typically of the order of tens of thousands as speaker representations. As a result of this, the complexity of such systems become very high. With the use of the state-of-the-art i-vector representations, the dimension of GMM mean supervectors can be reduced effectively. But the i-vector approach involves a high dimensional data projection matrix which is learned using the factor analysis approach over huge amount of data from a large number of speakers. Also, the estimation of i-vector for a given utterance involves a computationally complex procedure. Motivated by these facts, we explore the use of data-independent projection approaches for reducing the dimensionality of GMM mean supervectors. The data-independent projection methods studied in this work include a normal random projection and two kinds of sparse random projections. The study is performed on SRC-based speaker identification using the NIST SRE 2005 dataset which includes channel matched and mismatched conditions. We find that the use of data-independent random projections for the dimensionality reduction of the supervectors results in only 3 % absolute loss in performance compared to that of the data-dependent (i-vector) approach. It is highlighted that with the use of highly sparse random projection matrices having 1 as non-zero coefficients, a significant reduction in computational complexity is achieved in finding the projections. Further, as these matrices do not require floating point representations, their storage requirement is also very small compared to that of the data-dependent or the normal random projection matrices. These reduced complexity sparse random projections would be of interest in context of the speaker recognition applications implemented on platforms having low computational power.
引用
收藏
页码:2521 / 2538
页数:18
相关论文
共 50 条
  • [31] Dimensionality Reduction by Integrating Sparse Representation and Fisher Criterion and Its Applications
    Gao, Quanxue
    Wang, Qianqian
    Huang, Yunfang
    Gao, Xinbo
    Hong, Xin
    Zhang, Hailin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (12) : 5684 - 5695
  • [32] Power System Fault Classification Method based on Sparse Representation and Random Dimensionality Reduction Projection
    Cheng, Long
    Wang, Lingyun
    Gao, Feng
    2015 IEEE POWER & ENERGY SOCIETY GENERAL MEETING, 2015,
  • [33] Optimal Couple Projections for Domain Adaptive Sparse Representation-Based Classification
    Zhang, Guoqing
    Sun, Huaijiang
    Porikli, Fatih
    Liu, Yazhou
    Sun, Quansen
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (12) : 5922 - 5935
  • [34] Sparse Representation-Based Multiple Frame Video Super-Resolution
    Dai, Qiqin
    Yoo, Seunghwan
    Kappeler, Armin
    Katsaggelos, Aggelos K.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (02) : 765 - 781
  • [35] Face Sketch Synthesis via Sparse Representation-Based Greedy Search
    Zhang, Shengchuan
    Gao, Xinbo
    Wang, Nannan
    Li, Jie
    Zhang, Mingjin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (08) : 2466 - 2477
  • [36] Sparse Representation-Based SAR Image Target Classification on the 10-Class MSTAR Data Set
    Song, Haibo
    Ji, Kefeng
    Zhang, Yunshu
    Xing, Xiangwei
    Zou, Huanxin
    APPLIED SCIENCES-BASEL, 2016, 6 (01):
  • [37] SUPERPIXEL-LEVEL SPARSE REPRESENTATION-BASED CLASSIFICATION FOR HYPERSPECTRAL IMAGERY
    Jia, Sen
    Deng, Bin
    Jia, Xiuping
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 3302 - 3305
  • [38] A SPARSE REPRESENTATION-BASED CLASSIFIER FOR IN-SET BIRD PHRASE VERIFICATION AND CLASSIFICATION WITH LIMITED TRAINING DATA
    Tan, Lee Ngee
    Kossan, George
    Cody, Martin L.
    Taylor, Charles E.
    Alwan, Abeer
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 763 - 767
  • [39] A robust DNN model for text-independent speaker identification using non-speaker embeddings in diverse data conditions
    Shome, Nirupam
    Saritha, Banala
    Kashyap, Richik
    Laskar, Rabul Hussain
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (26) : 18933 - 18947
  • [40] A sparse grid based method for generative dimensionality reduction of high-dimensional data
    Bohn, Bastian
    Garcke, Jochen
    Griebel, Michael
    JOURNAL OF COMPUTATIONAL PHYSICS, 2016, 309 : 1 - 17