Exploring Data-Independent Dimensionality Reduction in Sparse Representation-Based Speaker Identification

被引：2

作者：

Haris, B. C. ^{[1
]}

Sinha, Rohit ^{[1
]}

机构：

[1] Indian Inst Technol, Dept Elect & Elect Engn, Gauhati 781039, India

来源：

CIRCUITS SYSTEMS AND SIGNAL PROCESSING | 2014年 / 33卷 / 08期

关键词：

Sparse representation classification; Random projections; Speaker recognition; Supervectors; Dimensionality reduction; VERIFICATION; RECOGNITION; ALGORITHM;

D O I：

10.1007/s00034-014-9757-x

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The sparse representation classification (SRC) has attracted the attention of many signal processing domains in past few years. Recently, it has been successfully explored for the speaker recognition task with Gaussian mixture model (GMM) mean supervectors which are typically of the order of tens of thousands as speaker representations. As a result of this, the complexity of such systems become very high. With the use of the state-of-the-art i-vector representations, the dimension of GMM mean supervectors can be reduced effectively. But the i-vector approach involves a high dimensional data projection matrix which is learned using the factor analysis approach over huge amount of data from a large number of speakers. Also, the estimation of i-vector for a given utterance involves a computationally complex procedure. Motivated by these facts, we explore the use of data-independent projection approaches for reducing the dimensionality of GMM mean supervectors. The data-independent projection methods studied in this work include a normal random projection and two kinds of sparse random projections. The study is performed on SRC-based speaker identification using the NIST SRE 2005 dataset which includes channel matched and mismatched conditions. We find that the use of data-independent random projections for the dimensionality reduction of the supervectors results in only 3 % absolute loss in performance compared to that of the data-dependent (i-vector) approach. It is highlighted that with the use of highly sparse random projection matrices having 1 as non-zero coefficients, a significant reduction in computational complexity is achieved in finding the projections. Further, as these matrices do not require floating point representations, their storage requirement is also very small compared to that of the data-dependent or the normal random projection matrices. These reduced complexity sparse random projections would be of interest in context of the speaker recognition applications implemented on platforms having low computational power.

引用

页码：2521 / 2538

页数：18

共 50 条

[21] Comparative study of linear and nonlinear dimensionality reduction for speaker identification
Errity, Andrew
McKenna, John
PROCEEDINGS OF THE 2007 15TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, 2007, : 587 - +
[22] Sparse representation-based ECG signal enhancement and QRS detection
Zhou, Yichao
Hu, Xiyuan
Tang, Zhenmin
Ahn, Andrew C.
PHYSIOLOGICAL MEASUREMENT, 2016, 37 (12) : 2093 - 2110
[23] Sparse representation-based 3D model retrieval
Cao, Qun
An, Yang
Shi, Yingdi
Zhu, Xiaorong
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (19) : 20069 - 20079
[24] Optimized Color Filter Arrays for Sparse Representation-Based Demosaicking
Li, Jia
Bai, Chenyan
Lin, Zhouchen
Yu, Jian
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (05) : 2381 - 2393
[25] Robust L1-norm two-dimensional collaborative representation-based projection for dimensionality reduction
He, Lulu
Ye, Jimin
E, Jianwei
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2020, 81 (81)
[26] Evaluation of a Sparse Representation-Based Classifier For Bird Phrase Classification Under Limited Data Conditions
Tan, Lee Ngee
Kaewtip, Kantapon
Cody, Martin L.
Taylor, Charles E.
Alwan, Abeer
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2521 - 2524
[27] Category Guided Sparse Preserving Projection for Biometric Data Dimensionality Reduction
Huang, Qianying
Wu, Yunsong
Zhao, Chenqiu
Zhang, Xiaohong
Yang, Dan
Biometric Recognition, 2016, 9967 : 539 - 546
[28] A Tensor-Based Approach for Big Data Representation and Dimensionality Reduction
Kuang, Liwei
Hao, Fei
Yang, Laurence T.
Lin, Man
Luo, Changqing
Min, Geyong
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2014, 2 (03) : 280 - 291
[29] Sparse Dimensionality Reduction Based on Compressed Sensing
Tang, Yufang
Li, Xueming
Liu, Yan
Wang, Jizhe
Xu, Yan
2014 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2014, : 3373 - 3378
[30] A robust feature based on sparse representation for speaker recognition
Xie, Yining
Huang, Jinjie
Wang, Xinlei
Journal of Computational Information Systems, 2013, 9 (09): : 3553 - 3561

← 1 2 3 4 5 →