Auditory Sparse Representation for Robust Speaker Recognition Based on Tensor Structure

被引：0

作者：

Qiang Wu

Liqing Zhang

机构：

[1] Shanghai Jiao Tong University,Department of Computer Science and Engineering

来源：

EURASIP Journal on Audio, Speech, and Music Processing | / 2008卷

关键词：

Hair Cell; Speech Signal; Basilar Membrane; Tensor Structure; Sparse Code;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This paper investigates the problem of speaker recognition in noisy conditions. A new approach called nonnegative tensor principal component analysis (NTPCA) with sparse constraint is proposed for speech feature extraction. We encode speech as a general higher-order tensor in order to extract discriminative features in spectrotemporal domain. Firstly, speech signals are represented by cochlear feature based on frequency selectivity characteristics at basilar membrane and inner hair cells; then, low-dimension sparse features are extracted by NTPCA for robust speaker modeling. The useful information of each subspace in the higher-order tensor can be preserved. Alternating projection algorithm is used to obtain a stable solution. Experimental results demonstrate that our method can increase the recognition accuracy specifically in noisy environments.

引用

共 50 条

[1] Auditory Sparse Representation for Robust Speaker Recognition Based on Tensor Structure
Wu, Qiang
Zhang, Liqing
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2008, 2008 (1)
[2] A robust feature based on sparse representation for speaker recognition
Xie, Yining
Huang, Jinjie
Wang, Xinlei
Journal of Computational Information Systems, 2013, 9 (09): : 3553 - 3561
[3] SPARSE-BASED AUDITORY MODEL FOR ROBUST SPEAKER RECOGNITION
You, Datao
Han, Jiqing
Zheng, Tieran
Zheng, Guibin
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (07)
[4] Noise-robust feature based on sparse representation for speaker recognition
Qi, Hongzhuo
Metallurgical and Mining Industry, 2015, 7 (04): : 64 - 69
[5] Intrinsic Variation Robust Speaker Verification based on Sparse Representation
Nie, Yi
Xu, Mingxing
Xianyu, Haishu
2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
[6] Robust speaker verification based on max pooling of sparse representation
Wang, Wei
Han, Jiqing
Zheng, Tieran
Zheng, Guibin
Han, J. (jqhan@hit.edu.cn), 1600, Computer Society of the Republic of China (24): : 56 - 65
[7] A robust sparse auditory feature for speaker verification
Han, J. (jqhan@hit.edu.cn), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (09):
[8] Deep Hashing for Speaker Identification and Retrieval Based on Auditory Sparse Representation
Tran, Dung Kim
Akagi, Masato
Unoki, Masashi
PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 937 - 943
[9] Robust Face Recognition Based on Supervised Sparse Representation
Mi, Jian-Xun
Sun, Yueru
Lu, Jia
INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2018, PT III, 2018, 10956 : 253 - 259
[10] Robust sparse representation based face recognition in an adaptive weighted spatial pyramid structure
Xiao Ma
Fandong Zhang
Yuelong Li
Jufu Feng
Science China Information Sciences, 2018, 61

← 1 2 3 4 5 →