Speaker-Aware Linear Discriminant Analysis in Speaker Verification

被引：0

作者：

Zheng, Naijun ^{[1
]}

Wu, Xixin ^{[1
]}

Zhong, Jinghua ^{[2
]}

Liu, Xunying ^{[1
]}

Meng, Helen ^{[1
]}

机构：

[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China

[2] SpeechX Ltd, Shenzhen, Peoples R China

来源：

INTERSPEECH 2020 | 2020年

关键词：

Linear discriminant analysis (LDA); speaker verification; speaker-aware;

D O I：

10.21437/Interspeech.2020-2061

中图分类号：

R36 [病理学]; R76 [耳鼻咽喉科学];

学科分类号：

100104 ; 100213 ;

摘要：

Linear discriminant analysis (LDA) is an effective and widely used discriminative technique for speaker verification. However, it only utilizes the information on global structure to perform classification. Some variants of LDA, such as local pairwise LDA (LPLDA), are proposed to preserve more information on the local structure in the linear projection matrix. However, considering that the local structure may vary a lot in different regions, summing up related components to construct a single projection matrix may not be sufficient. In this paper, we present a speaker-aware strategy focusing on preserving distinct information on local structure in a set of linear discriminant projection matrices, and allocating them to different local regions for dimension reduction and classification. Experiments on NIST SRE2010 and NIST SRE2016 show that the speaker-aware strategy can boost the performance of both LDA and LPLDA backends in i-vector systems and x-vector systems.

引用

页码：3012 / 3016

页数：5

共 50 条

[41] Speaker and session variability in GMM-based speaker verification [J].

Kenny, Patrick ;

Boulianne, Gilles ;

Ouellet, Pierre ;

Dumouchel, Pierre .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04) :1448-1460

[42] PRISM: Pre-trained Indeterminate Speaker Representation Model for Speaker Diarization and Speaker Verification [J].

Zheng, Siqi ;

Suo, Hongbin ;

Chen, Qian .

INTERSPEECH 2022, 2022, :1431-1435

[43] REVERBERATION COMPENSATION FOR SPEAKER VERIFICATION [J].

Peer, Itai ;

Rafaely, Boaz ;

Zigel, Yaniv .

2008 IEEE 25TH CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, VOLS 1 AND 2, 2008, :333-+

[44] Multimodal Association for Speaker Verification [J].

Shon, Suwon ;

Glass, James .

INTERSPEECH 2020, 2020, :2247-2251

[45] Learnable MFCCs for Speaker Verification [J].

Liu, Xuechen ;

Sahidullah, Md ;

Kinnunen, Tomi .

2021 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2021,

[46] Discriminative Adaptation for Speaker Verification [J].

Longworth, C. ;

Gales, M. J. F. .

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, :1467-1470

[47] Lightweight Embeddings for Speaker Verification [J].

Tkachenko, Maxim ;

Yamshinin, Alexander ;

Kotov, Mikhail ;

Nastasenko, Marina .

SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 :687-696

[48] DISCRIMINATIVE AUTOENCODERS FOR SPEAKER VERIFICATION [J].

Lee, Hung-Shin ;

Lu, Yu-Ding ;

Hsu, Chin-Cheng ;

Tsao, Yu ;

Wang, Hsin-Min ;

Leng, Shyh-Kang .

2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, :5375-5379

[49] Speaker verification for multimedia application [J].

Ciota, Z .

2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, :2752-2756

[50] SPEAKER VERIFICATION FOR ROMANIAN LANGUAGE [J].

Dumitru, C. O. ;

Gavat, Inge .

UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2006, 68 (04) :81-90

← 1 2 3 4 5 →