Simplification of I-Vector Extraction for Speaker Identification

被引:0
|
作者
XU Longting [1 ]
YANG Zhen [1 ]
SUN Linhui [1 ]
机构
[1] Broadband Wireless Communication and Sensor Network Technology Key Lab,Nanjing University of Posts and Telecommunications
基金
中国国家自然科学基金;
关键词
Speaker identification; Closed-set; I-vector; Symmetric matrix factorization;
D O I
暂无
中图分类号
TN912.34 [语音识别与设备];
学科分类号
0711 ;
摘要
The identity vector(i-vector) approach has been the state-of-the-art for text-independent speaker recognition, both identification and verification in recent years. An i-vector is a low-dimensional vector in the socalled total variability space represented with a thin and tall rectangular matrix. This paper introduces a novel algorithm to improve the computational and memory requirements for the application. In our method, the series of symmetric matrices can be represented by diagonal expression,sharing the same dictionary, which to some extent is analogous to eigen decomposition, and we name this algorithm Eigen decomposition like factorization(EDLF). Similar algorithms are listed for comparison, in the same condition,our method shows no disadvantages in identification accuracy.
引用
收藏
页码:1121 / 1126
页数:6
相关论文
共 50 条
  • [31] Evaluation of i-vector Speaker Recognition Systems for Forensic Application
    Mandasari, Miranti Indar
    McLaren, Mitchell
    van Leeuwen, David
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 28 - 31
  • [32] SIMPLIFIED AND SUPERVISED I-VECTOR MODELING FOR SPEAKER AGE REGRESSION
    Shivakumar, Prashanth Gurunath
    Li, Ming
    Dhandhania, Vedant
    Narayanan, Shrikanth S.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [33] Simplified supervised i-vector modeling with application to robust and efficient language identification and speaker verification
    Li, Ming
    Narayanan, Shrikanth
    COMPUTER SPEECH AND LANGUAGE, 2014, 28 (04) : 940 - 958
  • [34] WEIGHTED LDA TECHNIQUES FOR I-VECTOR BASED SPEAKER VERIFICATION
    Kanagasundaram, A.
    Dean, D.
    Vogt, R.
    McLaren, M.
    Sridharan, S.
    Mason, M.
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4781 - 4784
  • [35] Bayesian Principal Component Analysis for I-Vector Speaker Verification
    Rong Y.-F.
    Chen C.
    Chen D.-Y.
    He Y.-J.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2021, 49 (11): : 2186 - 2194
  • [36] Non-linear PLDA for i-Vector Speaker Verification
    Novoselov, Sergey
    Pekhovsky, Timur
    Kudashev, Oleg
    Mendelev, Valentin
    Prudnikov, Alexey
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 214 - 218
  • [37] Bayesian Distance Metric Learning on i-vector for Speaker Verification
    Fang, Xiao
    Dehak, Najim
    Glass, James
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2513 - 2517
  • [38] Speaker Adaptation Using the I-Vector Technique for Bottleneck Features
    Cardinal, Patrick
    Dehak, Najim
    Zhang, Yu
    Glass, James
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2867 - 2871
  • [39] Minimax i-vector extractor for short duration speaker verification
    Hautamaki, Ville
    Cheng, You-Chi
    Rajan, Padmanabhan
    Lee, Chin-Hui
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3675 - 3679
  • [40] Clustering-Based I-Vector Formulation for Speaker Recognition
    Lee, Hung-Shin
    Tsao, Yu
    Wang, Hsin-Min
    Jeng, Shyh-Kang
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1101 - 1105