Classifiers Ensemble of HMM and d-Vectors in Biometric Speaker Verification

被引:0
|
作者
Carlos Atenco-Vazquez, Juan [1 ]
Moreno-Rodriguez, Juan C. [1 ]
Cruz-Vega, Israel [1 ]
Gomez-Gil, Pilar [1 ]
Arechiga, Rene [2 ]
Manuel Ramirez-Cortes, Juan [1 ]
机构
[1] Natl Inst Astrophys Opt & Elect, Puebla, Mexico
[2] New Mexico Inst Min & Technol, Elect Engn Dept, Socorro, NM USA
关键词
Speaker verification; Voice biometrics; HMM-UBM; Machine learning; d-vectors; FIS; SVM; MLP;
D O I
10.1007/978-3-030-60884-2_9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel approach on text-dependent biometric speaker verification (SV) based on the ensemble of two feature extraction and classification processes using Hidden Markov Models in a Universal Background Model framework (HMM-UBM) and d-Vectors derived from a Deep Learning Network (DNN) structure. Once the individual SV systems are trained, a third classifier is trained/tuned over individual test scores in the same dataset using three different approaches for comparison purposes: Multilayer Perceptron (MLP), Support Vector Machine (SVM) with three different kernels, and a Fuzzy Inference System (FIS). Obtained results over a proprietary speech database in Spanish, indicate an improved performance, providing an Equal Error Rate (EER) within the range of 0.7%-2.54% when classifier ensembles are used, versus an EER of 3.6% and above obtained in average with individual classifiers. Results in detail corresponding to comparison of the several approaches used in this experimental work are further described.
引用
收藏
页码:119 / 131
页数:13
相关论文
共 50 条
  • [1] Beyond Speech: Generalizing D-Vectors for Biometric Verification
    Baldwin, Jacob
    Burnham, Ryan
    Meyer, Andrew
    Dora, Robert
    Wright, Robert
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 842 - 849
  • [2] Speaker adaptation in DNN-based speech synthesis using d-vectors
    Doddipatla, Rama
    Braunschweiler, Norbert
    Maia, Ranniery
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3404 - 3408
  • [3] MULTIMODAL SPEAKER DIARIZATION OF REAL-WORLD MEETINGS USING D-VECTORS WITH SPATIAL FEATURES
    Kang, Wonjune
    Roy, Brandon C.
    Chow, Wesley
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6509 - 6513
  • [4] An ensemble learning approach to lip-based biometric verification, with a dynamic selection of classifiers
    Porwik, Piotr
    Doroz, Rafal
    Wrobel, Krzysztof
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 115 : 673 - 683
  • [5] Ensemble approach in Speaker Verification
    Perera, Leibny Paola Garcia
    Raj, Bhiksha
    Flores, Juan Arturo Nolazco
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2454 - 2458
  • [6] Speaker verification using HMM for Romanian language
    Dumitru, Corneliu Octavian
    Gavat, Inge
    Vieru, Rodica
    PROCEEDINGS ELMAR-2006, 2006, : 131 - +
  • [7] Automatic Speaker Verification Experiments using HMM
    Munteanu, Doru-Petru
    Toma, Stefan-Adrian
    PROCEEDINGS OF THE 2010 8TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM), 2010, : 107 - 110
  • [8] Application of ANN and HMM to automatic speaker verification
    Alegre, Federico Leonardo
    IEEE Latin America Transactions, 2007, 5 (05) : 329 - 337
  • [9] A HMM/SVM Hybrid Method for Speaker Verification
    Florin, Rastoceanu
    Militaru, Diana
    PROCEEDINGS OF THE 2010 8TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM), 2010, : 111 - 114
  • [10] An experimental comparison of ensemble of classifiers for biometric data
    Nanni, Loris
    Lumini, Alessandra
    NEUROCOMPUTING, 2006, 69 (13-15) : 1670 - 1673