Combination of pitch and MFCC GMM supervectors for speaker verification

被引:1
|
作者
Huang, Wei [1 ]
Chao, Jianshu
Zhang, Yaxin [1 ]
机构
[1] Motorola Inc, China Res Ctr, Shanghai, Peoples R China
关键词
D O I
10.1109/ICALIP.2008.4590129
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A large majority of speaker verification systems are based on frame-level acoustic features, such as Mel Frequency Cepstral Coefficients (MFCCs) which characterize the vocal tract contribution. The most commonly used statistical GMM-UBM classifier models the distribution of MFCCs quite well. Pitch is one of the most important features which characterize speaker-dependent vocal fold vibration rate. It can complement the vocal tract information as source information. Although the source information is supposed to follow a lognormal distribution, the discriminative Support Vector Machine (SVM) is more suitable for pitch classification. In this paper, firstly we exploit GMM-UBM and SVM to the frame-level pitch vectors. Then we put the state-of-the-art GMM supervectors concept to the pitch feature vectors and experiment shows a promising result. And the combination of two feature type GMM supervectors systems gains much better performance. All experiment results are obtained on the NIST 2001 Speaker Database.
引用
收藏
页码:1335 / 1339
页数:5
相关论文
共 50 条
  • [1] Speaker Verification in Noisy Environment Using GMM Supervectors
    Sarkar, Sourjya
    Rao, K. Sreenivasa
    2013 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2013,
  • [2] Support vector machines using GMM supervectors for speaker verification
    Campbell, WM
    Sturim, DE
    Reynolds, DA
    IEEE SIGNAL PROCESSING LETTERS, 2006, 13 (05) : 308 - 311
  • [3] SVM speaker verification using session variability modelling and GMM supervectors
    McLaren, M.
    Vogt, R.
    Sridharan, S.
    ADVANCES IN BIOMETRICS, PROCEEDINGS, 2007, 4642 : 1077 - +
  • [4] LDA combination of pitch and MFCC features in speaker recognition
    Harrag, A
    Mohamadi, T
    Serignat, JF
    INDICON 2005 Proceedings, 2005, : 237 - 240
  • [5] Comparison of LPCC and MFCC features and GMM and GMM-UBM modeling for Limited Data Speaker Verification
    Kumari, Jayanthi T. R.
    Jayanna, H. S.
    2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (IEEE ICCIC), 2014, : 818 - 823
  • [6] Writer Identification and Verification Using GMM Supervectors
    Christlein, Vincent
    Bernecker, David
    Hoenig, Florian
    Angelopoulou, Elli
    2014 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2014, : 998 - 1005
  • [7] GPU Accelerated GMM Supervectors for Speaker and Language Recognition
    Wang Fuqiu
    Zhang, Wei-Qiang
    Jia, Liu
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 536 - +
  • [8] GMM-Based Speaker Verification System with Hardware MFCC in SoC Design
    Tsai, Tsung-Han
    Wang, Chiao-Li
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (19) : 56991 - 57010
  • [9] Gender Identification of a Speaker Using MFCC and GMM
    Yucesoy, Ergun
    Nabiyev, Vasif V.
    2013 8TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ELECO), 2013, : 626 - 629
  • [10] Determination of a speaker's age and gender with an SVM classifier based on GMM supervectors
    Yucesoy, Ergun
    Nabiyev, Vasif V.
    JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, 2016, 31 (03): : 501 - 510