HIGH IMPROVEMENT OF SPEAKER IDENTIFICATION AND VERIFICATION BY COMBINING MFCC AND PHASE INFORMATION

被引:24
|
作者
Wang, Longbiao [1 ]
Ohtsuka, Shinji [2 ]
Nakagawa, Seiichi [2 ]
机构
[1] Shizuoka Univ, Dept Syst Engn, Shizuoka 4228529, Japan
[2] Toyohashi Univ Technol, Dept Informat & Comp Sci, Aichi, Japan
关键词
speaker identification; speaker verification; MFCC; phase information; combination method; RECOGNITION; FEATURES; MIXTURE;
D O I
10.1109/ICASSP.2009.4960637
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In conventional speaker recognition methods based on MFCC, phase information has been ignored. We proposed a method that integrated the phase information with MFCC on a speaker identification method, and a preliminary experiment was performed. In this paper, we propose a new modified feature parameter (that is, coordidates on an unit circle) obtained from the original phase information, and evaluated it by using speech database consisting of normal, fast and slow speaking modes. The speaker identification experiments were performed using NTT database which consists of sentences uttered by 35 Japanese speakers (22 males and 13 females) on five sessions over ten months. Each speaker uttered only 5 training utterances at a normal speaking mode (about 20 seconds in total). The proposed new phase information was more robust than the original phase information for all speaking modes. By integrating the new phase information with the MFCC, the speaker identification error rate was remarkably reduced for normal, fast and slow speaking rates in comparison with a standard MFCC-based method. In this paper, speaker verification experiments were also evaluated using the phase information. The experiments show that the phase information is also very useful for the speaker verification.
引用
收藏
页码:4529 / +
页数:2
相关论文
共 50 条
  • [1] Speaker Identification and Verification by Combining MFCC and Phase Information
    Nakagawa, Seiichi
    Wang, Longbiao
    Ohtsuka, Shinji
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (04): : 1085 - 1095
  • [2] SPEAKER IDENTIFICATION BY COMBINING MFCC AND PHASE INFORMATION IN NOISY ENVIRONMENTS
    Wang, Longbiao
    Minami, Kazue
    Yamamoto, Kazumasa
    Nakagawa, Seiichi
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4502 - 4505
  • [3] Speaker Recognition by Combining MFCC and Phase Information
    Nakagawa, Seiichi
    Asakawa, Kouhei
    Wang, Longbiao
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1065 - 1068
  • [4] Speaker recognition by combining MFCC and phase information
    Department of Information and Computer Sciences, Toyohashi University of Technology, Japan
    Int. Speech Commun. Assoc. - Annu. Conf. Int. Speech Commun. Assoc., Interspeech, (1065-1068):
  • [5] Speaker Recognition by Combining MFCC and Phase Information in Noisy Conditions
    Wang, Longbiao
    Minami, Kazue
    Yamamoto, Kazumasa
    Nakagawa, Seiichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (09): : 2397 - 2406
  • [6] Speaker Verification by Combining Information from Magnitude and Phase Spectrum
    Jain, Karthik K.
    Kirthan, M. G.
    Pai, Vinayak R.
    Narendra, K. C.
    PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 163 - 166
  • [7] Fusion of TEO Phase with MFCC Features for Speaker Verification
    Agrawal, Purvi
    Patil, Hemant A.
    PERCEPTION AND MACHINE INTELLIGENCE, 2015, 2015, : 161 - 166
  • [8] Improving short utterance speaker verification by combining MFCC and Entrocy in Noisy conditions
    Khamis A. Al-karawi
    Duraid Y. Mohammed
    Multimedia Tools and Applications, 2021, 80 : 22231 - 22249
  • [9] Improving short utterance speaker verification by combining MFCC and Entrocy in Noisy conditions
    Al-karawi, Khamis A.
    Mohammed, Duraid Y.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (14) : 22231 - 22249
  • [10] Analysis of Performance Improvement for Speaker Verification by Combining Feature Vectors of LPC Spectral Envelope, MFCC and pLPC Pole Distribution
    Shigeta, Haruki
    Komatsu, Kodai
    Oyabu, Shun
    Matsuo, Kazuya
    Kurogi, Shuichi
    INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, ISDA 2021, 2022, 418 : 220 - 230