Speaker recognition in adverse conditions

被引:0
|
作者
Iyer, Ananth N. [1 ]
Ofoegbu, Uchechukwu O. [1 ]
Yantorno, Robert E. [1 ]
Wenndt, Stanley J. [2 ]
机构
[1] Temple Univ, Speech Proc Lab, Philadelphia, PA 19122 USA
[2] IFEC, AF Res Lab, Griffiss AFB, NY 13441 USA
来源
2007 IEEE AEROSPACE CONFERENCE, VOLS 1-9 | 2007年
关键词
D O I
暂无
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Recognizing speakers from their voices is a challenging area of research with several practical applications. Presently speaker verification (SV) systems achieve a high level of accuracy under ideal conditions such as, when there is ample data to build speaker models and when speaker verification is performed in the presence of little or no interference. In general, these systems assume that the features extracted from the data follow a particular parametric probability density function (pdf), i.e., Gaussian or a mixture of Gaussians; where a form of the pdf is imposed on the speech data rather than determining the underlying structure of the pdf. In practical conditions, like in an aircraft cockpit where most of the verbal communication is in the form of short commands, it is almost impossible to ascertain that the assumptions made about the structure of the pdf are correct, and wrong assumptions could lead to significant reduction in performance of the SV system. In this research, non-parametric strategies, to statistically model speakers are developed and evaluated. Nonparametric density estimation methods are generally known to be superior when limited data is available for model building and SV Experimental evaluation has shown that the non-parametric system yielded a 70% accuracy level in speaker verification with only 0.5 seconds of data and under the influence of noise with signal-to-noise ratio of 5dB. This result corresponds to a 20% decrease in error when compared to the parametric system.
引用
收藏
页码:1547 / 1554
页数:8
相关论文
共 50 条
  • [1] Robust speaker recognition in noisy conditions
    Ming, Ji
    Hazen, Timothy J.
    Glass, James R.
    Reynolds, Douglas A.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (05): : 1711 - 1723
  • [2] An Analysis of the Influence of Acoustical Adverse Conditions on Speaker Gender Identification
    Maka, Tomasz
    Dziurzanski, Piotr
    2014 XXII ANNUAL PACIFIC VOICE CONFERENCE (PVC), 2014,
  • [3] Enhancement of mismatched conditions in speaker recognition for multimedia applications
    Fakhr, W
    AbdelSalam, A
    Hamdy, N
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 377 - 380
  • [4] Audiovisual perception in adverse conditions: Language, speaker and listener effects
    Hazan, Valerie
    Kim, Jeesun
    Chen, Yuchun
    SPEECH COMMUNICATION, 2010, 52 (11-12) : 996 - 1009
  • [5] Unsupervised and incremental speaker adaptation under adverse environmental conditions
    Takagi, K
    Shinoda, K
    Hattori, H
    Watanabe, T
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2079 - 2082
  • [6] Aural and automatic forensic speaker recognition in mismatched conditions
    Alexander, Anil
    Dessimoz, Damien
    Botti, Filippo
    Drygajlo, Andrzel
    INTERNATIONAL JOURNAL OF SPEECH LANGUAGE AND THE LAW, 2005, 12 (02) : 214 - 234
  • [7] SPEAKER RECOGNITION IN NOISY CONDITIONS WITH LIMITED TRAINING DATA
    McLaughlin, Niall
    Ming, Ji
    Crookes, Danny
    19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 1294 - 1298
  • [8] Speech recognition in adverse conditions: A review
    Mattys, Sven L.
    Davis, Matthew H.
    Bradlow, Ann R.
    Scott, Sophie K.
    LANGUAGE AND COGNITIVE PROCESSES, 2012, 27 (7-8): : 953 - 978
  • [9] From Speaker Recognition to Forensic Speaker Recognition
    Drygajlo, Andrzej
    BIOMETRIC AUTHENTICATION (BIOMET 2014), 2014, 8897 : 93 - 104
  • [10] Joint Identification and Localization of a Speaker in Adverse Conditions Using a Microphone Array
    Salvati, Daniele
    Drioli, Carlo
    Foresti, Gian Luca
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 21 - 25