Spectral entropy and spectral shape based pre-quantization for real time speaker identification system

Cited by: 2
Authors
Sarkar G. [1]
Saha G. [1]
Affiliation
[1] Department of Electronics and Electrical Communication Engineering, IIT Kharagpur
Keywords
Kurtosis; Pre-quantization; Speaker identification; Spectral entropy
DOI
10.1007/s10772-010-9079-8
Abstract
Pre-processing is one of the vital steps in developing a robust and efficient recognition system. Better pre-processing not only aids data selection but also significantly reduces computational complexity. Further, an efficient frame selection technique can improve the overall performance of the system. Pre-quantization (PQ) is the technique of selecting fewer frames in the pre-processing stage to reduce the computational burden in the post-processing stages of speaker identification (SI). In this paper, we develop PQ techniques based on spectral entropy and spectral shape to pick suitable frames containing speaker-specific information, which varies from frame to frame depending on the spoken text and environmental conditions. The attempt is to exploit the statistical properties of the distributions of speech frames at the pre-processing stage of speaker recognition. Our aim is not only to reduce the frame rate but also to keep identification accuracy reasonably high. Further, we have also analyzed the robustness of our proposed techniques on noisy utterances. To establish the efficacy of our proposed methods, we used two different databases, POLYCOST (telephone speech) and YOHO (microphone speech). © Springer Science+Business Media, LLC 2010.
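The abstract does not give the selection rule itself, so the following is only a minimal sketch of what entropy-based pre-quantization could look like, not the authors' method. It assumes that frames with low normalized spectral entropy (peaky, structured spectra) are the ones worth keeping, and it uses the kurtosis of the power spectrum as one possible spectral-shape measure. The function names, the `keep_ratio` parameter, and the keep-the-lowest-entropy rule are all hypothetical.

```python
import cmath
import math


def power_spectrum(frame):
    """Naive DFT power spectrum over the non-negative frequency bins."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def spectral_entropy(frame):
    """Shannon entropy of the normalized power spectrum, scaled to [0, 1]."""
    power = power_spectrum(frame)
    total = sum(power)
    if total == 0.0:
        # Assumption: a silent frame is treated as maximally flat,
        # so that it is never preferred by the selection step.
        return 1.0
    probs = [p / total for p in power]
    entropy = -sum(p * math.log2(p) for p in probs if p > 0.0)
    return entropy / math.log2(len(power))


def spectral_kurtosis(frame):
    """Kurtosis of the power-spectrum values (an assumed shape measure)."""
    power = power_spectrum(frame)
    mean = sum(power) / len(power)
    var = sum((p - mean) ** 2 for p in power) / len(power)
    if var == 0.0:
        return 0.0  # perfectly flat spectrum: kurtosis is undefined, report 0
    fourth = sum((p - mean) ** 4 for p in power) / len(power)
    return fourth / var ** 2


def select_frames(frames, keep_ratio=0.5):
    """Return indices of the lowest-entropy frames, in time order.

    Hypothetical rule: keep the `keep_ratio` fraction of frames whose
    spectra are least flat. The paper's actual criterion may differ.
    """
    count = max(1, round(len(frames) * keep_ratio))
    ranked = sorted(range(len(frames)),
                    key=lambda i: spectral_entropy(frames[i]))
    return sorted(ranked[:count])
```

With a pure tone and a unit impulse as toy frames, the tone has near-zero entropy (energy in one bin) and the impulse has entropy 1 (flat spectrum), so `select_frames` at `keep_ratio=0.5` keeps only the tone frames. A practical system would replace the naive DFT with an FFT and apply a window to each frame.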
Pages: 189-199
Number of pages: 10