Speaker Identification Approach Based On Time Domain Extracted Features

被引:0
|
作者
Lupu, Eugen [1 ]
Emerich, Simina [1 ]
机构
[1] Tech Univ Cluj Napoca, Dept Commun, Cluj Napoca, Romania
来源
关键词
speaker identification; TESPAR; epoch; SVM; confusion matrix;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a speaker identification approach based on features extracted by time domain speech analysis. Most features (28) issue from the TESPAR (Time Encoded Signal Processing and Recognition) coding method. The other four features are provided by the time domain analysis of the waveform. The features further employed are: the relative mean square energy, the number of maxima in the energy envelope, the pitch frequency average and the relative number of zero crossings for every utterance. This approach implies low computational requirements for features extraction and provides good recognition rates. For the experiments some classifiers (kNN, Bayes Net, Naive Bayes, RBF and SVM) provided by the WEKA (Waikato Environment for Knowledge Analysis) environment are employed.
引用
收藏
页码:355 / 358
页数:4
相关论文
共 50 条
  • [31] Spectrogram Features-Based Automatic Speaker Identification For Smart Services
    Jahangir, Rashid
    Alreshoodi, Mohammed
    Khaled Alarfaj, Fawaz
    APPLIED ARTIFICIAL INTELLIGENCE, 2025, 39 (01)
  • [32] Speaker identification and verification based on cepstral features and fuzzy nonlinear classifier
    Dustor, A.
    Proceedings of the International Conference Mixed Design of Integrated Circuits and Systems, 2006, : 692 - 697
  • [33] An HMM-based subband processing approach to speaker identification
    Higgins, JE
    Damper, RI
    AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2001, 2091 : 169 - 174
  • [34] Classification of radar clutter using features extracted from the time-frequency domain
    Jouny, I
    Wu, C
    AUTOMATIC TARGET RECOGNITION VII, 1997, 3069 : 49 - 60
  • [35] Time-domain Transformer-based Audiovisual Speaker Separation
    Kalkhorani, Vahid Ahmadi
    Kumar, Anurag
    Tan, Ke
    Xu, Buye
    Wang, DeLiang
    INTERSPEECH 2023, 2023, : 3472 - 3476
  • [36] Identification of Motor Neuron Disease Using Wavelet Domain Features Extracted from EMG Signal
    Fattah, Shaikh Anowarul
    Doulah, A. B. M. Sayeed Ud
    Iqbal, Md. Asif
    Shahnaz, Celia
    Zhu, Wei-Ping
    Ahmad, M. Omair
    2013 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2013, : 1308 - 1311
  • [37] Intelligent identification of bearing faults using time domain features
    Wu Chenxi
    Ning Liwei
    Jiang Rong
    Wu Xing
    Liu Junan
    2013 FOURTH INTERNATIONAL CONFERENCE ON DIGITAL MANUFACTURING AND AUTOMATION (ICDMA), 2013, : 713 - 716
  • [38] Improvement of speaker identification by combining prosodic features with acoustic features
    Zheng, R
    Zhang, SW
    Xu, B
    ADVANCES IN BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2004, 3338 : 569 - 576
  • [39] Time-Domain Target-Speaker Speech Separation With Waveform-Based Speaker Embedding
    Zhao, Jianshu
    Gao, Shengzhou
    Shinozaki, Takahiro
    INTERSPEECH 2020, 2020, : 1436 - 1440
  • [40] Speaker identification using speech and lip features
    Ou, GB
    Li, X
    Yao, XC
    Jia, HB
    Murphey, YL
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 2565 - 2570