Efficient speaker identification using spectral entropy

被引：0

作者：

Fernando Luque-Suárez

Antonio Camarena-Ibarrola

Edgar Chávez

机构：

[1] CICESE,

[2] Universidad Michoacana,undefined

来源：

Multimedia Tools and Applications | 2019年 / 78卷

关键词：

Speaker recognition; Speaker identification; Entropygrams;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In voice recognition, the two main problems are speech recognition (what was said), and speaker recognition (who was speaking). The usual method for speaker recognition is to postulate a model where the speaker identity corresponds to the parameters of the model, which estimation could be time-consuming when the number of candidate speakers is large. In this paper, we model the speaker as a high dimensional point cloud of entropy-based features, extracted from the speech signal. The method allows indexing, and hence it can manage large databases. We experimentally assessed the quality of the identification with a publicly available database formed by extracting audio from a collection of YouTube videos of 1,000 different speakers. With 20 second audio excerpts, we were able to identify a speaker with 97% accuracy when the recording environment is not controlled, and with 99% accuracy for controlled recording environments.

引用

页码：16803 / 16815

页数：12

共 50 条

[41] Speaker verification using the spectral and time parameters of voice signal
V. N. Sorokin
A. I. Tsyplikhin
Journal of Communications Technology and Electronics, 2010, 55 : 1561 - 1574
[42] A robust DNN model for text-independent speaker identification using non-speaker embeddings in diverse data conditions
Nirupam Shome
Banala Saritha
Richik Kashyap
Rabul Hussain Laskar
Neural Computing and Applications, 2023, 35 : 18933 - 18947
[43] An efficient speaker recognition using quantum neural network
Kaur, Rupinderdeep
Sharma, R. K.
Kumar, Parteek
MODERN PHYSICS LETTERS B, 2018, 32 (31):
[44] Using Approximate Entropy as a Speech Quality Measure for a Speaker Recognition System
Metzger, Richard A.
Doherty, John F.
Jenkins, David M.
2016 ANNUAL CONFERENCE ON INFORMATION SCIENCE AND SYSTEMS (CISS), 2016,
[45] A robust DNN model for text-independent speaker identification using non-speaker embeddings in diverse data conditions
Shome, Nirupam
Saritha, Banala
Kashyap, Richik
Laskar, Rabul Hussain
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (26): : 18933 - 18947
[46] NMF Based System for Speaker Identification
Costantini, Giovanni
Cesarini, Valerio
Paolizzo, Fabio
2021 IEEE INTERNATIONAL WORKSHOP ON METROLOGY FOR INDUSTRY 4.0 & IOT (IEEE METROIND4.0 & IOT), 2021, : 620 - 624
[47] The case for aural perceptual speaker identification
Hollien, Harry
Didla, Grace
Harnsberger, James D.
Hollien, Keith A.
FORENSIC SCIENCE INTERNATIONAL, 2016, 269 : 8 - 20
[48] Super-Dirichlet Mixture Models using Differential Line Spectral Frequencies for Text-Independent Speaker Identification
Ma, Zhanyu
Leijon, Arne
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2360 - +
[49] Speaker Identification Using Semi-supervised Learning
Fazakis, Nikos
Karlos, Stamatis
Kotsiantis, Sotiris
Sgarbas, Kyriakos
SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 389 - 396
[50] Novel Approach in Speaker Identification using SVM and GMM
Bourouba, H.
Korba, C. A.
Djemili, Rafik
CONTROL ENGINEERING AND APPLIED INFORMATICS, 2013, 15 (03): : 87 - 95

← 1 2 3 4 5 →