Efficient speaker identification using spectral entropy

被引：0

作者：

Fernando Luque-Suárez

Antonio Camarena-Ibarrola

Edgar Chávez

机构：

[1] CICESE,

[2] Universidad Michoacana,undefined

来源：

Multimedia Tools and Applications | 2019年 / 78卷

关键词：

Speaker recognition; Speaker identification; Entropygrams;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In voice recognition, the two main problems are speech recognition (what was said), and speaker recognition (who was speaking). The usual method for speaker recognition is to postulate a model where the speaker identity corresponds to the parameters of the model, which estimation could be time-consuming when the number of candidate speakers is large. In this paper, we model the speaker as a high dimensional point cloud of entropy-based features, extracted from the speech signal. The method allows indexing, and hence it can manage large databases. We experimentally assessed the quality of the identification with a publicly available database formed by extracting audio from a collection of YouTube videos of 1,000 different speakers. With 20 second audio excerpts, we were able to identify a speaker with 97% accuracy when the recording environment is not controlled, and with 99% accuracy for controlled recording environments.

引用

页码：16803 / 16815

页数：12

共 50 条

[31] Using Placement and Name for Speaker Identification in Captioning
Vy, Quoc V.
Fels, Deborah I.
COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS, PROCEEDINGS, PT 1, 2010, 6179 : 247 - 254
[32] Speaker identification using orthogonal and discriminative features
Davarpanah, SH
Mirzaei, A
Ziaei, A
IWSSIP 2005: Proceedings of the 12th International Worshop on Systems, Signals & Image Processing, 2005, : 293 - 296
[33] Using Avatars for Improving Speaker Identification in Captioning
Vy, Quoc V.
Fels, Deborah I.
HUMAN-COMPUTER INTERACTION - INTERACT 2009, PT II, PROCEEDINGS, 2009, 5727 : 916 - 919
[34] An efficient speaker identification framework based on Mask R-CNN classifier parameter optimized using hosted cuckoo optimization (HCO)
Gaurav
Bhardwaj S.
Agarwal R.
Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (10) : 13613 - 13625
[35] EFFICIENT SPEAKER VERIFICATION SYSTEM USING SPEAKER MODEL CLUSTERING FOR T AND Z NORMALIZATIONS
Ravulakollu, Kiran
Apsingekar, Vijendra Raj
De Leon, Phillip L.
42ND ANNUAL 2008 IEEE INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY, PROCEEDINGS, 2008, : 56 - 62
[36] ENERGY-EFFICIENT SPEAKER IDENTIFICATION WITH LOW-PRECISION NETWORKS
Koppula, Skanda
Glass, James
Chandrakasan, Anantha P.
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2246 - 2250
[37] Efficient Low-Complexity Speaker Identification based on a SpeakerGAN Approach
Spisiak, Michal
Jakubec, Maros
Jarina, Roman
Kasak, Peter
2024 34TH INTERNATIONAL CONFERENCE RADIOELEKTRONIKA, RADIOELEKTRONIKA 2024, 2024,
[38] EFFICIENT SCORE NORMALIZATION FOR SPEAKER RECOGNITION
Aronowitz, Hagai
Aronowitz, Vanessia
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4402 - 4405
[39] An Improved GMM-based Clustering Algorithm for Efficient Speaker Identification
Lin, Wenyong
PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 1490 - 1493
[40] Efficient speaker identification from speech transmitted over Bluetooth networks
Khalil, Ali
Elnaby, Mustafa
Saad, Elsayed
Al-Nahari, Azzam
Al-Zubi, Nayel
El-Bendary, Mohsen
El-Samie, Fathi
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (04) : 409 - 416

← 1 2 3 4 5 →