Efficient speaker identification using spectral entropy

被引：7

作者：

Luque-Suarez, Fernando ^{[1
]}

Camarena-Ibarrola, Antonio ^{[2
]}

Chavez, Edgar ^{[1
]}

机构：

[1] CICESE, Ensenada, Baja California, Mexico

[2] Univ Michoacana, Morelia, Michoacan, Mexico

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2019年 / 78卷 / 12期

关键词：

Speaker recognition; Speaker identification; Entropygrams; RECOGNITION;

D O I：

10.1007/s11042-018-7035-9

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In voice recognition, the two main problems are speech recognition (what was said), and speaker recognition (who was speaking). The usual method for speaker recognition is to postulate a model where the speaker identity corresponds to the parameters of the model, which estimation could be time-consuming when the number of candidate speakers is large. In this paper, we model the speaker as a high dimensional point cloud of entropy-based features, extracted from the speech signal. The method allows indexing, and hence it can manage large databases. We experimentally assessed the quality of the identification with a publicly available database formed by extracting audio from a collection of YouTube videos of 1,000 different speakers. With 20 second audio excerpts, we were able to identify a speaker with 97% accuracy when the recording environment is not controlled, and with 99% accuracy for controlled recording environments.

引用

页码：16803 / 16815

页数：13

共 50 条

[41] A Novel Speech Enhancement Method Using Fourier Series Decomposition and Spectral Subtraction for Robust Speaker Identification
Ali I. Siam
Heba A. El-khobby
Mustafa M. Abd Elnaby
Hatem S. Abdelkader
Fathi E. Abd El-Samie
Wireless Personal Communications, 2019, 108 : 1055 - 1068
[42] Real-Time Speaker Identification Using Speaker Model Distance
Zeinali, Hossein
Sameti, Hossein
Hadian, Hossein
2015 23RD IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2015, : 643 - 647
[43] EFFICIENT FEATURE EXTRACTION OF SPEAKER IDENTIFICATION USING PHONEME MEAN F-RATIO FOR CHINESE
Zhao, Chen
Wang, Hongcui
Hyon, Songgun
Wei, Jianguo
Dang, Jianwu
2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, 2012, : 345 - 348
[44] Speaker Identification for Business-Card-Type Sensors
Yamaguchi, Shunpei
Oshima, Ritsuko
Oshima, Jun
Shiina, Ryota
Fujihashi, Takuya
Saruwatari, Shunsuke
Watanabe, Takashi
IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY, 2021, 2 : 216 - 226
[45] Efficient Parameterization for Automatic Speaker Recognition Using Support Vector Machines
Chakroun, Rania
Frikha, Mondher
Zouari, Leila Beltaifa
INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA 2016), 2017, 557 : 659 - 666
[46] Joint Speech Enhancement and Speaker Identification Using Monte Carlo Methods
Maina, Ciira Wa
Walsh, John MacLaren
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1359 - 1362
[47] EmoBack: Backdoor Attacks Against Speaker Identification Using Emotional Prosody
Schoof, Coen
Koffas, Stefanos
Conti, Mauro
Picek, Stjepan
PROCEEDINGS OF THE 2024 WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, AISEC 2024, 2024, : 137 - 148
[48] Boosting Speaker Identification Performance Using a Frame Level Based Algorithm
Djemili, Rafik
Bourouba, Hocine
Korba, M. C. Amara
O'Shaughnessy, Douglas
2015 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, SIGNAL PROCESSING, AND THEIR APPLICATIONS (ICCSPA'15), 2015,
[49] Forensic Speaker Identification: a Tutorial
Univaso, Pedro
IEEE LATIN AMERICA TRANSACTIONS, 2017, 15 (09) : 1754 - 1770
[50] Robust Automatic Speaker Identification System Using Shuffled MFCC Features
Barhoush, Mahdi
Hallawa, Ahmed
Schmeink, Anke
2021 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLIED NETWORK TECHNOLOGIES (ICMLANT II), 2021, : 28 - 33

← 1 2 3 4 5 →