Speaker Recognition with Deep Learning Approaches: A Review

Cited by: 0

Authors:

Alenizi, Abdulrahman S. [1]

Al-Karawi, Khamis A. [2]

Affiliations:

[1] PAAET, Shuwaikh Ind, Kuwait

[2] Diyala Univ, Baqubah, Diyala, Iraq

Source:

PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 5, ICICT 2024 | 2024 / Vol. 1000

Keywords:

Deep learning text independence; Feature extraction; Statistical models; Discriminative models; Speaker identification; Speaker verification; MACHINES; NOISE

DOI:

10.1007/978-981-97-3289-0_39

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This article gives an overview of deep learning methods for speaker identification and verification. Speaker recognition is an everyday use of speech technology. Many research initiatives have been carried out in the past few years, but little progress has been achieved. However, just as deep learning techniques are replacing previous state-of-the-art approaches in speech recognition, they are also advancing in most machine learning fields. Deep learning seems to have evolved into the most advanced technique for speaker verification and identification. Most novel efforts start from the common x-vectors, in addition to i-vectors. The increasing volume of data gathered makes the area where deep learning is most effective more accessible.

Pages: 481-499

Page count: 19

50 items in total

[41] Review of speaker recognition : Concepts, challenges, architectures, and future directions
Nehra, Neelam
Sharma, Geetanjali
Kumar, Parveen
Sheoran, Dinesh
Yadav, Amita
JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2025, 46 (01) : 121 - 132
[42] A Review on Feature Extraction for Speaker Recognition under Degraded Conditions
Disken, Gokay
Tufekci, Zekeriya
Saribulut, Lutfu
Cevik, Ulus
IETE TECHNICAL REVIEW, 2017, 34 (03) : 321 - 332
[43] Feature extraction and modelling techniques for multilingual speaker recognition: a review
Nagaraja, B. G.
Jayanna, H. S.
INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2016, 9 (02) : 67 - 78
[44] VAD, feature extraction and modelling techniques for speaker recognition: a review
Jainar, Spoorti J.
Sale, Pritam Limbaji
Nagaraja, B. G.
INTERNATIONAL JOURNAL OF SIGNAL AND IMAGING SYSTEMS ENGINEERING, 2020, 12 (1-2) : 1 - 18
[45] FULL-INFO TRAINING FOR DEEP SPEAKER FEATURE LEARNING
Li, Lantian
Tang, Zhiyuan
Wang, Dong
Zheng, Thomas Fang
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5369 - 5373
[46] Deep learning approaches in flow visualization
Can Liu
Ruike Jiang
Datong Wei
Changhe Yang
Yanda Li
Fang Wang
Xiaoru Yuan
Advances in Aerodynamics, 4
[47] An overview of text-independent speaker recognition: From features to supervectors
Kinnunen, Tomi
Li, Haizhou
SPEECH COMMUNICATION, 2010, 52 (01) : 12 - 40
[48] Deep learning approaches in flow visualization
Liu, Can
Jiang, Ruike
Wei, Datong
Yang, Changhe
Li, Yanda
Wang, Fang
Yuan, Xiaoru
ADVANCES IN AERODYNAMICS, 2022, 4 (01)
[49] Learning pairwise SVM on hierarchical deep features for ear recognition
Omara, Ibrahim
Wu, Xiaohe
Zhang, Hongzhi
Du, Yong
Zuo, Wangmeng
IET BIOMETRICS, 2018, 7 (06) : 557 - 566
[50] Deep Learning for Traffic Scene Understanding: A Review
Dolatyabi, Parya
Regan, Jacob
Khodayar, Mahdi
IEEE ACCESS, 2025, 13 : 13187 - 13237