Speaker Recognition with Deep Learning Approaches: A Review

被引：0

作者：

Alenizi, Abdulrahman S. ^{[1
]}

Al-Karawi, Khamis A. ^{[2
]}

机构：

[1] PAAET, Shuwaikh Ind, Kuwait

[2] Diyala Univ, Baqubah, Diyala, Iraq

来源：

PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 5, ICICT 2024 | 2024年 / 1000卷

关键词：

Deep learning text independence; Feature extraction; Statistical models; Discriminative models; Speaker identification; And speaker verification; MACHINES; NOISE;

D O I：

10.1007/978-981-97-3289-0_39

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This article gives an overview of the methods for using deep learning to identify and verify speakers. Speaker recognition is an everyday use of speech technology. Many research initiatives have been carried out in the past few years, but little progress has been achieved. But just as deep learning techniques are replacing previous state-of-the-art approaches in speech recognition, they are also developing in most machine learning fields. Deep learning seems to have evolved into the most advanced speaker verification and identification technique. Most novel efforts start with the common x-vectors in addition to i-vectors. The increasing volume of data gathered makes the area where deep learning is most effective more accessible.

引用

页码：481 / 499

页数：19

共 50 条

[1] Speaker recognition based on deep learning: An overview
Bai, Zhongxin
Zhang, Xiao-Lei
NEURAL NETWORKS, 2021, 140 : 65 - 99
[2] Speaker Recognition through Deep Learning Techniques: A Comprehensive Review and Research Challenges
Shome N.
Sarkar A.
Ghosh A.K.
Laskar R.H.
Kashyap R.
Periodica polytechnica Electrical engineering and computer science, 2023, 67 (03): : 300 - 336
[3] Deep Learning Approaches for Continuous Sign Language Recognition: A Comprehensive Review
Khan, Asma
Jin, Seyong
Lee, Geon-Hee
Arzu, Gul E.
Dang, L. Minh
Nguyen, Tan N.
Choi, Woong
Moon, Hyeonjoon
IEEE ACCESS, 2025, 13 : 55524 - 55544
[4] Sign Language Recognition: A Comprehensive Review of Traditional and Deep Learning Approaches, Datasets, and Challenges
Tao, Tangfei
Zhao, Yizhe
Liu, Tianyu
Zhu, Jieli
IEEE ACCESS, 2024, 12 : 75034 - 75060
[5] Plant image recognition with deep learning: A review
Chen, Ying
Huang, Yiqi
Zhang, Zizhao
Wang, Zhen
Liu, Bo
Liu, Conghui
Huang, Cong
Dong, Shuangyu
Pu, Xuejiao
Wan, Fanghao
Qiao, Xi
Qian, Wanqiang
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 212
[6] Use of Speaker Recognition Approaches for Learning and Evaluating Embedding Representations of Musical Instrument Sounds
Shi, Xuan
Cooper, Erica
Yamagishi, Junichi
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 367 - 377
[7] Gender Recognition of Bangla Names Using Deep Learning Approaches
Kabir, Md. Humaun
Ahmad, Faruk
Hasan, Md. Al Mehedi
Shin, Jungpil
APPLIED SCIENCES-BASEL, 2023, 13 (01):
[8] Robust Deep Speaker Recognition: Learning Latent Representation with Joint Angular Margin Loss
Chowdhury, Labib
Zunair, Hasib
Mohammed, Nabeel
APPLIED SCIENCES-BASEL, 2020, 10 (21): : 1 - 17
[9] VoxCeleb2: Deep Speaker Recognition
Chung, Joon Son
Nagrani, Arsha
Zisserman, Andrew
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1086 - 1090
[10] Speaker Recognition in Uncontrolled Environent: A Review
Karamangala, Narendra
Kumaraswamy, Ratnaswamy
JOURNAL OF INTELLIGENT SYSTEMS, 2013, 22 (01) : 49 - 65

← 1 2 3 4 5 →