Speaker Recognition with Deep Learning Approaches: A Review

被引:0
|
作者
Alenizi, Abdulrahman S. [1 ]
Al-Karawi, Khamis A. [2 ]
机构
[1] PAAET, Shuwaikh Ind, Kuwait
[2] Diyala Univ, Baqubah, Diyala, Iraq
来源
PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 5, ICICT 2024 | 2024年 / 1000卷
关键词
Deep learning text independence; Feature extraction; Statistical models; Discriminative models; Speaker identification; And speaker verification; MACHINES; NOISE;
D O I
10.1007/978-981-97-3289-0_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article gives an overview of the methods for using deep learning to identify and verify speakers. Speaker recognition is an everyday use of speech technology. Many research initiatives have been carried out in the past few years, but little progress has been achieved. But just as deep learning techniques are replacing previous state-of-the-art approaches in speech recognition, they are also developing in most machine learning fields. Deep learning seems to have evolved into the most advanced speaker verification and identification technique. Most novel efforts start with the common x-vectors in addition to i-vectors. The increasing volume of data gathered makes the area where deep learning is most effective more accessible.
引用
收藏
页码:481 / 499
页数:19
相关论文
共 50 条
  • [1] Speaker recognition based on deep learning: An overview
    Bai, Zhongxin
    Zhang, Xiao-Lei
    NEURAL NETWORKS, 2021, 140 : 65 - 99
  • [2] Speaker Recognition through Deep Learning Techniques: A Comprehensive Review and Research Challenges
    Shome N.
    Sarkar A.
    Ghosh A.K.
    Laskar R.H.
    Kashyap R.
    Periodica polytechnica Electrical engineering and computer science, 2023, 67 (03): : 300 - 336
  • [3] Deep Learning Approaches for Continuous Sign Language Recognition: A Comprehensive Review
    Khan, Asma
    Jin, Seyong
    Lee, Geon-Hee
    Arzu, Gul E.
    Dang, L. Minh
    Nguyen, Tan N.
    Choi, Woong
    Moon, Hyeonjoon
    IEEE ACCESS, 2025, 13 : 55524 - 55544
  • [4] Sign Language Recognition: A Comprehensive Review of Traditional and Deep Learning Approaches, Datasets, and Challenges
    Tao, Tangfei
    Zhao, Yizhe
    Liu, Tianyu
    Zhu, Jieli
    IEEE ACCESS, 2024, 12 : 75034 - 75060
  • [5] Plant image recognition with deep learning: A review
    Chen, Ying
    Huang, Yiqi
    Zhang, Zizhao
    Wang, Zhen
    Liu, Bo
    Liu, Conghui
    Huang, Cong
    Dong, Shuangyu
    Pu, Xuejiao
    Wan, Fanghao
    Qiao, Xi
    Qian, Wanqiang
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 212
  • [6] Use of Speaker Recognition Approaches for Learning and Evaluating Embedding Representations of Musical Instrument Sounds
    Shi, Xuan
    Cooper, Erica
    Yamagishi, Junichi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 367 - 377
  • [7] Gender Recognition of Bangla Names Using Deep Learning Approaches
    Kabir, Md. Humaun
    Ahmad, Faruk
    Hasan, Md. Al Mehedi
    Shin, Jungpil
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [8] Robust Deep Speaker Recognition: Learning Latent Representation with Joint Angular Margin Loss
    Chowdhury, Labib
    Zunair, Hasib
    Mohammed, Nabeel
    APPLIED SCIENCES-BASEL, 2020, 10 (21): : 1 - 17
  • [9] VoxCeleb2: Deep Speaker Recognition
    Chung, Joon Son
    Nagrani, Arsha
    Zisserman, Andrew
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1086 - 1090
  • [10] Speaker Recognition in Uncontrolled Environent: A Review
    Karamangala, Narendra
    Kumaraswamy, Ratnaswamy
    JOURNAL OF INTELLIGENT SYSTEMS, 2013, 22 (01) : 49 - 65