Speaker Recognition with Deep Learning Approaches: A Review

Cited by: 0
Authors
Alenizi, Abdulrahman S. [1]
Al-Karawi, Khamis A. [2]
Affiliations
[1] PAAET, Shuwaikh Ind, Kuwait
[2] Diyala Univ, Baqubah, Diyala, Iraq
Source
PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 5, ICICT 2024 | 2024, vol. 1000
Keywords
Deep learning; Text independence; Feature extraction; Statistical models; Discriminative models; Speaker identification; Speaker verification; Machines; Noise
DOI
10.1007/978-981-97-3289-0_39
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This article reviews deep learning methods for speaker identification and verification. Speaker recognition is an everyday application of speech technology. Many research initiatives have been carried out in the past few years, but little progress has been achieved. However, just as deep learning techniques are replacing previous state-of-the-art approaches in speech recognition, they are also advancing in most machine learning fields. Deep learning appears to have become the most advanced technique for speaker verification and identification. Most recent work builds on the common x-vectors and i-vectors. The increasing volume of collected data makes the settings where deep learning is most effective more accessible.
Pages: 481-499
Page count: 19