Speaker Recognition with Deep Learning Approaches: A Review

被引:0
|
作者
Alenizi, Abdulrahman S. [1 ]
Al-Karawi, Khamis A. [2 ]
机构
[1] PAAET, Shuwaikh Ind, Kuwait
[2] Diyala Univ, Baqubah, Diyala, Iraq
来源
PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 5, ICICT 2024 | 2024年 / 1000卷
关键词
Deep learning text independence; Feature extraction; Statistical models; Discriminative models; Speaker identification; And speaker verification; MACHINES; NOISE;
D O I
10.1007/978-981-97-3289-0_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article gives an overview of the methods for using deep learning to identify and verify speakers. Speaker recognition is an everyday use of speech technology. Many research initiatives have been carried out in the past few years, but little progress has been achieved. But just as deep learning techniques are replacing previous state-of-the-art approaches in speech recognition, they are also developing in most machine learning fields. Deep learning seems to have evolved into the most advanced speaker verification and identification technique. Most novel efforts start with the common x-vectors in addition to i-vectors. The increasing volume of data gathered makes the area where deep learning is most effective more accessible.
引用
收藏
页码:481 / 499
页数:19
相关论文
共 50 条
  • [31] Speaker Recognition of Fiber-Optic External Fabry-Perot Interferometric Microphone Based on Deep Learning
    Wang, Yangfeng
    Wan, Shengpeng
    Zhang, Sijun
    Yu, Junsong
    IEEE SENSORS JOURNAL, 2022, 22 (13) : 12906 - 12912
  • [32] Data preprocessing and feature selection techniques in gait recognition: A comparative study of machine learning and deep learning approaches
    Parashar, Anubha
    Parashar, Apoorva
    Ding, Weiping
    Shabaz, Mohammad
    Rida, Imad
    PATTERN RECOGNITION LETTERS, 2023, 172 : 65 - 73
  • [33] Denoising and segmentation in medical image analysis: A comprehensive review on machine learning and deep learning approaches
    Ravi Ranjan Kumar
    Rahul Priyadarshi
    Multimedia Tools and Applications, 2025, 84 (12) : 10817 - 10875
  • [34] Review of various stages in speaker recognition system, performance measures and recognition toolkits
    Pawar, Rupali V.
    Jalnekar, Rajesh M.
    Chitode, Janardan S.
    ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 2018, 94 (02) : 247 - 257
  • [35] Review of various stages in speaker recognition system, performance measures and recognition toolkits
    Rupali V. Pawar
    Rajesh M. Jalnekar
    Janardan S. Chitode
    Analog Integrated Circuits and Signal Processing, 2018, 94 : 247 - 257
  • [36] A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments
    Jung, Youngmoon
    Choi, Yeunju
    Lim, Hyungjun
    Kim, Hoirin
    IEEE ACCESS, 2020, 8 : 175448 - 175466
  • [37] Self-learning speaker identification for enhanced speech recognition
    Herbig, Tobias
    Gerl, Franz
    Minker, Wolfgang
    COMPUTER SPEECH AND LANGUAGE, 2012, 26 (03) : 210 - 227
  • [38] Performance enhancement of text-independent speaker recognition in noisy and reverberation conditions using Radon transform with deep learning
    El-Moneim S.A.
    El-Mordy E.A.
    Nassar M.A.
    Dessouky M.I.
    Ismail N.A.
    El-Fishawy A.S.
    El-Dolil S.
    El-Dokany I.M.
    El-Samie F.E.A.
    International Journal of Speech Technology, 2022, 25 (03) : 679 - 687
  • [39] Deep Learning and Machine Learning Techniques Applied to Speaker Identification on Small Datasets
    Manfron, Enrico
    Teixeira, Joao Paulo
    Minetto, Rodrigo
    OPTIMIZATION, LEARNING ALGORITHMS AND APPLICATIONS, PT II, OL2A 2023, 2024, 1982 : 195 - 210
  • [40] A review of research on micro-expression recognition algorithms based on deep learning
    Zhang, Fan
    Chai, Lin
    Neural Computing and Applications, 2024, 36 (29) : 17787 - 17828