Speaker Recognition with Deep Learning Approaches: A Review

被引:0
|
作者
Alenizi, Abdulrahman S. [1 ]
Al-Karawi, Khamis A. [2 ]
机构
[1] PAAET, Shuwaikh Ind, Kuwait
[2] Diyala Univ, Baqubah, Diyala, Iraq
来源
PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 5, ICICT 2024 | 2024年 / 1000卷
关键词
Deep learning text independence; Feature extraction; Statistical models; Discriminative models; Speaker identification; And speaker verification; MACHINES; NOISE;
D O I
10.1007/978-981-97-3289-0_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article gives an overview of the methods for using deep learning to identify and verify speakers. Speaker recognition is an everyday use of speech technology. Many research initiatives have been carried out in the past few years, but little progress has been achieved. But just as deep learning techniques are replacing previous state-of-the-art approaches in speech recognition, they are also developing in most machine learning fields. Deep learning seems to have evolved into the most advanced speaker verification and identification technique. Most novel efforts start with the common x-vectors in addition to i-vectors. The increasing volume of data gathered makes the area where deep learning is most effective more accessible.
引用
收藏
页码:481 / 499
页数:19
相关论文
共 50 条
  • [21] A comprehensive review on deep learning approaches in wind forecasting applications
    Wu, Zhou
    Luo, Gan
    Yang, Zhile
    Guo, Yuanjun
    Li, Kang
    Xue, Yusheng
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2022, 7 (02) : 129 - 143
  • [22] Evaluation on Data - Speaker Dependability Approaches for Speech Recognition Tasks
    Saod, Aini Hafizah Mohd
    Sulaiman, Siti Noraini
    Harron, Nur Athiqah
    Ahmad, Azizah
    Ramlan, Siti Azura
    Ramli, Dzati Athiar
    2012 IEEE INTERNATIONAL CONFERENCE ON CONTROL SYSTEM, COMPUTING AND ENGINEERING (ICCSCE 2012), 2012, : 254 - 258
  • [23] A Review of Micro-expression Recognition based on Deep Learning
    Zhang, He
    Zhang, Hanling
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [24] From Geometry to Deep Learning: An Overview of Finger Knuckle Biometrics Recognition Approaches
    Sumalatha, U.
    Prakasha, K. Krishna
    Prabhu, Srikanth
    Nayak, Vinod C.
    IEEE ACCESS, 2024, 12 : 175414 - 175444
  • [25] Feature Extraction Methods for Speaker Recognition: A Review
    Chaudhary, Gopal
    Srivastava, Smriti
    Bhardwaj, Saurabh
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2017, 31 (12)
  • [26] Mixture Representation Learning for Deep Speaker Embedding
    Lin, Weiwei
    Mak, Man-Wai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 968 - 978
  • [27] Backbones-review: Feature extractor networks for deep learning and deep reinforcement learning approaches in computer vision
    Elharrouss, Omar
    Akbari, Younes
    Almadeed, Noor
    Al-Maadeed, Somaya
    COMPUTER SCIENCE REVIEW, 2024, 53
  • [28] Speaker recognition with hybrid features from a deep belief network
    Ali, Hazrat
    Tran, Son N.
    Benetos, Emmanouil
    Garcez, Artur S. d'Avila
    NEURAL COMPUTING & APPLICATIONS, 2018, 29 (06) : 13 - 19
  • [29] Discriminative Deep Audio Feature Embedding for Speaker Recognition in the Wild
    Bianco, Simone
    Cereda, Elia
    Napoletano, Paolo
    2018 IEEE 8TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - BERLIN (ICCE-BERLIN), 2018,
  • [30] A review of automatic recognition technology for bird vocalizations in the deep learning era
    Xie, Jiangjian
    Zhong, Yujie
    Zhang, Junguo
    Liu, Shuo
    Ding, Changqing
    Triantafyllopoulos, Andreas
    ECOLOGICAL INFORMATICS, 2023, 73