ASVtorch toolkit: Speaker verification with deep neural networks

被引:4
|
作者
Lee, Kong Aik [1 ]
Vestman, Ville [2 ]
Kinnunen, Tomi [2 ]
机构
[1] ASTAR, Inst Infocomm Res, Singapore, Singapore
[2] Univ Eastern Finland, Computat Speech Grp, Joensuu, Finland
基金
芬兰科学院;
关键词
Speaker recognition; PyTorch; Deep learning; RECOGNITION;
D O I
10.1016/j.softx.2021.100697
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The human voice differs substantially between individuals. This facilitates automatic speaker verification (ASV) - recognizing a person from his/her voice. ASV accuracy has substantially increased throughout the past decade due to recent advances in machine learning, particularly deep learning methods. An unfortunate downside has been substantially increased complexity of ASV systems. To help non experts to kick-start reproducible ASV development, a state-of-the-art toolkit implementing various ASV pipelines and functionalities is required. To this end, we introduce a new open-source toolkit, ASVtorch, implemented in Python using the widely used PyTorch machine learning framework. (C) 2021 The Author(s). Published by Elsevier B.V.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Voice-quality Features for Deep Neural Network Based Speaker Verification Systems
    Woubie, Abraham
    Koivisto, Lauri
    Backstrom, Tom
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 176 - 180
  • [22] ASV-SUBTOOLS: OPEN SOURCE TOOLKIT FOR AUTOMATIC SPEAKER VERIFICATION
    Tong, Fuchuan
    Zhao, Miao
    Zhou, Jianfeng
    Lu, Hao
    Li, Zheng
    Li, Lin
    Hong, Qingyang
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6184 - 6188
  • [23] Automatic text-independent speaker verification using convolutional deep belief network
    Rakhmanenko, I. A.
    Shelupanov, A. A.
    Kostyuchenko, E. Y.
    COMPUTER OPTICS, 2020, 44 (04) : 596 - +
  • [24] Vietnamese Speaker Verification With Mel-Scale Filter Bank Energies and Deep Learning
    Nguyen, Thi-Thanh-Mai
    Nguyen, Duc-Dung
    Luong, Chi-Mai
    IEEE ACCESS, 2024, 12 : 150114 - 150122
  • [25] Audio Replay Attack Detection for Speaker Verification System Using Convolutional Neural Networks
    Kemanth, P. J.
    Supanekar, Sujata
    Koolagudi, Shashidhar G.
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT II, 2019, 11942 : 445 - 453
  • [26] Offline Handwritten Signature Verification Using Deep Neural Networks
    Lopes, Jose A. P.
    Baptista, Bernardo
    Lavado, Nuno
    Mendes, Mateus
    ENERGIES, 2022, 15 (20)
  • [27] An MILP Encoding for Efficient Verification of Quantized Deep Neural Networks
    Mistry, Samvid
    Saha, Indranil
    Biswas, Swarnendu
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 41 (11) : 4445 - 4456
  • [28] Speaker Verification based on extraction of Deep Features
    Mitsianis, Evangelos
    Spyrou, Evaggelos
    Giannakopoulos, Theodore
    10TH HELLENIC CONFERENCE ON ARTIFICIAL INTELLIGENCE (SETN 2018), 2018,
  • [29] DEEP SPEAKER REPRESENTATION USING ORTHOGONAL DECOMPOSITION AND RECOMBINATION FOR SPEAKER VERIFICATION
    Kim, Insoo
    Kim, Kyuhong
    Kim, Jiwhan
    Choi, Changkyu
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6126 - 6130
  • [30] Speaker2Vec: Unsupervised Learning and Adaptation of a Speaker Manifold using Deep Neural Networks with an Evaluation on Speaker Segmentation
    Jati, Arindam
    Georgiou, Panayiotis
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3567 - 3571