ASVtorch toolkit: Speaker verification with deep neural networks

Cited by: 4
Authors
Lee, Kong Aik [1]
Vestman, Ville [2]
Kinnunen, Tomi [2]
Affiliations
[1] ASTAR, Inst Infocomm Res, Singapore, Singapore
[2] Univ Eastern Finland, Computat Speech Grp, Joensuu, Finland
Funding
Academy of Finland;
Keywords
Speaker recognition; PyTorch; Deep learning; RECOGNITION;
DOI
10.1016/j.softx.2021.100697
Chinese Library Classification
TP31 [Computer software];
Discipline classification code
081202; 0835;
Abstract
The human voice differs substantially between individuals. This facilitates automatic speaker verification (ASV): recognizing a person from his or her voice. ASV accuracy has increased substantially over the past decade owing to advances in machine learning, particularly deep learning methods. An unfortunate downside has been a substantial increase in the complexity of ASV systems. To help non-experts kick-start reproducible ASV development, a state-of-the-art toolkit implementing various ASV pipelines and functionalities is required. To this end, we introduce a new open-source toolkit, ASVtorch, implemented in Python using the widely used PyTorch machine learning framework. © 2021 The Author(s). Published by Elsevier B.V.
Pages: 6