ASVtorch toolkit: Speaker verification with deep neural networks

被引：4

作者：

Lee, Kong Aik ^{[1
]}

Vestman, Ville ^{[2
]}

Kinnunen, Tomi ^{[2
]}

机构：

[1] ASTAR, Inst Infocomm Res, Singapore, Singapore

[2] Univ Eastern Finland, Computat Speech Grp, Joensuu, Finland

来源：

SOFTWAREX | 2021年 / 14卷

基金：

芬兰科学院;

关键词：

Speaker recognition; PyTorch; Deep learning; RECOGNITION;

D O I：

10.1016/j.softx.2021.100697

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

The human voice differs substantially between individuals. This facilitates automatic speaker verification (ASV) - recognizing a person from his/her voice. ASV accuracy has substantially increased throughout the past decade due to recent advances in machine learning, particularly deep learning methods. An unfortunate downside has been substantially increased complexity of ASV systems. To help non experts to kick-start reproducible ASV development, a state-of-the-art toolkit implementing various ASV pipelines and functionalities is required. To this end, we introduce a new open-source toolkit, ASVtorch, implemented in Python using the widely used PyTorch machine learning framework. (C) 2021 The Author(s). Published by Elsevier B.V.

引用

页数：6

共 50 条

[31] PLDA inspired Siamese networks for speaker verification
Ramoji, Shreyas
Krishnan, Prashant
Ganapathy, Sriram
COMPUTER SPEECH AND LANGUAGE, 2022, 76
[32] Combining Deep Speaker Specific Representations with GMM-SVM for Speaker Verification
Price, Ryan
Biswas, Sangeeta
Shinoda, Koichi
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2787 - 2791
[33] Automatic Chinese Handwriting Verification Algorithm Using Deep Neural Networks
Lee, Chi-Chang
Ding, Jian-Jiun
2019 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ISPACS), 2019,
[34] Testing and Verification of the Deep Neural Networks Against Sparse Pixel Defects
Szczepankiewicz, Michal
Radlak, Krystian
Szczepankiewicz, Karolina
Popowicz, Adam
Zawistowski, Pawel
COMPUTER SAFETY, RELIABILITY, AND SECURITY, SAFECOMP 2022 WORKSHOPS, 2022, 13415 : 71 - 82
[35] SPEAKER INDEPENDENT DIARIZATION FOR CHILD LANGUAGE ENVIRONMENT ANALYSIS USING DEEP NEURAL NETWORKS
Najafian, Maryam
Hansen, John H. L.
2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 114 - 120
[36] Duration mismatch compensation using four-covariance model and deep neural network for speaker verification
Bousquet, Pierre-Michel
Rouvier, Mickael
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1547 - 1551
[37] IMPROVING SPEAKER RECOGNITION PERFORMANCE IN THE DOMAIN ADAPTATION CHALLENGE USING DEEP NEURAL NETWORKS
Garcia-Romero, Daniel
Zhang, Xiaohui
McCree, Alan
Povey, Daniel
2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014, 2014, : 378 - 383
[38] Deep Vein: Novel Finger Vein Verification Methods Based on Deep Convolutional Neural Networks
Huang, Houjun
Liu, Shilei
Zheng, He
Ni, Liao
Zhang, Yi
Li, Wenxin
2017 IEEE INTERNATIONAL CONFERENCE ON IDENTITY, SECURITY AND BEHAVIOR ANALYSIS (ISBA), 2017,
[39] Text-Independent Speaker Verification Using Lightweight 3D Convolutional Neural Networks
Chen, Jyun-Yan
Jeng, Jin-Tsong
2024 INTERNATIONAL CONFERENCE ON SYSTEM SCIENCE AND ENGINEERING, ICSSE 2024, 2024,
[40] Spline Interpolation and Deep Neural Networks as Feature Extractors for Signature Verification Purposes
Wei, Wei
Ke, Qiao
Polap, Dawid
Wozniak, Marcin
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (03) : 2152 - 2161

← 1 2 3 4 5 →