共 32 条
[1]
Alumäe T, 2018, IEEE W SP LANG TECH, P1066, DOI 10.1109/SLT.2018.8639601
[3]
S4D: Speaker Diarization Toolkit in Python']Python
[J].
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES,
2018,
:1368-1372
[4]
Brown A, 2022, Arxiv, DOI arXiv:2201.04583
[5]
Cai D., 2022, IEEE-ACM T AUDIO SPE
[6]
AN ITERATIVE FRAMEWORK FOR SELF-SUPERVISED DEEP SPEAKER REPRESENTATION LEARNING
[J].
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021),
2021,
:6728-6732
[7]
Chung J.S., 2018, arXiv
[8]
Out of Time: Automated Lip Sync in the Wild
[J].
COMPUTER VISION - ACCV 2016 WORKSHOPS, PT II,
2017, 10117
:251-263
[9]
Front-End Factor Analysis for Speaker Verification
[J].
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING,
2011, 19 (04)
:788-798
[10]
ArcFace: Additive Angular Margin Loss for Deep Face Recognition
[J].
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019),
2019,
:4685-4694