Speaker-Aware Anti-spoofing

被引:2
作者
Liu, Xuechen [1 ,2 ]
Sahidullah, Md [3 ]
Lee, Kong Aik [4 ]
Kinnunen, Tomi [1 ]
机构
[1] Univ Eastern Finland, Sch Comp, Joensuu, Finland
[2] Univ Lorraine, CNRS, INRIA, LORIA, F-54000 Nancy, France
[3] TCG CREST, Inst Adv Intelligence, Kolkata, W Bengal, India
[4] ASTAR, Inst Infocomm Res, Singapore, Singapore
来源
INTERSPEECH 2023 | 2023年
关键词
Speaker Verification; Speaker-Aware Anti-Spoofing; ASVspoof; Deepfake; Spoofing Countermeasures; COUNTERMEASURES; VERIFICATION; ASVSPOOF;
D O I
10.21437/Interspeech.2023-1323
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We address speaker-aware anti-spoofing, where prior knowledge of the target speaker is incorporated into a voice spoofing countermeasure (CM). In contrast to the frequently used speaker-independent solutions, we train the CM in a speaker-conditioned way. As a proof of concept, we consider speakeraware extension to the state-of-the-art AASIST (audio anti-spoofing using integrated spectro-temporal graph attention networks) model. To this end, we consider two alternative strategies to incorporate target speaker information at the frame and utterance levels, respectively. The experimental results on a custom protocol based on ASVspoof 2019 dataset indicate the efficiency of the speaker information via enrollment: we obtain maximum relative improvements of 25.1% and 11.6% in equal error rate (EER) and minimum tandem detection cost function (t-DCF) over a speaker-independent baseline, respectively.
引用
收藏
页码:2498 / 2502
页数:5
相关论文
共 36 条
[1]  
analyticsinsight, 2022, TOP 5 DEEPF SCAMS ST
[2]  
[Anonymous], 2016, ISO/IEC 30107
[3]   EAT: ENHANCED ASR-TTS FOR SELF-SUPERVISED SPEECH RECOGNITION [J].
Baskar, Murali Karthick ;
Burget, Lukas ;
Watanabe, Shinji ;
Astudillo, Ramon Fernandez ;
Cernocky, Jan Honza .
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :6753-6757
[4]   Multi-Channel Training for End-to-End Speaker Recognition under Reverberant and Noisy Environment [J].
Cai, Danwei ;
Qin, Xiaoyi ;
Li, Ming .
INTERSPEECH 2019, 2019, :4365-4369
[5]  
Castan D., 2022, P SPEAK LANG REC WOR, P62
[6]  
Chen T., 2020, Odyssey, P132
[7]  
Chung J., 2019, ISCA ARCH
[8]   ECAPA-TDNN: Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification [J].
Desplanques, Brecht ;
Thienpondt, Jenthe ;
Demuynck, Kris .
INTERSPEECH 2020, 2020, :3830-3834
[9]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[10]  
Gupta Vishwa, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P6334, DOI 10.1109/ICASSP.2014.6854823