Detecting Replay Attacks Using Multi-Channel Audio: A Neural Network-Based Method

被引:16
作者
Gong, Yuan [1 ]
Yang, Jian [1 ]
Poellabauer, Christian [1 ]
机构
[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46637 USA
基金
美国国家科学基金会;
关键词
Microphone arrays; Task analysis; Feature extraction; Speech recognition; Array signal processing; Convolution; Microphone array signal processing; voice anti-spoofing; replay attack; beamforming; INSTANTANEOUS FREQUENCY; FEATURES;
D O I
10.1109/LSP.2020.2996908
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the rapidly growing number of security-sensitive systems that use voice as the primary input, it becomes increasingly important to address these systems' potential vulnerability to replay attacks. Previous efforts to address this concern have focused primarily on single-channel audio. In this paper, we introduce a novel neural network-based replay attack detection model that further leverages spatial information of multi-channel audio and is able to significantly improve the replay attack detection performance.
引用
收藏
页码:920 / 924
页数:5
相关论文
共 40 条
[11]   ResNet and Model Fusion for Automatic Spoofing Detection [J].
Chen, Zhuxin ;
Xie, Zhifeng ;
Zhang, Weibin ;
Xu, Xiangmin .
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, :102-106
[12]  
Diao W, 2014, P 4 ACM WORKSH SEC P, P63, DOI DOI 10.1145/2666620.2666623
[13]  
Gong Y, 2018, INT C LEARN REPR, P1
[14]   ReMASC: Realistic Replay Attack Corpus for Voice Controlled Systems [J].
Gong, Yuan ;
Yang, Jian ;
Huber, Jacob ;
MacKnight, Mitchell ;
Poellabauer, Christian .
INTERSPEECH 2019, 2019, :2355-2359
[15]  
Gong Yuan, 2017, ARXIV171103280
[16]  
Hochreiter S., 1997, Neural Computation, V9, P1735
[17]   Exploration of Compressed ILPR Features for Replay Attack Detection [J].
Jelil, Sarfaraz ;
Kalita, Sishir ;
Prasanna, S. R. Mahadeva ;
Sinha, Rohit .
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, :631-635
[18]   Spoof Detection Using Source, Instantaneous Frequency and Cepstral Features [J].
Jelil, Sarfaraz ;
Das, Rohan Kumar ;
Prasanna, S. R. M. ;
Sinha, Rohit .
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, :22-26
[19]   Advances in anti-spoofing: from the perspective of ASVspoof challenges [J].
Kamble, Madhu R. ;
Sailor, Hardik B. ;
Patil, Hemant A. ;
Li, Haizhou .
APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2020, 9
[20]   Effectiveness of Speech Demodulation-Based Features for Replay Detection [J].
Kamble, Madhu R. ;
Tak, Hemlata ;
Patil, Hemant A. .
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, :641-645