Detecting Replay Attacks Using Multi-Channel Audio: A Neural Network-Based Method

被引：16

作者：

Gong, Yuan ^{[1
]}

Yang, Jian ^{[1
]}

Poellabauer, Christian ^{[1
]}

机构：

[1] Univ Notre Dame, Dept Comp Sci & Engn, Notre Dame, IN 46637 USA

来源：

IEEE SIGNAL PROCESSING LETTERS | 2020年 / 27卷

基金：

美国国家科学基金会;

关键词：

Microphone arrays; Task analysis; Feature extraction; Speech recognition; Array signal processing; Convolution; Microphone array signal processing; voice anti-spoofing; replay attack; beamforming; INSTANTANEOUS FREQUENCY; FEATURES;

D O I：

10.1109/LSP.2020.2996908

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

With the rapidly growing number of security-sensitive systems that use voice as the primary input, it becomes increasingly important to address these systems' potential vulnerability to replay attacks. Previous efforts to address this concern have focused primarily on single-channel audio. In this paper, we introduce a novel neural network-based replay attack detection model that further leverages spatial information of multi-channel audio and is able to significantly improve the replay attack detection performance.

引用

页码：920 / 924

页数：5

共 40 条

[21]

Kingma DP, 2014, ADV NEUR IN, V27

[22] The ASVspoof 2017 Challenge: Assessing the Limits of Replay Spoofing Attack Detection [J].

Kinnunen, Tomi ;

Sahidullah, Md ;

Delgado, Hector ;

Todisco, Massimiliano ;

Evans, Nicholas ;

Yamagishi, Junichi ;

Lee, Kong Aik .

18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, :2-6

[23]

Kinnunen T, 2017, INT CONF ACOUST SPEE, P5395, DOI 10.1109/ICASSP.2017.7953187

[24] Audio replay attack detection with deep learning frameworks [J].

Lavrentyeva, Galina ;

Novoselov, Sergey ;

Malykh, Egor ;

Kozlov, Alexander ;

Kudashev, Oleg ;

Shchemelinin, Vadim .

18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, :82-86

[25]

Loshchilov Ilya, 2017, CoRR

[26]

Nair V., 2010, P 27 INT C MACH LEAR, P807

[27] Energy Separation-based Instantaneous Frequency Estimation for Cochlear Cepstral Feature for Replay Spoof Detection [J].

Patil, Ankur T. ;

Acharya, Rajul ;

Sai, Pulikonda Aditya ;

Patil, Hemant A. .

INTERSPEECH 2019, 2019, :2898-2902

[28]

Ryoya Y, 2019, ASIAPAC SIGN INFO PR, P833, DOI [10.1109/apsipaasc47483.2019.9023181, 10.1109/APSIPAASC47483.2019.9023181]

[29] Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition [J].

Sainath, Tara N. ;

Weiss, Ron J. ;

Wilson, Kevin W. ;

Li, Bo ;

Narayanan, Arun ;

Variani, Ehsan ;

Bacchiani, Michiel ;

Shafran, Izhak ;

Senior, Andrew ;

Chin, Kean ;

Misra, Ananya ;

Kim, Chanwoo .

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (05) :965-979

[30]

Sainath TN, 2015, 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, P1

← 1 2 3 4 →