Voice spoofing countermeasure for voice replay attacks using deep learning

被引:0
|
作者
Jincheng Zhou
Tao Hai
Dayang N. A. Jawawi
Dan Wang
Ebuka Ibeke
Cresantus Biamba
机构
[1] Qiannan Normal University for Nationalities,School of Computer and Information
[2] Key Laboratory of Complex Systems and Intelligent Optimization of Guizhou Province,School of Mathematics and Statistics
[3] School of Computing,undefined
[4] Faculty of Engineering,undefined
[5] Universiti Teknologi Malaysia (UTM),undefined
[6] 81310 UTM Skudai,undefined
[7] Johor Bahru,undefined
[8] Qiannan Normal University for Nationalities,undefined
[9] School of Creative and Cultural Business,undefined
[10] Robert Gordon University,undefined
[11] Department of Educational Sciences,undefined
[12] University of Gavle,undefined
来源
关键词
Automatic Speaker Verification (ASV) spoofing voice biometrics deep learning neural network machine learning;
D O I
暂无
中图分类号
学科分类号
摘要
In our everyday lives, we communicate with each other using several means and channels of communication, as communication is crucial in the lives of humans. Listening and speaking are the primary forms of communication. For listening and speaking, the human voice is indispensable. Voice communication is the simplest type of communication. The Automatic Speaker Verification (ASV) system verifies users with their voices. These systems are susceptible to voice spoofing attacks - logical and physical access attacks. Recently, there has been a notable development in the detection of these attacks. Attackers use enhanced gadgets to record users’ voices, replay them for the ASV system, and be granted access for harmful purposes. In this work, we propose a secure voice spoofing countermeasure to detect voice replay attacks. We enhanced the ASV system security by building a spoofing countermeasure dependent on the decomposed signals that consist of prominent information. We used two main features— the Gammatone Cepstral Coefficients and Mel-Frequency Cepstral Coefficients— for the audio representation. For the classification of the features, we used Bi-directional Long-Short Term Memory Network in the cloud, a deep learning classifier. We investigated numerous audio features and examined each feature’s capability to obtain the most vital details from the audio for it to be labelled genuine or a spoof speech. Furthermore, we use various machine learning algorithms to illustrate the superiority of our system compared to the traditional classifiers. The results of the experiments were classified according to the parameters of accuracy, precision rate, recall, F1-score, and Equal Error Rate (EER). The results were 97%, 100%, 90.19% and 94.84%, and 2.95%, respectively.
引用
收藏
相关论文
共 50 条
  • [1] Voice spoofing countermeasure for voice replay attacks using deep learning
    Zhou, Jincheng
    Hai, Tao
    Jawawi, Dayang N. A.
    Wang, Dan
    Ibeke, Ebuka
    Biamba, Cresantus
    JOURNAL OF CLOUD COMPUTING-ADVANCES SYSTEMS AND APPLICATIONS, 2022, 11 (01):
  • [2] Voice Spoofing Countermeasure for Logical Access Attacks Detection
    Arif, Tuba
    Javed, Ali
    Alhameed, Mohammed
    Jeribi, Fathe
    Tahir, Ali
    IEEE ACCESS, 2021, 9 : 162857 - 162868
  • [3] Voice spoofing detection for multiclass attack classification using deep learning
    Boyd, Jason
    Fahim, Muhammad
    Olukoya, Oluwafemi
    MACHINE LEARNING WITH APPLICATIONS, 2023, 14
  • [4] Voice Feature Learning using Convolutional Neural Networks Designed to Avoid Replay Attacks
    Duraibi, Salahaldeen
    Alhamdani, Wasim
    Sheldon, Frederick T.
    2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 1845 - 1851
  • [5] Detection of Voice Conversion Spoofing Attacks Using Voiced Speech
    Pillai, Arun Sankar Muttathu Sivasankara
    De Leon, Phillip L.
    Roedig, Utz
    SECURE IT SYSTEMS, NORDSEC 2022, 2022, 13700 : 159 - 175
  • [6] Spoofing Attacks to I-vector Based Voice Verification Systems Using Statistical Speech Synthesis with Additive Noise and Countermeasure
    Ozbay, Mustafa Caner
    Khodabakhsh, Ali
    Mohammadi, Amir
    Demiroglu, Cenk
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1207 - 1211
  • [7] An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure Systems
    Zhang, You
    Zhu, Ge
    Jiang, Fei
    Duan, Zhiyao
    INTERSPEECH 2021, 2021, : 4309 - 4313
  • [8] Ensemble learning for countermeasure of audio replay spoofing attack in ASVspoof2017
    Ji, Zhe
    Li, Zhi-Yi
    Li, Peng
    An, Maobo
    Gao, Shengxiang
    Wu, Dan
    Zhao, Faru
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 87 - 91
  • [9] Spoofing Attacks on Speaker Verification Systems Based Generated Voice Using Genetic Algorithm
    Li, Qi
    Zhu, Hui
    Zhang, Ziling
    Lu, Rongxing
    Wang, Fengwei
    Li, Hui
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [10] Voice Gender Recognition Using Deep Learning
    Buyukyilmaz, Mucahit
    Cibikdiken, Ali Osman
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON MODELING, SIMULATION AND OPTIMIZATION TECHNOLOGIES AND APPLICATIONS (MSOTA2016), 2016, 58 : 409 - 411