Fake Speech Recognition Using Deep Learning

Cited by: 10
Authors
Camacho, Steven [1]
Maria Ballesteros, Dora [1]
Renza, Diego [1]
Affiliations
[1] Univ Mil Nueva Granada, Bogota, Colombia
Source
APPLIED COMPUTER SCIENCES IN ENGINEERING, WEA 2021 | 2021 / Vol. 1431
Keywords
Classification task; Convolutional neural network; Deep learning; Deepfake; Speech recognition; Synthetic audio
DOI
10.1007/978-3-030-86702-7_4
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
The growing number of algorithms and commercial tools for creating synthetic audio has led to a high level of misinformation, especially on social media. As a consequence, recent efforts have focused on detecting this type of content. However, the task is far from solved, as fake audio is becoming increasingly natural-sounding. In this paper we present a model that classifies audio recordings as natural or fake, using an audio preparation stage that includes raw audio transformation and a modelling stage based on a custom Convolutional Neural Network (CNN) architecture. The model is trained on data from the FoR dataset, which contains natural and synthetic audio generated by several deepfake algorithms. Its performance is evaluated with metrics such as F1 score, precision (P) and recall (R). According to the results, the audio recordings are correctly classified in 88.9% of cases.
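The evaluation metrics named in the abstract (precision, recall and F1 score) can be illustrated with a minimal sketch. This is not the authors' evaluation code: the function name `precision_recall_f1`, the choice of label 1 for "fake", and the toy label lists are all assumptions made for illustration, and the numbers are unrelated to the results reported in the paper.

```python
def precision_recall_f1(y_true, y_pred, positive=1):
    """Compute precision, recall and F1 for one class (hypothetical helper).

    Assumption: label 1 means "fake" and label 0 means "natural".
    """
    pairs = list(zip(y_true, y_pred))
    # True positives: fake clips predicted as fake.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    # False positives: natural clips predicted as fake.
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    # False negatives: fake clips predicted as natural.
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return precision, recall, f1


# Toy labels, invented for this example only (not data from the FoR dataset).
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 1, 0, 0, 0, 1, 1, 0]
p, r, f1 = precision_recall_f1(y_true, y_pred)
print(f"P={p:.2f} R={r:.2f} F1={f1:.2f}")
```

With these toy labels there are three true positives, one false positive and one false negative, so all three metrics come out to 0.75.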
Pages: 38-48
Page count: 11