Fake Speech Recognition Using Deep Learning

Cited by: 10

Authors:

Camacho, Steven [1]

Maria Ballesteros, Dora [1]

Renza, Diego [1]

Affiliations:

[1] Univ Mil Nueva Granada, Bogota, Colombia

Source:

APPLIED COMPUTER SCIENCES IN ENGINEERING, WEA 2021 | 2021 / Vol. 1431

Keywords:

Classification task; Convolutional neural network; Deep learning; Deepfake; Speech recognition; Synthetic audio

DOI:

10.1007/978-3-030-86702-7_4

Chinese Library Classification (CLC) number:

TP39 [Computer applications]

Discipline classification codes:

081203; 0835

Abstract:

The growing number of algorithms and commercial tools for creating synthetic audio has led to a high level of misinformation, especially on social media. As a consequence, recent efforts have focused on detecting this type of content. However, the task is far from solved, as fake audio sounds increasingly natural. In this paper we present a model that classifies audio as natural or fake, using an audio preparation stage that includes raw audio transformation and a modelling stage based on a custom Convolutional Neural Network (CNN) architecture. The model is trained on data from the FoR dataset, which contains natural and synthetic audio generated by several deepfake algorithms. Its performance is evaluated with metrics such as F1 score, precision (P) and recall (R). According to the results, audio samples are correctly classified in 88.9% of cases.
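The abstract names precision, recall and F1 score as evaluation metrics but does not show how they are computed. The following is a minimal sketch of their standard definitions for a binary classifier, treating "fake" as the positive class. The function name `precision_recall_f1` and the confusion counts are hypothetical and chosen only for illustration; they are not results from the paper.

```python
def precision_recall_f1(tp, fp, fn):
    """Return (precision, recall, F1) for the positive ("fake") class.

    tp: fake clips correctly flagged as fake
    fp: natural clips wrongly flagged as fake
    fn: fake clips missed (labelled natural)
    """
    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical counts for illustration only; not taken from the paper.
p, r, f1 = precision_recall_f1(tp=80, fp=20, fn=10)
print(f"P={p:.3f} R={r:.3f} F1={f1:.3f}")
```

Because F1 is a harmonic mean, it stays low whenever either precision or recall is low, which makes it a stricter summary than plain accuracy when the two classes are imbalanced.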


Pages: 38-48

Page count: 11
