Deep convolutional neural networks for double compressed AMR audio detection

Cited by: 3

Authors:

Buker, Aykut [1]

Hanilci, Cemal [1]

Affiliations:

[1] Bursa Tech Univ, Dept Elect & Elect Engn, Bursa, Turkey

Source:

IET SIGNAL PROCESSING | 2021 / Vol. 15 / Issue 04

Keywords:

STEGANALYSIS;

DOI:

10.1049/sil2.12028

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

Detection of double compressed (DC) adaptive multi-rate (AMR) audio recordings is a challenging audio forensic problem that has received considerable attention in recent years. Here, the authors propose to use convolutional neural networks (CNN) for DC AMR audio detection. The CNN is used as (i) an end-to-end DC AMR audio detection system and (ii) a feature extractor. The end-to-end system receives the audio spectrogram as input and returns a decision on whether the input audio is single compressed (SC) or DC. As a feature extractor, the CNN is used to extract discriminative features, which are then modelled using a support vector machine (SVM) classifier. Extensive analysis conducted on four different datasets shows the success of the proposed system and provides new findings related to the problem. Firstly, double compression has a considerable impact on the high frequency components of the signal. Secondly, the proposed system performs well regardless of the recording device or environment. Thirdly, when previously altered files are used in the experiments, a detection rate of 97.41% is obtained with the CNN system. Finally, the cross-dataset evaluation experiments show that the proposed system remains very effective when there is a mismatch between the training and test datasets.

Pages: 265-280

Page count: 16

50 records in total

[1] Double Compressed AMR Audio Detection Using Long-Term Features and Deep Neural Networks
Buker, Aykut
Hanilci, Cemal
2019 11TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ELECO 2019), 2019, : 590 - 594
[2] Double Compressed Wideband AMR Speech Detection Using Deep Neural Networks
Buker, Aykut
Hanilci, Cemal
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (7) : 4528 - 4546
[3] DETECTING DOUBLE COMPRESSED AMR AUDIO USING DEEP LEARNING
Luo, Da
Yang, Rui
Huang, Jiwu
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[4] Detection of Double Compressed AMR Audio Using Stacked Autoencoder
Luo, Da
Yang, Rui
Li, Bin
Huang, Jiwu
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (02) : 432 - 444
[5] Audio Recapture Detection With Convolutional Neural Networks
Lin, Xiaodan
Liu, Jingxian
Kang, Xiangui
IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (08) : 1480 - 1487
[6] Violent Scene Detection Using Convolutional Neural Networks and Deep Audio Features
Mu, Guankun
Cao, Haibing
Jin, Qin
PATTERN RECOGNITION (CCPR 2016), PT II, 2016, 663 : 451 - 463
[7] Angular Margin Softmax Loss and Its Variants for Double Compressed AMR Audio Detection
Buker, Aykut
Hanilci, Cemal
PROCEEDINGS OF THE 2021 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, IH&MMSEC 2021, 2021, : 45 - 50
[8] Deepfakes Audio Detection Leveraging Audio Spectrogram and Convolutional Neural Networks
Wani, Taiba Majid
Amerini, Irene
IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT II, 2023, 14234 : 156 - 167
[9] Convolutional Recurrent Neural Networks for Bird Audio Detection
Cakir, Emre
Adavanne, Sharath
Parascandolo, Giambattista
Drossos, Konstantinos
Virtanen, Tuomas
2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 1744 - 1748
[10] DIVERGENCE BASED WEIGHTING FOR INFORMATION CHANNELS IN DEEP CONVOLUTIONAL NEURAL NETWORKS FOR BIRD AUDIO DETECTION
Zor, Cemre
Awais, Muhammad
Kittler, Josef
Bober, Miroslaw
Husain, Sameed
Kong, Qiuqiang
Kroos, Christian
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3052 - 3056