Deep convolutional neural networks for double compressed AMR audio detection

被引:3
|
作者
Buker, Aykut [1 ]
Hanilci, Cemal [1 ]
机构
[1] Bursa Tech Univ, Dept Elect & Elect Engn, Bursa, Turkey
关键词
STEGANALYSIS;
D O I
10.1049/sil2.12028
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Detection of double compressed (DC) adaptive multi-rate (AMR) audio recordings is a challenging audio forensic problem and has received great attention in recent years. Here, the authors propose to use convolutional neural networks (CNN) for DC AMR audio detection. The CNN is used as (i) an end-to-end DC AMR audio detection system and (ii) a feature extractor. The end-to-end system receives the audio spectrogram as the input and returns the decision whether the input audio is single compressed (SC) or DC. As a feature extractor in turn, it is used to extract discriminative features and then these features are modelled using support vector machines (SVM) classifier. Our extensive analysis conducted on four different datasets shows the success of the proposed system and provides new findings related to the problem. Firstly, double compression has a considerable impact on the high frequency components of the signal. Secondly, the proposed system yields great performance independent of the recording device or environment. Thirdly, when previously altered files are used in the experiments, 97.41% detection rate is obtained with the CNN system. Finally, the cross-dataset evaluation experiments show that the proposed system is very effective in case of a mismatch between training and test datasets.
引用
收藏
页码:265 / 280
页数:16
相关论文
共 50 条
  • [1] Double Compressed AMR Audio Detection Using Long-Term Features and Deep Neural Networks
    Buker, Aykut
    Hanilci, Cemal
    2019 11TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ELECO 2019), 2019, : 590 - 594
  • [2] Double Compressed Wideband AMR Speech Detection Using Deep Neural Networks
    Buker, Aykut
    Hanilci, Cemal
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (7) : 4528 - 4546
  • [3] DETECTING DOUBLE COMPRESSED AMR AUDIO USING DEEP LEARNING
    Luo, Da
    Yang, Rui
    Huang, Jiwu
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [4] Detection of Double Compressed AMR Audio Using Stacked Autoencoder
    Luo, Da
    Yang, Rui
    Li, Bin
    Huang, Jiwu
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (02) : 432 - 444
  • [5] Audio Recapture Detection With Convolutional Neural Networks
    Lin, Xiaodan
    Liu, Jingxian
    Kang, Xiangui
    IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (08) : 1480 - 1487
  • [6] Violent Scene Detection Using Convolutional Neural Networks and Deep Audio Features
    Mu, Guankun
    Cao, Haibing
    Jin, Qin
    PATTERN RECOGNITION (CCPR 2016), PT II, 2016, 663 : 451 - 463
  • [7] Angular Margin Softmax Loss and Its Variants for Double Compressed AMR Audio Detection
    Buker, Aykut
    Hanilci, Cemal
    PROCEEDINGS OF THE 2021 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, IH&MMSEC 2021, 2021, : 45 - 50
  • [8] Deepfakes Audio Detection Leveraging Audio Spectrogram and Convolutional Neural Networks
    Wani, Taiba Majid
    Amerini, Irene
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2023, PT II, 2023, 14234 : 156 - 167
  • [9] Convolutional Recurrent Neural Networks for Bird Audio Detection
    Cakir, Emre
    Adavanne, Sharath
    Parascandolo, Giambattista
    Drossos, Konstantinos
    Virtanen, Tuomas
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 1744 - 1748
  • [10] DIVERGENCE BASED WEIGHTING FOR INFORMATION CHANNELS IN DEEP CONVOLUTIONAL NEURAL NETWORKS FOR BIRD AUDIO DETECTION
    Zor, Cemre
    Awais, Muhammad
    Kittler, Josef
    Bober, Miroslaw
    Husain, Sameed
    Kong, Qiuqiang
    Kroos, Christian
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3052 - 3056