Rethinking environmental sound classification using convolutional neural networks: optimized parameter tuning of single feature extraction

被引:0
|
作者
Yousef Abd Al-Hattab
Hasan Firdaus Zaki
Amir Akramin Shafie
机构
[1] International Islamic University Malaysia,Department of Mechatronics Engineering
来源
关键词
Convolutional neural networks (CNN); Mel-frequency cepstral coefficients (MFCC); Environmental sound classification; Feature extraction; Urbansound8Kdataset;
D O I
暂无
中图分类号
学科分类号
摘要
The classification of environmental sounds is important for emerging applications such as automatic audio surveillance, audio forensics, and robot navigation. Existing techniques combined multiple features and stacked many CNN layers (very deep learning) to reach the desired accuracy. Instead of using many features and going deeper by stacking layers that are resource extensive, this paper proposes a novel technique that uses only a single feature, namely the Mel-Frequency Cepstral Coefficient (MFCC) and just three layers of CNN. We demonstrate that such a simple network can considerably outperform several conventional and deep learning-based algorithms. Through parameters fine-tuning of the data input, we reported a model that is significantly less complex in the architecture yet has recorded a similar accuracy of 95.59% compared to state-of-the-art deep models on UrbanSound8k dataset.
引用
收藏
页码:14495 / 14506
页数:11
相关论文
共 50 条
  • [21] Feature extraction and classification of learners using neural networks
    Hayashida, Tomohiro
    Yamamoto, Toru
    Wakitani, Shin
    Nishizaki, Ichiro
    Sekizaki, Shinya
    Tanimoto, Yusuke
    2018 IEEE FRONTIERS IN EDUCATION CONFERENCE (FIE), 2018,
  • [22] Feature extraction and classification of VHR images with attribute profiles and convolutional neural networks
    Tian Tian
    Lang Gao
    Weijing Song
    Kim-Kwang Raymond Choo
    Jijun He
    Multimedia Tools and Applications, 2018, 77 : 18637 - 18656
  • [23] Smart feature extraction and classification of hyperspectral images based on convolutional neural networks
    Hamouda, Maissa
    Ettabaa, Karim Saheb
    Bouhlel, Med Salim
    IET IMAGE PROCESSING, 2020, 14 (10) : 1999 - 2005
  • [24] Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks
    Chen, Yushi
    Jiang, Hanlu
    Li, Chunyang
    Jia, Xiuping
    Ghamisi, Pedram
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (10): : 6232 - 6251
  • [25] Feature extraction and classification of VHR images with attribute profiles and convolutional neural networks
    Tian, Tian
    Gao, Lang
    Song, Weijing
    Choo, Kim-Kwang Raymond
    He, Jijun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (14) : 18637 - 18656
  • [26] Classification of Guitar Effects and Extraction of Their Parameter Settings from Instrument Mixes Using Convolutional Neural Networks
    Hinrichs, Reemt
    Gerkens, Kevin
    Ostermann, Joern
    ARTIFICIAL INTELLIGENCE IN MUSIC, SOUND, ART AND DESIGN (EVOMUSART 2022), 2022, : 101 - 116
  • [27] AGRICULTURAL HARVESTER SOUND CLASSIFICATION USING CONVOLUTIONAL NEURAL NETWORKS AND SPECTROGRAMS
    Khorasani, Nioosha E.
    Thomas, Gabriel
    Balocco, Simone
    Mann, Danny
    APPLIED ENGINEERING IN AGRICULTURE, 2022, 38 (02) : 455 - 459
  • [28] Lung Sound Classification Using Snapshot Ensemble of Convolutional Neural Networks
    Truc Nguyen
    Pernkopf, Franz
    42ND ANNUAL INTERNATIONAL CONFERENCES OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY: ENABLING INNOVATIVE TECHNOLOGIES FOR GLOBAL HEALTHCARE EMBC'20, 2020, : 760 - 763
  • [29] Tool Classification in Laparoscopic Images Using Feature Fusion Convolutional Neural Networks: A Single Label Classification Approach
    ElMoaqet, H.
    Qaddoura, H.
    AlMasri, T.
    Alshirbaji, T. Abdulbaki
    Jalal, N. A.
    Moeller, K.
    IFAC PAPERSONLINE, 2024, 58 (24): : 391 - 396
  • [30] ECG Feature Extraction and Classification Using Cepstrum and Neural Networks
    Jen, Kuo-Kuang
    Hwang, Yean-Ren
    JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2008, 28 (01) : 31 - 37