Pseudo-color cochleagram image feature and sequential feature selection for robust acoustic event recognition

Cited by: 15
Authors
Sharan, Roneel V. [1]
Moir, Tom J. [2]
Affiliations
[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
[2] Auckland Univ Technol, Sch Engn, Private Bag 92006, Auckland 1142, New Zealand
Keywords
Acoustic event recognition; Cochleagram; Pseudo-color; Sequential backward feature selection; Support vector machines; Time-frequency image; CLASSIFICATION; NOISE
DOI
10.1016/j.apacoust.2018.05.030
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This work proposes using a pseudo-color cochleagram image of a sound signal as the basis of feature extraction for robust acoustic event recognition. A cochleagram is a variation of the spectrogram that uses a gammatone filter and has been shown to better reveal spectral information. We propose mapping the grayscale cochleagram image to a higher-dimensional color space so that the sound signal is better characterized in the presence of environmental noise. The resulting time-frequency representation is referred to as the pseudo-color cochleagram image, and the resulting feature, which captures the statistical distribution, as the pseudo-color cochleagram image feature (PC-CIF). In addition, sequential backward feature selection is applied to select the most useful feature dimensions, thereby reducing the feature dimension and improving classification performance. We evaluate the effectiveness of the proposed methods using two classifiers, k-nearest neighbor and support vector machines. Performance is evaluated on a dataset of 50 sound classes taken from the Real World Computing Partnership Sound Scene Database in Real Acoustical Environments, with environmental noise added at various signal-to-noise ratios. The experimental results show that the proposed techniques significantly improve classification performance over baseline methods, with the largest improvements observed at low signal-to-noise ratios.
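The two ideas named in the abstract, pseudo-color mapping and sequential backward feature selection, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not state which colormap the authors use, so `pseudo_color` uses a simple piecewise-linear "jet"-like map, and `sequential_backward_selection` is a generic greedy routine driven by a caller-supplied `score` function (for example, cross-validated classifier accuracy), not the authors' implementation.

```python
from typing import Callable, List, Sequence, Tuple


def pseudo_color(gray: float) -> Tuple[float, float, float]:
    """Map a normalized grayscale intensity in [0, 1] to an (R, G, B) triple.

    Assumption: a piecewise-linear "jet"-like colormap, chosen only for
    illustration; the colormap used in the paper is not given in the abstract.
    """
    g = min(max(gray, 0.0), 1.0)  # clamp the input to [0, 1]
    red = min(max(1.5 - abs(4.0 * g - 3.0), 0.0), 1.0)
    green = min(max(1.5 - abs(4.0 * g - 2.0), 0.0), 1.0)
    blue = min(max(1.5 - abs(4.0 * g - 1.0), 0.0), 1.0)
    return (red, green, blue)


def sequential_backward_selection(
    n_features: int,
    score: Callable[[Sequence[int]], float],
    min_features: int = 1,
) -> List[int]:
    """Greedy backward elimination over feature indices 0..n_features-1.

    `score` is a hypothetical callback that returns a quality measure
    (higher is better) for a subset of feature indices.
    """
    selected = list(range(n_features))
    best_score = score(selected)
    while len(selected) > min_features:
        # Build every subset obtained by dropping exactly one feature.
        candidates = [[f for f in selected if f != drop] for drop in selected]
        scores = [score(c) for c in candidates]
        top = max(range(len(candidates)), key=scores.__getitem__)
        if scores[top] < best_score:
            break  # every single removal lowers the score, so stop
        best_score = scores[top]
        selected = candidates[top]
    return selected
```

In this sketch a removal is accepted when the score stays the same or improves, which favors smaller feature sets; a stricter rule that requires a strict improvement would be an equally reasonable choice.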
Pages: 198-204
Page count: 7