Pseudo-color cochleagram image feature and sequential feature selection for robust acoustic event recognition

被引:15
|
作者
Sharan, Roneel V. [1 ]
Moir, Tom J. [2 ]
机构
[1] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld 4072, Australia
[2] Auckland Univ Technol, Sch Engn, Private Bag 92006, Auckland 1142, New Zealand
关键词
Acoustic event recognition; Cochleagram; Pseudo-color; Sequential backward feature selection; Support vector machines; Time-frequency image; CLASSIFICATION; NOISE;
D O I
10.1016/j.apacoust.2018.05.030
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This work proposes the use of pseudo-color cochleagram image of sound signals for feature extraction for robust acoustic event recognition. A cochleagram is a variation of the spectrogram. It utilizes a gammatone filter and has been shown to better reveal spectral information. We propose mapping of the grayscale cochleagram image to higher dimensional color space for improved characterization from environmental noise. The resulting time frequency representation is referred as pseudo-color cochleagram image and the resulting feature, which captures the statistical distribution, as pseudo-color cochleagram image feature (PC-CIF). In addition, sequential backward feature selection is applied for selecting the most useful feature dimensions, thereby reducing the feature dimension and improving the classification performance. We evaluate the effectiveness of the proposed methods using two classifiers, k-nearest neighbor and support vector machines. The performance is evaluated on a dataset containing 50 sound classes, taken from the Real World Computing Partnership Sound Scene Database in Real Acoustical Environments, with the addition of environmental noise at various signal-to-noise ratios. The experimental results show that the proposed techniques give significant improvement in classification performance over baseline methods. The most improved results were observed at low signal-to-noise ratios.
引用
收藏
页码:198 / 204
页数:7
相关论文
共 34 条
  • [1] Cochleagram Image Feature for Improved Robustness in Sound Recognition
    Sharan, Roneel V.
    Moir, Tom J.
    2015 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2015, : 441 - 444
  • [2] Acoustic event recognition using cochleagram image and convolutional neural networks
    Sharan, Roneel V.
    Moir, Tom J.
    APPLIED ACOUSTICS, 2019, 148 : 62 - 66
  • [3] A lightweight network based on multi-feature pseudo-color mapping for arrhythmia recognition
    Ma, Yijun
    Li, Junyan
    Zhang, Jinbiao
    Wang, Jilin
    Sun, Guozhen
    Zhang, Yatao
    HEALTH INFORMATION SCIENCE AND SYSTEMS, 2024, 12 (01):
  • [4] Robust acoustic event recognition using AVMD-PWVD time-frequency image
    Zhang, Yanhua
    Zhang, Ke
    Wang, Jingyu
    Su, Yu
    APPLIED ACOUSTICS, 2021, 178
  • [5] Sequential Forward Feature Selection for Facial Expression Recognition
    Gacav, Caner
    Benligiray, Burak
    Topal, Cihan
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 1481 - 1484
  • [6] Robust jointly sparse regression for image feature selection
    Mo, Dongmei
    Lai, Zhihui
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 477 - 482
  • [7] A joint framework of feature reduction and robust feature selection for cucumber leaf diseases recognition
    Kianat, Jaweria
    Khan, Muhammad Attique
    Sharif, Muhammad
    Akram, Tallha
    Rehman, Amjad
    Saba, Tanzila
    OPTIK, 2021, 240
  • [8] ROBUST MINIMUM STATISTICS PROJECT COEFFICIENTS FEATURE FOR ACOUSTIC ENVIRONMENT RECOGNITION
    Deng, Shiwen
    Han, Jiqing
    Zhang, Chaozhu
    Zheng, Tieran
    Zheng, Guibin
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [9] Unsupervised Temporal Feature Learning Based on Sparse Coding Embedded BoAW for Acoustic Event Recognition
    Zhang Liwen
    Han Jiqing
    Deng Shiwen
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3284 - 3288
  • [10] Cascaded Acoustic Group and Individual Feature Selection for Recognition of Food Likability
    Pir, Dara
    ICPRAM: PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2019, : 881 - 886