TEMPORAL CODING OF LOCAL SPECTROGRAM FEATURES FOR ROBUST SOUND RECOGNITION

被引:0
|
作者
Dennis, Jonathan [1 ]
Qiang, Yu [1 ]
Tang Huajin [1 ]
Tran Huy Dat [1 ]
Li Haizhou [1 ]
机构
[1] ASTAR, Inst Infocomm Res, Singapore 138632, Singapore
来源
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年
关键词
Sound recognition; neural coding; local features; AUTOMATIC SPEECH RECOGNITION; NOISE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
There is much evidence to suggest that the human auditory system uses localised time-frequency information for the robust recognition of sounds. Despite this, conventional systems typically rely on features extracted from short windowed frames over time,covering the whole frequency spectrum. Such approaches are not inherently robust to noise, as each frame will contain a mixture of the spectral information from noise and signal. Here, we propose a novel approach based on the temporal coding of Local Spectrogram Features (LSFs), which generate spikes that are used to traina Spiking Neural Network (SNN) with temporal learning. LSFs represent robust location information in the spectrogram surrounding keypoints,which are detected in a signal-driven manner such that the effect of noise on the temporal coding is reduced. Our experiments demonstrate the robust performance of our approach a cross a variety of noise conditions, such that it is able to out perform the conventional frame-based baseline methods
引用
收藏
页码:803 / 807
页数:5
相关论文
共 50 条
  • [1] Overlapping Sound Event Recognition using Local Spectrogram Features with the Generalised Hough Transform
    Dennis, Jonathan
    Huy Dat Tran
    Chng, Eng Siong
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2263 - 2266
  • [2] Overlapping sound event recognition using local spectrogram features and the generalised hough transform
    Dennis, J.
    Tran, H. D.
    Chng, E. S.
    PATTERN RECOGNITION LETTERS, 2013, 34 (09) : 1085 - 1093
  • [3] Robust local features for remote face recognition
    Chen, Jie
    Patel, Vishal M.
    Liu, Li
    Kellokumpu, Vili
    Zhao, Guoying
    Pietikainen, Matti
    Chellappa, Rama
    IMAGE AND VISION COMPUTING, 2017, 64 : 34 - 46
  • [4] AMPLITUDE MODULATION SPECTROGRAM BASED FEATURES FOR ROBUST SPEECH RECOGNITION IN NOISY AND REVERBERANT ENVIRONMENTS
    Moritz, Niko
    Anemueller, Joern
    Kollmeier, Birger
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5492 - 5495
  • [5] Flow Pattern Recognition Using Spectrogram of Flow Generated Sound with New Adaptive LBP Features
    Parsai, Soroosh
    Ahmadi, Majid
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2022, VOL. 3, 2023, 464 : 401 - 413
  • [6] PaImprint Recognition Based on CNN and Local Coding Features
    Yang, Aoqi
    Zhang, Jianxin
    Sun, Qiule
    Zhang, Qiang
    PROCEEDINGS OF 2017 6TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2017), 2017, : 482 - 487
  • [7] Ear recognition via sparse coding of local features
    Al Rahhal, Mohamad Mahmoud
    Mekhalfi, Mohamed Lamine
    Ali, Taghreed Abdullah Mohammed
    Bazi, Yakoub
    Al Zuair, Mansour
    Rangarajan, Lalitha
    JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (01)
  • [8] Sound recognition method for white feather broilers based on spectrogram features and the fusion classification model
    Lv, Meixuan
    Sun, Zhigang
    Zhang, Min
    Geng, Renxuan
    Gao, Mengmeng
    Wang, Guotao
    MEASUREMENT, 2023, 222
  • [9] Robust speech recognition using the modulation spectrogram
    Kingsbury, BED
    Morgan, N
    Greenberg, S
    SPEECH COMMUNICATION, 1998, 25 (1-3) : 117 - 132
  • [10] Animal Sound Recognition Based on Double Feature of Spectrogram
    LI Ying
    HUANG Hongkeng
    WU Zhibin
    Chinese Journal of Electronics, 2019, 28 (04) : 667 - 673