TEMPORAL CODING OF LOCAL SPECTROGRAM FEATURES FOR ROBUST SOUND RECOGNITION

被引:0
作者
Dennis, Jonathan [1 ]
Qiang, Yu [1 ]
Tang Huajin [1 ]
Tran Huy Dat [1 ]
Li Haizhou [1 ]
机构
[1] ASTAR, Inst Infocomm Res, Singapore 138632, Singapore
来源
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年
关键词
Sound recognition; neural coding; local features; AUTOMATIC SPEECH RECOGNITION; NOISE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
There is much evidence to suggest that the human auditory system uses localised time-frequency information for the robust recognition of sounds. Despite this, conventional systems typically rely on features extracted from short windowed frames over time,covering the whole frequency spectrum. Such approaches are not inherently robust to noise, as each frame will contain a mixture of the spectral information from noise and signal. Here, we propose a novel approach based on the temporal coding of Local Spectrogram Features (LSFs), which generate spikes that are used to traina Spiking Neural Network (SNN) with temporal learning. LSFs represent robust location information in the spectrogram surrounding keypoints,which are detected in a signal-driven manner such that the effect of noise on the temporal coding is reduced. Our experiments demonstrate the robust performance of our approach a cross a variety of noise conditions, such that it is able to out perform the conventional frame-based baseline methods
引用
收藏
页码:803 / 807
页数:5
相关论文
共 50 条
  • [31] Increased reliance on temporal coding when target sound is softer than the background
    Alamatsaz, Nima
    Rosen, Merri J.
    Ihlefeld, Antje
    [J]. SCIENTIFIC REPORTS, 2024, 14 (01)
  • [32] Local Features Applied to Dermoscopy Images: Bag-of-Features versus Sparse Coding
    Barata, Catarina
    Figueiredo, Mario A. T.
    Emre Celebi, M.
    Marques, Jorge S.
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2017), 2017, 10255 : 528 - 536
  • [33] Multispectral Palmprint Recognition based on Fusion of Local Features
    Amraoui, Amine
    Fakhri, Youssef
    Kerroum, Mounir Ait
    [J]. PROCEEDINGS OF 2018 6TH INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2018, : 115 - 120
  • [34] Unimodal Palmprint Recognition System Based on Local Features
    Amraoui, Amine
    Fakhri, Youssef
    Ait Kerroum, Mounir
    [J]. 2017 3RD INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2017, : 403 - 407
  • [35] Local and global features extracting and fusion for microbial recognition
    Li Xiaojuan
    Chen Cunshe
    Liang Anbo
    Shi Yan
    [J]. SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 2, PROCEEDINGS, 2007, : 507 - +
  • [36] Enhancing the magnitude spectrum of speech features for robust speech recognition
    Hung, Jeih-weih
    Fan, Hao-teng
    Tu, Wen-hsiang
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2012,
  • [37] ROBUST EXCITATION-BASED FEATURES FOR AUTOMATIC SPEECH RECOGNITION
    Drugman, Thomas
    Stylianou, Yannis
    Chen, Langzhou
    Chen, Xie
    Gales, Mark J. F.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4664 - 4668
  • [38] Robust HI and dysarthric speaker recognition - perceptual features and models
    Revathi, A.
    Nagakrishnan, R.
    Sasikaladevi, N.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (06) : 8215 - 8233
  • [39] Sound-Event Classification Using Robust Texture Features for Robot Hearing
    Ren, Jianfeng
    Jiang, Xudong
    Yuan, Junsong
    Magnenat-Thalmann, Nadia
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (03) : 447 - 458
  • [40] Deep Learning Based Dereverberation of Temporal Envelopes for Robust Speech Recognition
    Purushothaman, Anurenjan
    Sreeram, Anirudh
    Kumar, Rohit
    Ganapathy, Sriram
    [J]. INTERSPEECH 2020, 2020, : 1688 - 1692