TEMPORAL CODING OF LOCAL SPECTROGRAM FEATURES FOR ROBUST SOUND RECOGNITION

被引:0
作者
Dennis, Jonathan [1 ]
Qiang, Yu [1 ]
Tang Huajin [1 ]
Tran Huy Dat [1 ]
Li Haizhou [1 ]
机构
[1] ASTAR, Inst Infocomm Res, Singapore 138632, Singapore
来源
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年
关键词
Sound recognition; neural coding; local features; AUTOMATIC SPEECH RECOGNITION; NOISE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
There is much evidence to suggest that the human auditory system uses localised time-frequency information for the robust recognition of sounds. Despite this, conventional systems typically rely on features extracted from short windowed frames over time,covering the whole frequency spectrum. Such approaches are not inherently robust to noise, as each frame will contain a mixture of the spectral information from noise and signal. Here, we propose a novel approach based on the temporal coding of Local Spectrogram Features (LSFs), which generate spikes that are used to traina Spiking Neural Network (SNN) with temporal learning. LSFs represent robust location information in the spectrogram surrounding keypoints,which are detected in a signal-driven manner such that the effect of noise on the temporal coding is reduced. Our experiments demonstrate the robust performance of our approach a cross a variety of noise conditions, such that it is able to out perform the conventional frame-based baseline methods
引用
收藏
页码:803 / 807
页数:5
相关论文
共 50 条
[41]   Feature selection for robust automatic speech recognition: a temporal offset approach [J].
Trottier, Ludovic ;
Giguere, Philippe ;
Chaib-draa, Brahim .
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2015, 18 (03) :395-404
[42]   Temporal Envelope Subtraction for Robust Speech Recognition Using Modulation Spectrum [J].
Ganapathy, Sriram ;
Thomas, Samuel ;
Hermansky, Hynek .
2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, :164-169
[43]   Combining feature space discriminative training with long-term spectro-temporal features for noise-robust speech recognition [J].
Fukuda, Takashi ;
Ichikawa, Osamu ;
Nishimura, Masafumi .
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, :236-239
[44]   Spectro-temporal Power Spectrum Features for Noise Robust ASR [J].
Seresht, Hamed Riazati ;
Ahadi, Seyed Mohammad ;
Seyedin, Sanaz .
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2017, 36 (08) :3222-3242
[45]   Spectro-temporal Power Spectrum Features for Noise Robust ASR [J].
Hamed Riazati Seresht ;
Seyed Mohammad Ahadi ;
Sanaz Seyedin .
Circuits, Systems, and Signal Processing, 2017, 36 :3222-3242
[46]   Facial Expression Recognition Based on Local Features of Transfer Learning [J].
Feng, Haiqiang ;
Shao, Jingfeng .
PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, :71-76
[47]   LOCAL FEATURES AND SPARSE REPRESENTATION FOR FACE RECOGNITION WITH PARTIAL OCCLUSIONS [J].
Adamo, A. ;
Grossi, G. ;
Lanzarotti, R. .
2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, :3008-3012
[48]   Facial Expression Recognition from Global and a Combination of Local Features [J].
Praseeda, Lekshmi V. ;
Sasikumar, M. .
IETE TECHNICAL REVIEW, 2009, 26 (01) :41-46
[49]   Image Recognition Using Local Features Based NNSC Model [J].
Shang, Li ;
Zhou, Yan ;
Sun, Zhanli .
INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2017, PT I, 2017, 10361 :190-199
[50]   Multiple Face Recognition Using Local Features and Swarm Intelligence [J].
Chidambaram, Chidambaram ;
Vieira Neto, Hugo ;
Dorini, Leyza Elmeri Baldo ;
Lopes, Heitor Silverio .
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (06) :1614-1623