TEMPORAL CODING OF LOCAL SPECTROGRAM FEATURES FOR ROBUST SOUND RECOGNITION

被引：0

作者：

Dennis, Jonathan ^{[1
]}

Qiang, Yu ^{[1
]}

Tang Huajin ^{[1
]}

Tran Huy Dat ^{[1
]}

Li Haizhou ^{[1
]}

机构：

[1] ASTAR, Inst Infocomm Res, Singapore 138632, Singapore

来源：

2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年

关键词：

Sound recognition; neural coding; local features; AUTOMATIC SPEECH RECOGNITION; NOISE;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

There is much evidence to suggest that the human auditory system uses localised time-frequency information for the robust recognition of sounds. Despite this, conventional systems typically rely on features extracted from short windowed frames over time,covering the whole frequency spectrum. Such approaches are not inherently robust to noise, as each frame will contain a mixture of the spectral information from noise and signal. Here, we propose a novel approach based on the temporal coding of Local Spectrogram Features (LSFs), which generate spikes that are used to traina Spiking Neural Network (SNN) with temporal learning. LSFs represent robust location information in the spectrogram surrounding keypoints,which are detected in a signal-driven manner such that the effect of noise on the temporal coding is reduced. Our experiments demonstrate the robust performance of our approach a cross a variety of noise conditions, such that it is able to out perform the conventional frame-based baseline methods

引用

页码：803 / 807

页数：5

共 50 条

[31] Increased reliance on temporal coding when target sound is softer than the background
Alamatsaz, Nima
Rosen, Merri J.
Ihlefeld, Antje
[J]. SCIENTIFIC REPORTS, 2024, 14 (01)
[32] Local Features Applied to Dermoscopy Images: Bag-of-Features versus Sparse Coding
Barata, Catarina
Figueiredo, Mario A. T.
Emre Celebi, M.
Marques, Jorge S.
[J]. PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2017), 2017, 10255 : 528 - 536
[33] Multispectral Palmprint Recognition based on Fusion of Local Features
Amraoui, Amine
Fakhri, Youssef
Kerroum, Mounir Ait
[J]. PROCEEDINGS OF 2018 6TH INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS), 2018, : 115 - 120
[34] Unimodal Palmprint Recognition System Based on Local Features
Amraoui, Amine
Fakhri, Youssef
Ait Kerroum, Mounir
[J]. 2017 3RD INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2017, : 403 - 407
[35] Local and global features extracting and fusion for microbial recognition
Li Xiaojuan
Chen Cunshe
Liang Anbo
Shi Yan
[J]. SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 2, PROCEEDINGS, 2007, : 507 - +
[36] Enhancing the magnitude spectrum of speech features for robust speech recognition
Hung, Jeih-weih
Fan, Hao-teng
Tu, Wen-hsiang
[J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2012,
[37] ROBUST EXCITATION-BASED FEATURES FOR AUTOMATIC SPEECH RECOGNITION
Drugman, Thomas
Stylianou, Yannis
Chen, Langzhou
Chen, Xie
Gales, Mark J. F.
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4664 - 4668
[38] Robust HI and dysarthric speaker recognition - perceptual features and models
Revathi, A.
Nagakrishnan, R.
Sasikaladevi, N.
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (06) : 8215 - 8233
[39] Sound-Event Classification Using Robust Texture Features for Robot Hearing
Ren, Jianfeng
Jiang, Xudong
Yuan, Junsong
Magnenat-Thalmann, Nadia
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (03) : 447 - 458
[40] Deep Learning Based Dereverberation of Temporal Envelopes for Robust Speech Recognition
Purushothaman, Anurenjan
Sreeram, Anirudh
Kumar, Rohit
Ganapathy, Sriram
[J]. INTERSPEECH 2020, 2020, : 1688 - 1692

← 1 2 3 4 5 →