TEMPORAL CODING OF LOCAL SPECTROGRAM FEATURES FOR ROBUST SOUND RECOGNITION

被引：0

作者：

Dennis, Jonathan ^{[1
]}

Qiang, Yu ^{[1
]}

Tang Huajin ^{[1
]}

Tran Huy Dat ^{[1
]}

Li Haizhou ^{[1
]}

机构：

[1] ASTAR, Inst Infocomm Res, Singapore 138632, Singapore

来源：

2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013年

关键词：

Sound recognition; neural coding; local features; AUTOMATIC SPEECH RECOGNITION; NOISE;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

There is much evidence to suggest that the human auditory system uses localised time-frequency information for the robust recognition of sounds. Despite this, conventional systems typically rely on features extracted from short windowed frames over time,covering the whole frequency spectrum. Such approaches are not inherently robust to noise, as each frame will contain a mixture of the spectral information from noise and signal. Here, we propose a novel approach based on the temporal coding of Local Spectrogram Features (LSFs), which generate spikes that are used to traina Spiking Neural Network (SNN) with temporal learning. LSFs represent robust location information in the spectrogram surrounding keypoints,which are detected in a signal-driven manner such that the effect of noise on the temporal coding is reduced. Our experiments demonstrate the robust performance of our approach a cross a variety of noise conditions, such that it is able to out perform the conventional frame-based baseline methods

引用

页码：803 / 807

页数：5

共 50 条

[1] Overlapping Sound Event Recognition using Local Spectrogram Features with the Generalised Hough Transform
Dennis, Jonathan
Huy Dat Tran
Chng, Eng Siong
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2263 - 2266
[2] Overlapping sound event recognition using local spectrogram features and the generalised hough transform
Dennis, J.
Tran, H. D.
Chng, E. S.
PATTERN RECOGNITION LETTERS, 2013, 34 (09) : 1085 - 1093
[3] Robust local features for remote face recognition
Chen, Jie
Patel, Vishal M.
Liu, Li
Kellokumpu, Vili
Zhao, Guoying
Pietikainen, Matti
Chellappa, Rama
IMAGE AND VISION COMPUTING, 2017, 64 : 34 - 46
[4] AMPLITUDE MODULATION SPECTROGRAM BASED FEATURES FOR ROBUST SPEECH RECOGNITION IN NOISY AND REVERBERANT ENVIRONMENTS
Moritz, Niko
Anemueller, Joern
Kollmeier, Birger
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5492 - 5495
[5] Flow Pattern Recognition Using Spectrogram of Flow Generated Sound with New Adaptive LBP Features
Parsai, Soroosh
Ahmadi, Majid
PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2022, VOL. 3, 2023, 464 : 401 - 413
[6] PaImprint Recognition Based on CNN and Local Coding Features
Yang, Aoqi
Zhang, Jianxin
Sun, Qiule
Zhang, Qiang
PROCEEDINGS OF 2017 6TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2017), 2017, : 482 - 487
[7] Ear recognition via sparse coding of local features
Al Rahhal, Mohamad Mahmoud
Mekhalfi, Mohamed Lamine
Ali, Taghreed Abdullah Mohammed
Bazi, Yakoub
Al Zuair, Mansour
Rangarajan, Lalitha
JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (01)
[8] Sound recognition method for white feather broilers based on spectrogram features and the fusion classification model
Lv, Meixuan
Sun, Zhigang
Zhang, Min
Geng, Renxuan
Gao, Mengmeng
Wang, Guotao
MEASUREMENT, 2023, 222
[9] Robust speech recognition using the modulation spectrogram
Kingsbury, BED
Morgan, N
Greenberg, S
SPEECH COMMUNICATION, 1998, 25 (1-3) : 117 - 132
[10] Animal Sound Recognition Based on Double Feature of Spectrogram
LI Ying
HUANG Hongkeng
WU Zhibin
Chinese Journal of Electronics, 2019, 28 (04) : 667 - 673

← 1 2 3 4 5 →