Selective Gammatone Filterbank Feature for Robust Sound Event Recognition

被引：0

作者：

Leng, Yi Ren ^{[1
]}

Huy Dat Tran ^{[1
]}

Kitaoka, Norihide ^{[2
]}

Li, Haizhou ^{[1
]}

机构：

[1] ASTAR, Inst Infocomm Res, Human Language Technol Dept, Singapore 138632, Singapore

[2] Nagoya Univ, Nagoya, Aichi 4648601, Japan

来源：

11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 | 2010年

关键词：

gammatone filterbank; Hidden Markov Model; robust recognition; sound event recognition;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper introduces a novel feature based on the raw output of the garnmatone filterbank. Channel selection is used to enhance robustness over a range of signal-to-noise ratios (SNR) of additive noise. The recognition accuracy of the proposed feature is tested on a sound event database using a Hidden Markov Model (HMM) recogniser. A comparison with a series of similar features and the conventional Mel-Frequency Cepstral Coefficients (MFCC) shows that the proposed feature offers significant improvement in low SNR conditions.

引用

页码：2246 / +

页数：2

共 50 条

[1] Selective Gammatone Envelope Feature for Robust Sound Event Recognition
Leng, Yi Ren
Huy Dat Tran
Kitaoka, Norihide
Li, Haizhou
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (05): : 1229 - 1237
[2] Voice biometric feature using Gammatone filterbank and ICA
Abdulla, Waleed H.
Zhang, Yushi
INTERNATIONAL JOURNAL OF BIOMETRICS, 2010, 2 (04) : 330 - 349
[3] Novel Gammatone Filterbank Based Spectro-Temporal Features for Robust Phoneme Recognition
Nagpal, Ankit
Patil, Hemant A.
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 342 - 350
[4] Acoustic Event Filterbank for Enabling Robust Event Recognition by Cleaning Robot
Park, Sangwook
Choi, Woohyun
Han, David K.
Ko, Hanseok
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2015, 61 (02) : 189 - 196
[5] Whispered speech recognition based on gammatone filterbank cepstral coefficients
B. Marković
J. Galić
Ð. Grozdić
S. T. Jovičić
M. Mijić
Journal of Communications Technology and Electronics, 2017, 62 : 1255 - 1261
[6] Whispered Speech Recognition Based on Gammatone Filterbank Cepstral Coefficients
Markovic, B.
Galic, J.
Grozdic, D.
Jovicic, S. T.
Mijic, M.
JOURNAL OF COMMUNICATIONS TECHNOLOGY AND ELECTRONICS, 2017, 62 (11) : 1255 - 1261
[7] Acoustic features for speech recognition based on Gammatone filterbank and instantaneous frequency
Yin, Hui
Hohmann, Volker
Nadeu, Climent
SPEECH COMMUNICATION, 2011, 53 (05) : 707 - 715
[8] Filterbank Analysis of MFCC Feature Extraction in Robust Children Speech Recognition
Naing, Hay Mar Soe
Miyanaga, Yoshikazu
Hidayat, Risanuri
Winduratna, Bondhan
2019 INTERNATIONAL SYMPOSIUM ON MULTIMEDIA AND COMMUNICATION TECHNOLOGY (ISMAC), 2019,
[9] Underwater Acoustic Target Recognition Based on Gammatone Filterbank and Instantaneous Frequency
Lian, Zixu
Xu, Ke
Wan, Jianwei
Li, Gang
Chen, Yong
2017 IEEE 9TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN), 2017, : 1207 - 1211
[10] A novel hybrid feature method based on Caelen auditory model and gammatone filterbank for robust speaker recognition under noisy environment and speech coding distortion
Krobba, Ahmed
Debyeche, Mohamed
Selouani, Sid Ahmed
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (11) : 16195 - 16212

← 1 2 3 4 5 →