Selective Gammatone Filterbank Feature for Robust Sound Event Recognition

Authors
Leng, Yi Ren [1 ]
Huy Dat Tran [1 ]
Kitaoka, Norihide [2 ]
Li, Haizhou [1 ]
Affiliations
[1] ASTAR, Inst Infocomm Res, Human Language Technol Dept, Singapore 138632, Singapore
[2] Nagoya Univ, Nagoya, Aichi 4648601, Japan
Keywords
gammatone filterbank; Hidden Markov Model; robust recognition; sound event recognition
DOI
Not available
Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract
This paper introduces a novel feature based on the raw output of the gammatone filterbank. Channel selection is used to enhance robustness over a range of signal-to-noise ratios (SNR) of additive noise. The recognition accuracy of the proposed feature is tested on a sound event database using a Hidden Markov Model (HMM) recogniser. A comparison with a series of similar features and the conventional Mel-Frequency Cepstral Coefficients (MFCC) shows that the proposed feature offers significant improvement in low SNR conditions.
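The abstract does not say how the filterbank is configured or which criterion drives channel selection, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a standard fourth-order gammatone impulse response, centre frequencies spaced on the ERB-number scale (Glasberg and Moore approximation), and a hypothetical selection rule that keeps the k channels with the highest output energy. All function names and parameter values are illustrative.

```python
import math

def erb_bandwidth(fc_hz):
    """Equivalent rectangular bandwidth in Hz (Glasberg and Moore approximation)."""
    return 24.7 * (4.37 * fc_hz / 1000.0 + 1.0)

def erb_spaced_centres(f_low_hz, f_high_hz, n_channels):
    """Centre frequencies spaced evenly on the ERB-number scale."""
    def to_erb(f):
        return 21.4 * math.log10(1.0 + 0.00437 * f)
    def from_erb(e):
        return (10.0 ** (e / 21.4) - 1.0) / 0.00437
    lo, hi = to_erb(f_low_hz), to_erb(f_high_hz)
    step = (hi - lo) / (n_channels - 1)
    return [from_erb(lo + i * step) for i in range(n_channels)]

def gammatone_ir(fc_hz, fs_hz, duration_s=0.05, order=4, b=1.019):
    """Truncated gammatone impulse response, normalised to unit energy."""
    n_taps = int(duration_s * fs_hz)
    decay = 2.0 * math.pi * b * erb_bandwidth(fc_hz)
    ir = []
    for i in range(n_taps):
        t = i / fs_hz
        envelope = t ** (order - 1) * math.exp(-decay * t)
        ir.append(envelope * math.cos(2.0 * math.pi * fc_hz * t))
    norm = math.sqrt(sum(x * x for x in ir))
    return [x / norm for x in ir]

def channel_energies(signal, centres_hz, fs_hz):
    """Energy of each channel's raw output, using direct (full) convolution."""
    energies = []
    for fc in centres_hz:
        ir = gammatone_ir(fc, fs_hz)
        energy = 0.0
        for n in range(len(signal) + len(ir) - 1):
            k_min = max(0, n - len(signal) + 1)
            k_max = min(len(ir) - 1, n)
            y = sum(ir[k] * signal[n - k] for k in range(k_min, k_max + 1))
            energy += y * y
        energies.append(energy)
    return energies

def select_channels(energies, k):
    """Hypothetical rule: indices of the k highest-energy channels, ascending."""
    ranked = sorted(range(len(energies)), key=lambda i: energies[i], reverse=True)
    return sorted(ranked[:k])

# Illustrative input: a 1 kHz tone, 50 ms long, sampled at 8 kHz.
fs = 8000
signal = [math.sin(2.0 * math.pi * 1000.0 * i / fs) for i in range(400)]
centres = erb_spaced_centres(100.0, 3500.0, 8)
energies = channel_energies(signal, centres, fs)
selected = select_channels(energies, 3)
```

Here `selected` holds the indices of the three channels that respond most strongly to the tone. A noise-robust selection rule would more plausibly compare signal and noise estimates per channel, which this sketch does not attempt.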
Pages: 2246 / +
Page count: 2