Selective Gammatone Filterbank Feature for Robust Sound Event Recognition

Authors
Leng, Yi Ren [1]
Huy Dat Tran [1]
Kitaoka, Norihide [2]
Li, Haizhou [1]
Affiliations
[1] ASTAR, Inst Infocomm Res, Human Language Technol Dept, Singapore 138632, Singapore
[2] Nagoya Univ, Nagoya, Aichi 4648601, Japan
Keywords
gammatone filterbank; Hidden Markov Model; robust recognition; sound event recognition
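DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809
Abstract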
This paper introduces a novel feature based on the raw output of the gammatone filterbank. Channel selection is used to enhance robustness over a range of signal-to-noise ratios (SNR) of additive noise. The recognition accuracy of the proposed feature is tested on a sound event database using a Hidden Markov Model (HMM) recogniser. A comparison with a series of similar features and the conventional Mel-Frequency Cepstral Coefficients (MFCC) shows that the proposed feature offers a significant improvement in low-SNR conditions.
Pages: 2246 / +
Page count: 2
Related papers
50 in total
  • [21] Enhanced Local Feature Approach for Overlapping Sound Event Recognition
    Dennis, Jonathan
    Huy Dat Tran
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [22] Gammatone Wavelet Cepstral Coefficients for Robust Speech Recognition
    Adiga, Aniruddha
    Magimai-Doss, Mathew
    Seelamantula, Chandra Sekhar
    2013 IEEE INTERNATIONAL CONFERENCE OF IEEE REGION 10 (TENCON), 2013,
  • [23] Using Blob Detection in Missing Feature Linear-Frequency Cepstral Coefficients for Robust Sound Event Recognition
    Leng, Yi Ren
    Huy Dat Tran
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2505 - 2508
  • [24] ROBUST SOUND EVENT RECOGNITION USING CONVOLUTIONAL NEURAL NETWORKS
    Zhang, Haomin
    McLoughlin, Ian
    Song, Yan
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 559 - 563
  • [25] Unsupervised Singing Voice Separation Using Gammatone Auditory Filterbank and Constraint Robust Principal Component Analysis
    Li, Feng
    Akagi, Masato
    2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1924 - 1928
  • [26] Gammatone features and feature combination for large vocabulary speech recognition
    Schlueter, R.
    Bezrukov, I.
    Wagner, H.
    Ney, H.
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 649 - 652
  • [27] AUDITORY FEATURES BASED ON GAMMATONE FILTERS FOR ROBUST SPEECH RECOGNITION
    Qi, Jun
    Wang, Dong
    Jiang, Yi
    Liu, Runsheng
    2013 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2013, : 305 - 308
  • [28] Alternative Frequency Scale Cepstral Coefficient for Robust Sound Event Recognition
    Leng, Yi Ren
    Huy Dat Tran
    Kitaoka, Norihide
    Li, Haizhou
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 304 - +
  • [29] A Robust Sound Event Recognition Framework Under TV Playing Conditions
    Terence, Ng Wen Zheng
    Dat, Tran Huy
    Dennis, Jonathan
    Siong, Chng Eng
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,
  • [30] Speech Emotion Recognition Using Multichannel Parallel Convolutional Recurrent Neural Networks based on Gammatone Auditory Filterbank
    Peng, Zhichao
    Zhu, Zhi
    Unoki, Masashi
    Dang, Jianwu
    Akagi, Masato
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 1750 - 1755