Sparse Representation with Temporal Max-Smoothing for Acoustic Event Detection

被引:0
|
作者
Lu, Xugang [1 ]
Shen, Peng [1 ]
Tsao, Yu [2 ]
Hori, Chiori [1 ]
Kawai, Hisashi [1 ]
机构
[1] Natl Inst Informat & Commun Technol, Koganei, Tokyo, Japan
[2] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei, Taiwan
来源
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5 | 2015年
关键词
Feature learning; matching pursuit; temporal max-smoothing; acoustic event detection;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In order to incorporate long temporal-frequency structure for acoustic event detection, we have proposed a spectral patch based learning and representation method. The learned spectral patches were regarded as acoustic words which were further used in sparse encoding for acoustic feature representation and modeling. In our previous study, during feature encoding stage, each spectral patch was encoded independently. Considering that spectral patches taken from a time sequence should keep similar representations for neighboring patches after encoding, in this study, we propose to enhance the temporal correlation of feature representation using a temporal max-smoothing algorithm. The max-smoothing tries to pick up the maximum response in a local time window as the representative feature for detection task. We tested the new feature for automatic detection of acoustic events which were selected from lecture audio data. Experimental results showed that the temporal max-smoothing significantly improved the performance.
引用
收藏
页码:1176 / 1180
页数:5
相关论文
共 50 条
  • [21] EAR-TUKE: The Acoustic Event Detection System
    Lojka, Martin
    Pleva, Matus
    Kiktova, Eva
    Juhar, Jozef
    Cizmar, Anton
    MULTIMEDIA COMMUNICATIONS, SERVICES AND SECURITY, MCSS 2014, 2014, 429 : 137 - 148
  • [22] A DATABASE AND CHALLENGE FOR ACOUSTIC SCENE CLASSIFICATION AND EVENT DETECTION
    Giannoulis, Dimitrios
    Stowell, Dan
    Benetos, Emmanouil
    Rossignol, Mathias
    Lagrange, Mathieu
    Plumbley, Mark D.
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [23] "WOW!" BAYESIAN SURPRISE FOR SALIENT ACOUSTIC EVENT DETECTION
    Schauerte, B.
    Stiefelhagen, R.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6402 - 6406
  • [24] Acoustic Event Detection for Spotting "Hot Spots" in Podcasts
    Sumi, Kouhei
    Kawahara, Tatsuya
    Ogata, Jun
    Goto, Masataka
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1171 - +
  • [25] A TANDEM CONNECTIONIST MODEL USING COMBINATION OF MULTI-SCALE SPECTRO-TEMPORAL FEATURES FOR ACOUSTIC EVENT DETECTION
    Espi, Miquel
    Fujimoto, Masakiyo
    Saito, Daisuke
    Ono, Nobutaka
    Sagayama, Shigeki
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4293 - 4296
  • [26] Compression of Acoustic Event Detection Models With Quantized Distillation
    Shi, Bowen
    Sun, Ming
    Kao, Chieh-Chi
    Rozgic, Viktor
    Matsoukas, Spyros
    Wang, Chao
    INTERSPEECH 2019, 2019, : 3639 - 3643
  • [27] ACOUSTIC SCENE CLASSIFICATION USING SPARSE FEATURE LEARNING AND EVENT-BASED POOLING
    Lee, Kyogu
    Hyung, Ziwon
    Nam, Juhan
    2013 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2013,
  • [28] Comparison of Feature Selection Algorithms for Acoustic Event Detection System
    Kiktova, Eva
    Lojka, Martin
    Juhar, Jozef
    Cizmar, Anton
    2014 56TH INTERNATIONAL SYMPOSIUM ELMAR (ELMAR), 2014, : 47 - 50
  • [29] Sound learning–based event detection for acoustic surveillance sensors
    Jeong-Sik Park
    Seok-Hoon Kim
    Multimedia Tools and Applications, 2020, 79 : 16127 - 16139
  • [30] Bag-of-Features Methods for Acoustic Event Detection and Classification
    Grzeszick, Rene
    Plinge, Axel
    Fink, Gernot A.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (06) : 1242 - 1252