Detecting novel objects in acoustic scenes through classifier incongruence

被引:0
作者
Bach, Joerg-Hendrik [1 ]
Anemueller, Joern [1 ]
机构
[1] Carl von Ossietzky Univ Oldenburg, D-26111 Oldenburg, Germany
来源
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 | 2010年
关键词
sound classification; acoustic objects; event detection; novelty detection; modulation spectrogram; NOVELTY DETECTION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this study, a new generic framework for the detection and interpretation of disagreement ("incongruence") between different classifiers [1] is applied to the problem of detecting novel acoustic objects in an office environment. Using a general model that detects generic acoustic objects (standing out from a stationary background) and specific models tuned to particular sounds expected in the office, a novel object is detected as an incongruence between the models: the general model detects it as a generic object, but the specific models can not identify it as any of the known office-related sources. The detectors are realized using amplitude modulation spectrogram and RASTA-PLP features with support vector machine classification. Data considered are speech and non-speech sounds embedded in real office background at signal-to-noise ratios (SNR) from +20 dB to -20 dB. Our approach yields approximately 90% hit rate for novel events at -20 dB SNR, 75% at 0 dB and reaches chance level below -10 dB.
引用
收藏
页码:2206 / 2209
页数:4
相关论文
共 15 条
  • [11] Novelty detection: a review - part 1: statistical approaches
    Markou, M
    Singh, S
    [J]. SIGNAL PROCESSING, 2003, 83 (12) : 2481 - 2497
  • [12] DISAMBIGUATING SOUND THROUGH CONTEXT
    Niessen, Maria E.
    Van Maanen, Leendert
    Andringa, Tjeerd C.
    [J]. INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2008, 2 (03) : 327 - 341
  • [13] Layered representations for learning and inferring office activity from multiple sensory channels
    Oliver, N
    Garg, A
    Horvitz, E
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2004, 96 (02) : 163 - 180
  • [14] Rouvier M., 2009, P INT, P1155
  • [15] Weinshall D., 2008, NIPS 2008, P1745