A Fast One-Pass-Training Feature Selection Technique for GMM-based Acoustic Event Detection with Audio-Visual Data

被引:0
作者
Butko, Taras [1 ]
Nadeu, Climent [1 ]
机构
[1] Univ Politecn Cataluna, Dept Signal Theory & Commun, TALP Res Ctr, Barcelona, Spain
来源
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4 | 2010年
关键词
acoustic event detection; feature selection; hill-climbing approach; hidden Markov models; one-against-all strategy; GMMs; CLASSIFICATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Acoustic event detection becomes a difficult task, even for a small number of events, in scenarios where events are produced rather spontaneously and often overlap in time. In this work, we aim to improve the detection rate by means of feature selection. Using a one-against-all detection approach, a new fast one-pass-training algorithm, and an associated highly-precise metric are developed. Choosing a different subset of multimodal features for each acoustic event class, the results obtained from audiovisual data collected in the UPC multimodal room show an improvement in average detection rate with respect to using the whole set of features.
引用
收藏
页码:2338 / 2341
页数:4
相关论文
共 10 条
  • [1] [Anonymous], 2000, Pattern Classification
  • [2] Butko T., 2009, P INTERSPEECH
  • [3] Wrappers for feature subset selection
    Kohavi, R
    John, GH
    [J]. ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) : 273 - 324
  • [4] Classification of general audio data for content-based retrieval
    Li, DG
    Sethi, IK
    Dimitrova, N
    McGee, T
    [J]. PATTERN RECOGNITION LETTERS, 2001, 22 (05) : 533 - 544
  • [5] Peeters G., 2004, CUIDADO 1 PROJECT RE
  • [6] Rifkin R, 2004, J MACH LEARN RES, V5, P101
  • [7] Srinivasan S. H., 2004, IEEE P ICASSP, P321
  • [8] TEMKO A, 2007, LNCS, V4122
  • [9] Acoustic event detection in meeting-room environments
    Temko, Andrey
    Nadeu, Climent
    [J]. PATTERN RECOGNITION LETTERS, 2009, 30 (14) : 1281 - 1288
  • [10] ZHOU X, 2008, LNCS, V4625