Spectral Patch Based Sparse Coding for Acoustic Event Detection

被引：0

作者：

Lu, Xugang ^{[1
]}

Tsao, Yu ^{[2
]}

Shen, Peng ^{[1
]}

Hori, Chiori ^{[1
]}

机构：

[1] Natl Inst Informat & Commun Technol, Gaithersburg, MD 20899 USA

[2] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei, Taiwan

来源：

2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2014年

关键词：

Acoustic event detection; sparse coding; support vector machine;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In most algorithms for acoustic event detection (AED), frame based acoustic representations are used in acoustic modeling. Due to lack of context information in feature representation, large model confusions may occur during modeling. We have proposed a feature learning and representation algorithm to explore context information from temporal-frequency patches of signal for AED. With the algorithm, a sparse feature was extracted based on an acoustic dictionary composed of a bag of spectral patches. In our previous algorithm, the feature was obtained based on a definition of Euclidian distance between input signal and acoustic dictionary. In this study, we formulate the sparse feature extraction as l(1) regularization in signal reconstruction. The sparsity of the representation is efficiently controlled via varying a regularization parameter. A support vector machine (SVM) classifier was built on the extracted sparse feature for AED. Our experimental results showed that the spectral patch based sparse representation effectively improved the performance by incorporating temporal-frequency context information in modeling.

引用

页码：317 / +

页数：2

共 13 条

[1] K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation [J].

Aharon, Michal ;

Elad, Michael ;

Bruckstein, Alfred .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) :4311-4322

[2]

Coates A, 2011, P 14 INT C ART INT S, P215

[3]

Fan RE, 2008, J MACH LEARN RES, V9, P1871

[4]

Giannoulis D, 2013, IEEE WORK APPL SIG

[5] Context-dependent sound event detection [J].

Heittola, Toni ;

Mesaros, Annamaria ;

Eronen, Antti ;

Virtanen, Tuomas .

EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2013,

[6]

Huang Z, 2013, INTERSPEECH, P2281

[7]

Lu X., 2014, ICASSP

[8]

Lu XG, 2013, INTERSPEECH, P436

[9]

Mairal J., 2009, P 26 ANN INT C MACH, P689

[10]

Soong F. K., 1985, ICASSP 85. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (Cat. No. 85CH2118-8), P387

← 1 2 →