Audio Classification with Thermodynamic Criteria

Cited by: 2
Authors
Singh, Rita [1]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
Source
2014 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E) | 2014
Keywords
Bayesian classification; Free energy; Entropy; Temperature; Audio classification; Speech recognition
DOI
10.1109/IC2E.2014.23
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Detecting sound events in audio recordings is a challenging problem. A detector must be trained for each sound to be classified. However, the recordings of the examples used to train the detector rarely match the conditions found in the test audio to be classified. If the event detection problem is posed as one of Bayes classification, the problem may be viewed as one of mismatch between the true distribution of the data and that represented by the classifier. The Bayes classification rule results in suboptimal performance under such mismatch, and a modified classification rule is required. Alternately stated, the classification rule must optimize a different objective criterion than the Bayes error rate computed from the training distributions. The use of entropy as an optimization criterion for various classification tasks has been well established in the literature. In this paper we show that free-energy, a thermodynamic concept directly related to entropy, can also be used as an objective criterion for classification in such scenarios. We demonstrate with examples on classification with HMMs that minimization of free-energy is an effective criterion for classification under conditions of mismatch.
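This record does not reproduce the paper's formulation, so the following is only a minimal sketch of one common statistical-physics reading of the abstract. It assumes that the energy of an HMM state path s is E(s) = -log P(x, s), and that the free energy at temperature T is F_T = -T log Σ_s exp(-E(s)/T). Under that assumption, T = 1 gives the usual negative log-likelihood (the Bayes rule score), and T → 0 approaches the best-path (Viterbi) score. The function names, the discrete-observation HMM, and the model layout are all hypothetical and chosen for illustration.

```python
import math


def logsumexp(values):
    """Numerically stable log(sum(exp(v) for v in values))."""
    m = max(values)
    if m == -math.inf:
        return -math.inf
    return m + math.log(sum(math.exp(v - m) for v in values))


def hmm_free_energy(obs, log_pi, log_A, log_B, temperature=1.0):
    """Free energy F_T = -T * log sum_s P(obs, s)^(1/T) of a discrete HMM.

    Assumption (not taken from the paper): path energy is -log P(obs, s).
    obs    : list of observation symbol indices
    log_pi : log initial state probabilities, length n
    log_A  : log transition matrix, log_A[i][j] = log P(j | i)
    log_B  : log emission matrix, log_B[j][k] = log P(symbol k | state j)
    """
    beta = 1.0 / temperature
    n = len(log_pi)
    # Tempered forward recursion: every probability factor is raised to 1/T.
    log_alpha = [beta * (log_pi[j] + log_B[j][obs[0]]) for j in range(n)]
    for x in obs[1:]:
        log_alpha = [
            beta * log_B[j][x]
            + logsumexp([log_alpha[i] + beta * log_A[i][j] for i in range(n)])
            for j in range(n)
        ]
    return -temperature * logsumexp(log_alpha)


def classify(obs, models, temperature=1.0):
    """Return the class whose HMM assigns the lowest free energy to obs.

    models maps a class name to a (log_pi, log_A, log_B) tuple.
    """
    return min(
        models,
        key=lambda name: hmm_free_energy(obs, *models[name], temperature=temperature),
    )
```

In this sketch the temperature is a free parameter: a value other than 1 changes how much weight the score gives to the dominant state paths relative to the full sum, which is one way a classification rule could depart from the standard Bayes rule under mismatch. How the paper selects or uses the temperature is not stated in this record.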
Pages: 526-533
Number of pages: 8