Automatic Environmental Sound Recognition: Performance Versus Computational Cost

被引：56

作者：

Sigtia, Siddharth ^{[1
]}

Stark, Adam M. ^{[1
]}

Krstulovic, Sacha ^{[2
]}

Plumbley, Mark D. ^{[3
]}

机构：

[1] Queen Mary Univ London, London E1 4NS, England

[2] Audio Analyt Ltd, Cambridge CB2 3AH, England

[3] Univ Surrey, Guildford GU2 7XH, Surrey, England

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2016年 / 24卷 / 11期

基金：

英国工程与自然科学研究理事会;

关键词：

Automatic environmental sound recognition; computational auditory scene analysis; deep learning; machine learning;

D O I：

10.1109/TASLP.2016.2592698

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In the context of the Internet of Things, sound sensing applications are required to run on embedded platforms where notions of product pricing and form factor impose hard constraints on the available computing power. Whereas Automatic Environmental Sound Recognition (AESR) algorithms are most often developed with limited consideration for computational cost, this paper seeks which AESR algorithm can make the most of a limited amount of computing power by comparing the sound classification performance as a function of its computational cost. Results suggest that Deep Neural Networks yield the best ratio of sound classification accuracy across a range of computational costs, while Gaussian Mixture Models offer a reasonable accuracy at a consistently small cost, and Support Vector Machines stand between both in terms of compromise between accuracy and computational cost.

引用

页码：2096 / 2107

页数：12

共 57 条

[1]

[Anonymous], 2006, PROC IEEE INT C ACOU

[2]

[Anonymous], 2009, Computer Arithmetic: Algorithms and Hardware Designs

[3]

[Anonymous], 1993, Fundamentals of speech recognition

[4]

[Anonymous], 2014, Big Data Internet of Things: A Roadmap Smart Environments

[5]

Arthur D, 2007, PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P1027

[6]

Bengio Y, 2013, INT CONF ACOUST SPEE, P8624, DOI 10.1109/ICASSP.2013.6639349

[7]

Beritelli F., 2008, Signal Processing and Communication Systems, P1

[8]

Bishop C.M., 2006, PATTERN RECOGN, V4, P738, DOI DOI 10.1117/1.2819119

[9]

Bonfigli R, 2014, 2014 6TH EUROPEAN EMBEDDED DESIGN IN EDUCATION AND RESEARCH CONFERENCE (EDERC), P307, DOI 10.1109/EDERC.2014.6924410

[10] A tutorial on Support Vector Machines for pattern recognition [J].

Burges, CJC .

DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167

← 1 2 3 4 5 6 →