An incremental technique for real-time bioacoustic signal segmentation

Cited by: 48

Authors:

Colonna, Juan Gabriel [1]

Cristo, Marco [1]

Salvatierra Junior, Mario [1]

Nakamura, Eduardo Freire [1]

Affiliations:

[1] Fed Univ Amazonas UFAM, Inst Comp Icomp, Manaus, Amazonas, Brazil

Source:

EXPERT SYSTEMS WITH APPLICATIONS | 2015 / Volume 42 / Issue 21

Keywords:

Bioacoustic signal segmentation; Wireless Sensor Networks; Unsupervised learning; Stream data mining; CLASSIFICATION;

DOI:

10.1016/j.eswa.2015.05.030

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

A bioacoustical animal recognition system is composed of two parts: (1) the segmenter, responsible for detecting syllables (animal vocalizations) in the audio; and (2) the classifier, which determines the species or animal to which the syllables belong. In this work, we first present a novel technique for automatic segmentation of anuran calls in real time; then, we present a method to assess the performance of the whole system. The proposed segmentation method performs an unsupervised binary classification of time series (audio) by incrementally computing two exponentially weighted features (Energy and Zero Crossing Rate). In our proposal, classical sliding temporal windows are replaced with counters that give higher weights to new data, allowing us to distinguish between a syllable and ambient noise (treated as silence). Compared to sliding-window approaches, our proposal has a lower memory cost and a higher processing speed. Our evaluation of the segmentation component considers three metrics: (1) the Matthews Correlation Coefficient for point-to-point comparison; (2) WinPR to quantify the precision of boundaries; and (3) the AEER for event-to-event counting. The experiments were carried out on a dataset with 896 syllables from seven different species of anurans. To evaluate the whole system, we derived four equations that help to understand the impact that the precision and recall of the segmentation component have on the classification task. Finally, our experiments show a segmentation/recognition improvement of 37%, while reducing memory and data communication. Therefore, the results suggest that our proposal is suitable for resource-constrained systems, such as Wireless Sensor Networks (WSNs). (C) 2015 Elsevier Ltd. All rights reserved.
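
The incremental idea described in the abstract can be illustrated with a minimal sketch. The abstract does not give the update equations, so this assumes a standard exponentially weighted moving average; the function name `ew_features` and the smoothing factor `alpha` are hypothetical and are not taken from the paper. The point it shows is that each feature needs only one stored value per stream rather than a buffer of past samples.

```python
import math


def ew_features(samples, alpha=0.05):
    """Yield (energy, zcr) estimates, updated one sample at a time.

    Assumption: a plain exponentially weighted moving average, where
    `alpha` (hypothetical parameter) is the weight given to the newest
    sample. The paper's exact update rule may differ.
    """
    energy = 0.0  # running estimate of the mean squared amplitude
    zcr = 0.0     # running estimate of the zero-crossing rate
    prev = 0.0    # previous sample, needed to detect a sign change
    for x in samples:
        crossing = 1.0 if (x >= 0) != (prev >= 0) else 0.0
        energy = (1 - alpha) * energy + alpha * x * x
        zcr = (1 - alpha) * zcr + alpha * crossing
        prev = x
        yield energy, zcr


# Synthetic input: 100 samples of silence followed by a sine burst.
signal = [0.0] * 100 + [math.sin(2 * math.pi * 0.1 * n) for n in range(500)]
features = list(ew_features(signal))
```

In this sketch the energy estimate stays at zero during the silent prefix and rises once the burst starts. Turning these two features into a syllable/silence decision requires a rule that the abstract does not specify, so none is shown here.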


Pages: 7367-7374

Number of pages: 8

