One-shot learning for acoustic identification of bird species in non-stationary environments

Cited by: 13
Authors
Acconcjaioco, Michelangelo [1]
Ntalampiras, Stavros [1]
Affiliations
[1] Univ Milan, Dept Comp Sci, I-20133 Milan, Italy
Source
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2021
Keywords
RECOGNITION;
DOI
10.1109/ICPR48806.2021.9412005
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This work introduces the one-shot learning paradigm to the computational bioacoustics domain. Although most of the related literature assumes the availability of data characterizing the entire class dictionary of the problem at hand, this is rarely true, as a habitat's species composition is only known to a certain extent. Thus, the problem needs to be addressed by methodologies able to cope with non-stationarity. To this end, we propose a framework able to detect changes in the class dictionary and incorporate new classes on the fly. We design a one-shot learning architecture composed of a Siamese Neural Network operating in the log-Mel spectrogram space. We extensively examine the proposed approach on two datasets of various bird species using suitable figures of merit. Interestingly, such a learning scheme exhibits state-of-the-art performance while taking into account cases of extreme non-stationarity.
Pages: 755-762
Number of pages: 8