Auto-Associative Recurrent Neural Networks and Long Term Dependencies in Novelty Detection for Audio Surveillance Applications

被引:0
作者
Rossi, A. [1 ]
Montefoschi, F. [1 ]
Rizzo, A. [1 ]
Diligenti, M. [2 ]
Festucci, C. [3 ]
机构
[1] Univ Siena, Dept Social Polit & Cognit Sci, Siena, Italy
[2] Univ Siena, Dept Informat Engn & Math, Siena, Italy
[3] Banca Monte Paschi Siena, Siena, Italy
来源
2017 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE APPLICATIONS AND TECHNOLOGIES (AIAAT 2017) | 2017年 / 261卷
关键词
D O I
10.1088/1757-899X/261/1/012009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine Learning applied to Automatic Audio Surveillance has been attracting increasing attention in recent years. In spite of several investigations based on a large number of different approaches, little attention had been paid to the environmental temporal evolution of the input signal. In this work, we propose an exploration in this direction comparing the temporal correlations extracted at the feature level with the one learned by a representational structure. To this aim we analysed the prediction performances of a Recurrent Neural Network architecture varying the length of the processed input sequence and the size of the time window used in the feature extraction. Results corroborated the hypothesis that sequential models work better when dealing with data characterized by temporal order. However, so far the optimization of the temporal dimension remains an open issue.
引用
收藏
页数:8
相关论文
共 28 条
[1]  
[Anonymous], 2013, P 2013 C EMPIRICAL M
[2]  
[Anonymous], 2006, Proc. Adv. Neural Inf. Process. Syst.
[3]  
Antti, 2006, IEEE T AUDIO SPEECH, V14, P321
[4]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[5]   NOVELTY DETECTION AND NEURAL-NETWORK VALIDATION [J].
BISHOP, CM .
IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1994, 141 (04) :217-222
[6]  
Chachada S., 2013, Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013, P1, DOI [10.1109/APSIPA.2013.6694338, DOI 10.1109/APSIPA.2013.6694338]
[7]  
Cho K, 2014, ARXIV14061078, P1724, DOI [DOI 10.3115/V1/D14-1179, 10.3115/V1/D14-1179]
[8]  
Choi W, 2012, ADV VIDEO SIGNAL BAS, P118
[9]   Comparison of techniques for environmental sound recognition [J].
Cowling, M ;
Sitte, R .
PATTERN RECOGNITION LETTERS, 2003, 24 (15) :2895-2907
[10]   Audio Surveillance: A Systematic Review [J].
Crocco, Marco ;
Cristani, Marco ;
Trucco, Andrea ;
Murino, Vittorio .
ACM COMPUTING SURVEYS, 2016, 48 (04)