Machine Learning Algorithms for Environmental Sound Recognition: Towards Soundscape Semantics

被引:26
作者
Bountourakis, Vasileios [1 ]
Vrysis, Lazaros [1 ]
Papanikolaou, George [1 ]
机构
[1] Aristotle Univ Thessaloniki, AUTh Univ Campus, Thessaloniki 54124, Greece
来源
PROCEEDINGS OF THE 10TH AUDIO MOSTLY: A CONFERENCE ON INTERACTION WITH SOUND, AM'15 | 2015年
关键词
Environmental Sound Recognition; audio classification; semantic audio analysis; computer audition; feature extraction; feature selection; machine learning algorithms;
D O I
10.1145/2814895.2814905
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper investigates methods aiming at the automatic recognition and classification of discrete environmental sounds, for the purpose of subsequently applying these methods to the recognition of soundscapes. Research in audio recognition has traditionally focused on the domains of speech and music. Comparatively little research has been done towards recognizing non-speech environmental sounds. For this reason, in this paper, we apply existing techniques that have been proved efficient in the other two domains. These techniques are comprehensively compared to determine the most appropriate one for addressing the problem of environmental sound recognition.
引用
收藏
页数:7
相关论文
共 23 条
  • [11] Investigation of broadcast-audio semantic analysis scenarios employing radio-programme-adaptive pattern classification
    Kotsakis, R.
    Kalliris, G.
    Dimoulas, C.
    [J]. SPEECH COMMUNICATION, 2012, 54 (06) : 743 - 762
  • [12] Kotsakis Rigas., 2012, 132 AES CONV, P513
  • [13] Mitrovic D., ELMAR 2009 ELMAR 09, P201
  • [14] Features for Content-Based Audio Retrieval
    Mitrovic, Dalibor
    Zeppelzauer, Matthias
    Breiteneder, Christian
    [J]. ADVANCES IN COMPUTERS, VOL 78: IMPROVING THE WEB, 2010, 78 : 71 - 150
  • [15] Ntalampiras S., 2008, AUTOMATIC RECOGNITIO
  • [16] Peeters G., 2004, Tech. Rep.
  • [17] Powers D., 2008, Journal of Machine Learning Technologies, V2, DOI DOI 10.9735/2229-3981
  • [18] Salamon J., 2014, P 15 INT SOC MUS INF
  • [19] Tsau E., 2012, SIGN INF PROC ASS AN, P1
  • [20] Tsipas N., 2013, AUDIO ENG SOC CONVEN