Machine Learning Algorithms for Environmental Sound Recognition: Towards Soundscape Semantics

被引:26
作者
Bountourakis, Vasileios [1 ]
Vrysis, Lazaros [1 ]
Papanikolaou, George [1 ]
机构
[1] Aristotle Univ Thessaloniki, AUTh Univ Campus, Thessaloniki 54124, Greece
来源
PROCEEDINGS OF THE 10TH AUDIO MOSTLY: A CONFERENCE ON INTERACTION WITH SOUND, AM'15 | 2015年
关键词
Environmental Sound Recognition; audio classification; semantic audio analysis; computer audition; feature extraction; feature selection; machine learning algorithms;
D O I
10.1145/2814895.2814905
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper investigates methods aiming at the automatic recognition and classification of discrete environmental sounds, for the purpose of subsequently applying these methods to the recognition of soundscapes. Research in audio recognition has traditionally focused on the domains of speech and music. Comparatively little research has been done towards recognizing non-speech environmental sounds. For this reason, in this paper, we apply existing techniques that have been proved efficient in the other two domains. These techniques are comprehensively compared to determine the most appropriate one for addressing the problem of environmental sound recognition.
引用
收藏
页数:7
相关论文
共 23 条
  • [1] [Anonymous], THESIS
  • [2] Bountourakis V., 2015, SEMANTIC ANAL ENV SO
  • [3] Bullock J., 2007, P INT COMP MUS C, V43
  • [4] Cannam Chris., 2006, P 7 INT C MUSIC INFO, P324
  • [5] Environmental sound recognition: a survey
    Chachada, Sachin
    Kuo, C. -C. Jay
    [J]. APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2014, 3
  • [6] Environmental Sound Recognition With Time-Frequency Audio Features
    Chu, Selina
    Narayanan, Shrikanth
    Kuo, C. -C. Jay
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (06): : 1142 - 1158
  • [7] Comparison of techniques for environmental sound recognition
    Cowling, M
    Sitte, R
    [J]. PATTERN RECOGNITION LETTERS, 2003, 24 (15) : 2895 - 2907
  • [8] Dimoulas C., 2013, P 134 AES CONVENTION, P509
  • [9] Dimoulas C., 2007, P 122 AES CONV
  • [10] Hall M., 2009, ACM SIGKDD Explor. Newslett., V11, P10, DOI [DOI 10.1145/1656274.1656278, 10.1145/1656274.1656278]