A Dataset and Taxonomy for Urban Sound Research

Cited by: 674
Authors
Salamon, Justin [1, 2]
Jacoby, Christopher [1]
Bello, Juan Pablo [1]
Affiliations
[1] NYU, Mus & Audio Res Lab, New York, NY 10003 USA
[2] NYU, Ctr Urban Sci & Progress, New York, NY 10003 USA
Source
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14) | 2014
Keywords
Urban sound; dataset; taxonomy; classification;
DOI
10.1145/2647868.2655045
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Automatic urban sound classification is a growing area of research with applications in multimedia retrieval and urban informatics. In this paper we identify two main barriers to research in this area - the lack of a common taxonomy and the scarceness of large, real-world, annotated data. To address these issues we present a taxonomy of urban sounds and a new dataset, UrbanSound, containing 27 hours of audio with 18.5 hours of annotated sound event occurrences across 10 sound classes. The challenges presented by the new dataset are studied through a series of experiments using a baseline classification system.
Pages: 1041-1044
Number of pages: 4