MELAUDIS: A Large-Scale Benchmark Acoustic Dataset For Intelligent Transportation Systems Research

被引:1
作者
Parineh, Hossein [1 ]
Sarvi, Majid [1 ]
Bagloee, Saeed Asadi [1 ]
机构
[1] Univ Melbourne, Dept Infrastructure Engn, Melbourne 3053, Australia
关键词
VEHICLE SPEED ESTIMATION;
D O I
10.1038/s41597-025-04689-3
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Acoustic traffic sensors provide valuable information about road traffic at an affordable cost, gaining significant attention in recent years. However, the field of audio signal processing for Intelligent Transportation Systems (ITS) lacks real-world complex datasets. We introduce MELAUDIS, the first comprehensive real-world dataset designed for vehicle detection, traffic status monitoring, and vehicle type classification. In MELAUDIS audio recordings from multi-lane roads with two-way traffic are provided that encompass various traffic conditions, ambient noises, and weather settings, including urban environments and rainy weather. The dataset includes six vehicle types, bicycles, motorcycles, cars, buses, trucks, and trams, in both single-vehicle and multi-vehicle contexts. It is comprised of 5,792 background noise recordings, 7,345 vehicle sound samples, and 2,955 idling sound recordings, making it the largest urban acoustic dataset. Labeling and data cleansing required over 1,200 man-hours, improving classification accuracy from 65.1% to 82.84% using log-mel-spectrograms and CNNs. By offering a diverse range of labeled audio recordings, MELAUDIS serves as a benchmark to advance research in ITS.
引用
收藏
页数:13
相关论文
共 45 条
[1]  
Abdic I, 2016, INT C PATT RECOG, P3458, DOI 10.1109/ICPR.2016.7900169
[2]  
Abesser J, 2021, EUR SIGNAL PR CONF, P551
[3]  
Aljaafreh Ahmad, 2010, 2010 International Conference on Networking, Sensing and Control (ICNSC 2010), P570, DOI 10.1109/ICNSC.2010.5461596
[4]   Large-scale audio dataset for emergency vehicle sirens and road noises [J].
Asif, Muhammad ;
Usaid, Muhammad ;
Rashid, Munaf ;
Rajab, Tabarka ;
Hussain, Samreen ;
Wasi, Sarwar .
SCIENTIFIC DATA, 2022, 9 (01)
[5]  
Bakirci E., 2022, INTERNOISE 2022, DOI [10.3397/in20220766, DOI 10.3397/IN20220766]
[6]  
Brockmann EM, 1997, IEEE SYS MAN CYBERN, P3817, DOI 10.1109/ICSMC.1997.633265
[7]   Traffic Density Classification Using Sound Datasets: An Empirical Study on Traffic Flow at Asymmetric Roads [J].
Bui, Khac-Hoai Nam ;
Oh, Hyeonjeong ;
Yi, Hongsuk .
IEEE ACCESS, 2020, 8 :125671-125679
[8]  
Bulatovic N, 2022, 2022 26 INT C INF TE, P1, DOI [DOI 10.1109/IT54280.2022.9743540, 10.1109/IT54280.2022.9743540]
[9]   Towards a system for automatic traffic sound event detection [J].
Chavdar, Marko ;
Gerazov, Branislav ;
Ivanovski, Zoran ;
Kartalov, Tomislav .
2020 28TH TELECOMMUNICATIONS FORUM (TELFOR), 2020, :209-212
[10]   Distributed Acoustic Sensor Systems for Vehicle Detection and Classification [J].
Chiang, Chia-Yen ;
Jaber, Mona ;
Chai, Kok Keong ;
Loo, Jonathan .
IEEE ACCESS, 2023, 11 :31293-31303