"Seeing Sound": Audio Classification Using theWigner-Ville Distribution and Convolutional Neural Networks

被引：1

作者：

Marios, Christonasis Antonios ^{[1
]}

van Eijndhoven, Stef ^{[1
]}

Duin, Peter ^{[2
]}

机构：

[1] Tech Univ Eindhoven TUE, Engn Doctorate Data Sci, Eindhoven, Netherlands

[2] Dutch Natl Police, Eindhoven, Netherlands

来源：

INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 1, INTELLISYS 2023 | 2024年 / 822卷

关键词：

Audio classification; Sound classification; Wigner-Ville distribution; Spectrogram; Convolutional neural networks; Deep learning; Machine learning; Time-Frequency analysis; Time-Frequency distributions; VGG; Imagenet; Audioset;

D O I：

10.1007/978-3-031-47721-8_9

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With big data becoming increasingly available, IoT hardware becoming widely adopted, and AI capabilities becoming more powerful, organizations are continuously investing in sensing. Data coming from sensor networks are currently combined with sensor fusion and AI algorithms to drive innovation in fields such as self-driving cars. Data from these sensors can be utilized in numerous use cases, including alerts in safety systems of urban settings, for events such as gun shots and explosions. Moreover, diverse types of sensors, such as sound sensors, can be utilized in low-light conditions or at locations where a camera is not available. This paper investigates the potential of the utilization of sound-sensor data in an urban context. Technically, we propose a novel approach of classifying sound data using theWigner-Ville distribution and Convolutional Neural Networks. The Wigner-Ville distribution has not been considered in similar applications in literature. In this paper, we report on the performance of the approach on open-source datasets. The concept and work presented is based on my doctoral thesis, which was performed as part of the Engineering Doctorate program in Data Science at the University of Eindhoven, in collaboration with the Dutch National Police. Additional work on real-world datasets was performed during the thesis, which is not presented here due to confidentiality.

引用

页码：145 / 155

页数：11

共 17 条

[1]

[Anonymous], 2013, Discussion on Time-Frequency Distributions

[2]

Boualem B., 1986, Seismic applications of theWigner-Ville distribution, V4

[3] Arrhythmia Detection Using Wigner-Ville Distribution Based Neural Network [J].

Dash, Sanjit K. ;

Rao, G. Sasibhusan .

INTERNATIONAL CONFERENCE ON COMPUTATIONAL MODELLING AND SECURITY (CMS 2016), 2016, 85 :806-811

[4] Vehicle classification in distributed sensor networks [J].

Duarte, MF ;

Hu, YH .

JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2004, 64 (07) :826-838

[5]

Gemmeke JF, 2017, INT CONF ACOUST SPEE, P776, DOI 10.1109/ICASSP.2017.7952261

[6]

Hershey S., 2016, CNN architectures for large-scale audio classification

[7]

Hershey S, 2017, INT CONF ACOUST SPEE, P131, DOI 10.1109/ICASSP.2017.7952132

[8]

Hormigo J, 2003, Adv. Imaging Electron Phys., V14

[9] ImageNet Classification with Deep Convolutional Neural Networks [J].

Krizhevsky, Alex ;

Sutskever, Ilya ;

Hinton, Geoffrey E. .

COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90

[10]

Moss J.C., Acomparison between the modified spectrogram and the pseudoWigner-Ville distribution with and without modification

← 1 2 →