Using Deep Autoencoders for In-vehicle Audio Anomaly Detection

被引:10
作者
Pereira, Pedro Jose [1 ]
Coelho, Gabriel [1 ]
Ribeiro, Alexandrine [2 ]
Matos, Luis Miguel [1 ]
Nunes, Eduardo C. [1 ]
Ferreira, Andre [3 ]
Pilastri, Andre [2 ]
Cortez, Paulo [1 ]
机构
[1] Univ Minho, Dept Informat Syst, ALGORITMI Ctr, Guimaraes, Portugal
[2] CCG ZGDV Inst, EPMQ IT, Guimaraes, Portugal
[3] Bosch Car Multimedia SA, Braga, Portugal
来源
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021) | 2021年 / 192卷
关键词
Anomaly Detection; Audio Input Representation; Deep Learning; In-vehicle Data; Unsupervised Learning;
D O I
10.1016/j.procs.2021.08.031
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Current developments on self-driving cars have increased the interest on autonomous shared taxicabs. While most self-driving technologies focus on the outside environment, there is also a need to provide in-vehicle intelligence (e.g., detect health and safety issues related with the car occupants). Set within an R&D project focused on in-vehicle cockpit intelligence, the research presented in this paper addresses an unsupervised Acoustic Anomaly Detection (AAD) task. Since data is nonexistent in this domain, we first design an in-vehicle sound event data simulator that can realistically mix background audios (recorded from car driving trips) with normal (e.g., people talking, radio on) and abnormal (e.g., people arguing, cough) event sounds, allowing the generation of three synthetic in-vehicle sound datasets. Then, we explore two main sound feature extraction methods (based on a combination of three audio features and mel frequency energy coefficients) and propose a novel Long Short-Term Memory Autoencoder (LSTM-AE) deep learning architecture for in-vehicle sound anomaly detection. Competitive results were achieved by the proposed LSTM-AE when compared with two state-of-the-art methods, namely a dense Autoencoder (AE) and a two-stage clustering. (C) 2021 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (https://crativecommons.org/licenses/by-nc-nd/4.0) Peer-review under responsibility of the scientific committee of KES International.
引用
收藏
页码:298 / 307
页数:10
相关论文
共 17 条
  • [1] Bu SJ, 2019, IEEE INT CONF BIG DA, P3545, DOI 10.1109/BigData47090.2019.9005960
  • [2] Duman T.B., 2020, INT WORKSH SOFT COMP, P432
  • [3] Unsupervised log message anomaly detection
    Farzad, Amir
    Gulliver, T. Aaron
    [J]. ICT EXPRESS, 2020, 6 (03): : 229 - 237
  • [4] An introduction to ROC analysis
    Fawcett, Tom
    [J]. PATTERN RECOGNITION LETTERS, 2006, 27 (08) : 861 - 874
  • [5] Gulli A., 2017, Packt
  • [6] IRESE: An intelligent rare-event detection system using unsupervised learning on the IoT edge
    Janjua, Zaffar Haider
    Vecchio, Massimo
    Antonini, Mattia
    Antonelli, Fabio
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 84 : 41 - 50
  • [7] Autonomous Taxi Service Design and User Experience
    Kim, Sangwon
    Chang, Jennifer Jah Eun
    Park, Hyun Ho
    Song, Seon Uk
    Cha, Chang Bae
    Kim, Ji Won
    Kang, Namwoo
    [J]. INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 2020, 36 (05) : 429 - 448
  • [8] Kingma DP., 2014, P 2 INT C LEARN REPR
  • [9] Koizumi Yuma, 2020, ARXIV200605822
  • [10] Koops HV, 2015, 2015 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), P422, DOI 10.1109/ICDSP.2015.7251906