Environment Sound Event Classification With a Two-Stream Convolutional Neural Network

被引:26
|
作者
Dong, Xifeng [1 ]
Yin, Bo [1 ,2 ]
Cong, Yanping [1 ]
Du, Zehua [1 ]
Huang, Xianqing [1 ]
机构
[1] Ocean Univ China, Sch Informat Sci & Engn, Qingdao 266100, Peoples R China
[2] Pilot Natl Lab Marine Sci & Technol, Qingdao 266237, Peoples R China
基金
中国国家自然科学基金;
关键词
Environmental sound classification; sound recognition; convolutional neural networks; data processing; pre-emphasis; two stream model; RECOGNITION; REPRESENTATIONS;
D O I
10.1109/ACCESS.2020.3007906
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, with the construction of intelligent cities, the importance of environmental sound classification (ESC) research has become increasingly prominent. However, due to the non-stationary nature of environment sound and the strong interference of ambient noise, the recognition accuracy of ESC is not high enough. Even with deep learning methods, it is difficult to fully extract features from models with a single input. Aiming to improve the recognition accuracy of ESC, this paper proposes a two-stream convolutional neural network (CNN) based on raw audio CNN (RACNN) and logmel CNN (LMCNN). In this method, a pre-emphasis module is first constructed to deal with raw audio signal. The processed audio data and logmel data are imported into RACNN and LMCNN, respectively to obtain both of time and frequency features of audio. In addition, a random-padding method is proposed to patch shorter data sequences. In such a way, the available data for experiment are greatly increased. Finally, the effectiveness of the methods has been verified based on UrbanSound8K dataset in experimental part.
引用
收藏
页码:125714 / 125721
页数:8
相关论文
共 50 条
  • [31] Two-stream convolutional networks for skin cancer classification
    Mohammed Aloraini
    Multimedia Tools and Applications, 2024, 83 : 30741 - 30753
  • [32] Two-Stream spectral-spatial convolutional capsule network for Hyperspectral image classification
    Zhai, Han
    Zhao, Jie
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 127
  • [33] Constructing two-stream input matrices in a convolutional neural network for photovoltaic power prediction
    Chen, Zhi-ru
    Bai, Yu-long
    Hong, Jun-tao
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 135
  • [34] Rolling Bearing Fault Diagnosis Based on CWT and Two-Stream Convolutional Neural Network
    Wang, Yanping
    Cheng, Longsheng
    Mao, Ting
    Wu, Mengdie
    2024 5TH INFORMATION COMMUNICATION TECHNOLOGIES CONFERENCE, ICTC 2024, 2024, : 1 - 6
  • [35] Driver Distraction Recognition with Pose-aware Two-stream Convolutional Neural Network
    Tao, Chenghao
    Ma, Sheqiang
    Proceedings of SPIE - The International Society for Optical Engineering, 2023, 12790
  • [36] SkinningNet: Two-Stream Graph Convolutional Neural Network for Skinning Prediction of Synthetic Characters
    Mosella-Montoro, Albert
    Ruiz-Hidalgo, Javier
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18572 - 18581
  • [37] Environment Sound Classification Using a Two-Stream CNN Based on Decision-Level Fusion
    Su, Yu
    Zhang, Ke
    Wang, Jingyu
    Madani, Kurosh
    SENSORS, 2019, 19 (07)
  • [38] Two-Stream Convolutional Neural Networks for Emergency Recognition in Images
    Chen, Jia
    Duan, Shihui
    Long, Fei
    Wang, Yongxing
    Wang, Song
    Ling, Qiang
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 6470 - 6474
  • [39] Convolutional Two-Stream Network Fusion for Video Action Recognition
    Feichtenhofer, Christoph
    Pinz, Axel
    Zisserman, Andrew
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1933 - 1941
  • [40] A Two-stream Convolutional Network for Musculoskeletal and Neurological Disorders Prediction
    Zhu, Manli
    Men, Qianhui
    Ho, Edmond S. L.
    Leung, Howard
    Shum, Hubert P. H.
    JOURNAL OF MEDICAL SYSTEMS, 2022, 46 (11)