Domestic Activities Classification from Audio Recordings Using Multi-scale Dilated Depthwise Separable Convolutional Network

被引:1
|
作者
Zeng, Yufei [1 ]
Li, Yanxiong [1 ]
Zhou, Zhenfeng [1 ]
Wang, Ruiqi [1 ]
Lu, Difeng [1 ]
机构
[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Peoples R China
来源
IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP) | 2021年
基金
中国国家自然科学基金;
关键词
Domestic activities classification; multi-scale embedding; dilated convolution; depthwise separable convolution; NEURAL-NETWORK; SCENE;
D O I
10.1109/MMSP53017.2021.9733646
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Domestic activities classification (DAC) from audio recordings aims at classifying audio recordings into pre-defined categories of domestic activities, which is an effective way for estimation of daily activities performed in home environment. In this paper, we propose a method for DAC from audio recordings using a multi-scale dilated depthwise separable convolutional network (DSCN). The DSCN is a lightweight neural network with small size of parameters and thus suitable to be deployed in portable terminals with limited computing resources. To expand the receptive field with the same size of DSCN's parameters, dilated convolution, instead of normal convolution, is used in the DSCN for further improving the DSCN's performance. In addition, the embeddings of various scales learned by the dilated DSCN are concatenated as a multi-scale embedding for representing property differences among various classes of domestic activities. Evaluated on a public dataset of the Task 5 of the 2018 challenge on Detection and Classification of Acoustic Scenes and Events (DCASE-2018), the results show that: both dilated convolution and multi-scale embedding contribute to the performance improvement of the proposed method; and the proposed method outperforms the methods based on state-of-the-art lightweight network in terms of classification accuracy.
引用
收藏
页数:5
相关论文
共 30 条
  • [21] Multi-scale Convolutional Attention Fuzzy Broad Network for Few-Shot Hyperspectral Image Classification
    Hu, Xiaopei
    Zhao, Guixin
    Yuan, Lu
    Dong, Xiangjun
    Dong, Aimei
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT II, 2024, 15017 : 46 - 60
  • [22] Pediatric Seizure Prediction in Scalp EEG Using a Multi-Scale Neural Network With Dilated Convolutions
    Gao, Yikai
    Chen, Xun
    Liu, Aiping
    Liang, Deng
    Wu, Le
    Qian, Ruobing
    Xie, Hongtao
    Zhang, Yongdong
    IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE, 2022, 10
  • [23] Multi-Scale Convolutional Attention and Riemannian Geometry Network for EEG-Based Motor Imagery Classification
    Zhou, Ben
    Wang, Lei
    Xu, Wenchang
    Jiang, Chenyu
    IEEE ACCESS, 2024, 12 : 79731 - 79740
  • [24] Semi-Supervised Classification for PolSAR Data With Multi-Scale Evolving Weighted Graph Convolutional Network
    Ren, Shijie
    Zhou, Feng
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 (14) : 2911 - 2927
  • [25] Attention-Based Multi-Scale Convolutional Neural Network (A plus MCNN) for Multi-Class Classification in Road Images
    Eslami, Elham
    Yun, Hae-Bum
    SENSORS, 2021, 21 (15)
  • [26] Deep multi-scale separable convolutional network with triple attention mechanism: A novel multi-task domain adaptation method for intelligent fault diagnosis*
    Zhao, Bo
    Zhang, Xianmin
    Zhan, Zhenhui
    Wu, Qiqiang
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 182 (182)
  • [27] Deep Multi-scale Feature Fusion Convolutional Neural Network for Automatic Epilepsy Detection Using EEG Signals
    Qin, Hongshuai
    Deng, Bin
    Wang, Jiang
    Yi, Guosheng
    Wang, Ruofan
    Zhang, Zhen
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 7061 - 7066
  • [28] MSFNet-2SE: A multi-scale fusion convolutional network for Alzheimer's disease classification on magnetic resonance images
    Zhang, Liwen
    Xia, Rongwei
    Yang, Baiyang
    Zhang, Jincan
    Wang, Jinchan
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2024, 34 (04)
  • [29] Road Extraction from GF-1 Remote Sensing Images Based on Dilated Convolution Residual Network with Multi-Scale Feature Fusion
    Ma Tianhao
    Tan Hai
    Li Tianqi
    Wu Yanan
    Liu Qi
    LASER & OPTOELECTRONICS PROGRESS, 2021, 58 (02)
  • [30] Automatic modulation classification scheme for next-generation cellular networks using optimized adaptive multi-scale dual attention network
    Dinesh, G.
    Priya, W. Deva
    Shirley, C. P.
    Vignesh, T.
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2025, 18 (03)