Sample Dropout for Audio Scene Classification Using Multi-scale Dense Connected Convolutional Neural Network

Cited by: 1
Authors
Feng, Dawei [1]
Xu, Kele [1,2]
Mi, Haibo [1]
Liao, Feifan [2]
Zhou, Yan [2]
Affiliations
[1] Natl Univ Def Technol, Sch Comp, Sci & Technol Parallel & Distributed Lab, Changsha 410073, Hunan, Peoples R China
[2] Natl Univ Def Technol, Sch Informat & Commun, Wuhan 430010, Hubei, Peoples R China
Source
KNOWLEDGE MANAGEMENT AND ACQUISITION FOR INTELLIGENT SYSTEMS (PKAW 2018) | 2018 / Vol. 11016
Keywords
Sample dropout; Audio scene classification; Convolutional neural network; Multi-scale
DOI
10.1007/978-3-319-97289-3_9
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Acoustic scene classification is a challenging problem for machines. In this emerging field of research, deep convolutional neural networks (CNNs) have achieved convincing results. In this paper, we explore the use of a multi-scale densely connected convolutional neural network (DenseNet) for the classification task, with the goal of improving classification performance, since multi-scale features can be extracted from the time-frequency representation of the audio signal. Most previous CNN-based audio scene classification approaches aim to improve classification accuracy by employing regularization techniques, such as dropout of hidden units and data augmentation, to reduce overfitting. It is widely known that outliers in the training set have a strong negative influence on the trained model and that culling them may improve classification performance, yet this option has been under-explored in previous studies. Inspired by silence removal in speech signal processing, we propose a novel sample dropout approach that aims to remove outliers from the training dataset. On the DCASE 2017 audio scene classification datasets, the experimental results demonstrate that the proposed multi-scale DenseNet outperforms the traditional single-scale DenseNet, and that the sample dropout method further improves the classification robustness of the multi-scale DenseNet.
Pages: 114-123
Number of pages: 10