Mixup-Based Acoustic Scene Classification Using Multi-channel Convolutional Neural Network

被引:45
|
作者
Xu, Kele [1 ,2 ]
Feng, Dawei [1 ]
Mi, Haibo [1 ]
Zhu, Boqing [1 ]
Wang, Dezhi [3 ]
Zhang, Lilun [3 ]
Cai, Hengxing [4 ]
Liu, Shuwen [5 ]
机构
[1] Natl Univ Def Technol, Sci & Technol Parallel & Distributed Lab, Changsha, Hunan, Peoples R China
[2] Natl Univ Def Technol, Coll Informat Commun, Wuhan, Hubei, Peoples R China
[3] Natl Univ Def Technol, Coll Meteorol & Oceanog, Changsha, Hunan, Peoples R China
[4] Sun Yat Sen Univ, Coll Engn, Guangzhou, Guangdong, Peoples R China
[5] Nanjing Univ Technol, Coll Comp Sci, Nanjing, Jiangsu, Peoples R China
来源
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III | 2018年 / 11166卷
关键词
D O I
10.1007/978-3-030-00764-5_2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Audio scene classification, the problem of predicting class labels of audio scenes, has drawn lots of attention during the last several years. However, it remains challenging and falls short of accuracy and efficiency. Recently, Convolutional Neural Network (CNN)-based methods have achieved better performance with comparison to the traditional methods. Nevertheless, conventional single channel CNN may fail to consider the fact that additional cues may be embedded in the multi-channel recordings. In this paper, we explore the use of Multi-channel CNN for the classification task, which aims to extract features from different channels in an end-to-end manner. We conduct the evaluation compared with the conventional CNN and traditional Gaussian Mixture Model-based methods. Moreover, to improve the classification accuracy further, this paper explores the using of mixup method. In brief, mixup trains the neural network on linear combinations of pairs of the representation of audio scene examples and their labels. By employing the mixup approach for data augmentation, the novel model can provide higher prediction accuracy and robustness in contrast with previous models, while the generalization error can also be reduced on the evaluation data.
引用
收藏
页码:14 / 23
页数:10
相关论文
共 50 条
  • [1] Acoustic Scene Classification Based on Dense Convolutional Networks Incorporating Multi-channel Features
    Wang, Dezhi
    Zhang, Lilun
    Xu, Kele
    Wang, Yongxian
    2018 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION, IMAGE AND SIGNAL PROCESSING, 2019, 1169
  • [2] Classification of Hyperspectral Data Using a Multi-Channel Convolutional Neural Network
    Chen, Chen
    Zhang, Jing-Jing
    Zheng, Chun-Hou
    Yan, Qing
    Xun, Li-Na
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2018, PT III, 2018, 10956 : 81 - 92
  • [3] A Hybrid Approach with Multi-channel I-Vectors and Convolutional Neural Networks for Acoustic Scene Classification
    Eghbal-zadeh, Hamid
    Lehner, Bernhard
    Dorfer, Matthias
    Widmer, Gerhard
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 2749 - 2753
  • [4] Multi-channel Convolutional Neural Network for Precise Meme Classification
    Sherratt, Victoria
    Pimbblet, Kevin
    Dethlefs, Nina
    PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 190 - 198
  • [5] SVD-BASED CHANNEL PRUNING FOR CONVOLUTIONAL NEURAL NETWORK IN ACOUSTIC SCENE CLASSIFICATION MODEL
    Wang, Jun
    Li, Shengchen
    Wang, Wenwu
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 390 - 395
  • [6] A multi-channel attention graph convolutional neural network for node classification
    Zhai, Rui
    Zhang, Libo
    Wang, Yingqi
    Song, Yalin
    Yu, Junyang
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (04): : 3561 - 3579
  • [7] Multi-channel Convolutional Neural Network with Sentiment Information for Sentiment Classification
    Yan, Hao
    Li, Huixin
    Yi, Benshun
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (08) : 10551 - 10561
  • [8] Multi-channel Convolutional Neural Network with Sentiment Information for Sentiment Classification
    Hao Yan
    Huixin Li
    Benshun Yi
    Arabian Journal for Science and Engineering, 2023, 48 : 10551 - 10561
  • [9] A multi-channel attention graph convolutional neural network for node classification
    Rui Zhai
    Libo Zhang
    Yingqi Wang
    Yalin Song
    Junyang Yu
    The Journal of Supercomputing, 2023, 79 : 3561 - 3579
  • [10] A Convolutional Neural Network Approach for Acoustic Scene Classification
    Valenti, Michele
    Squartini, Stefano
    Diment, Aleksandr
    Parascandolo, Giambattista
    Virtanen, Tuomas
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1547 - 1554