Mixup-Based Acoustic Scene Classification Using Multi-channel Convolutional Neural Network

被引：45

作者：

Xu, Kele ^{[1
,2
]}

Feng, Dawei ^{[1
]}

Mi, Haibo ^{[1
]}

Zhu, Boqing ^{[1
]}

Wang, Dezhi ^{[3
]}

Zhang, Lilun ^{[3
]}

Cai, Hengxing ^{[4
]}

Liu, Shuwen ^{[5
]}

机构：

[1] Natl Univ Def Technol, Sci & Technol Parallel & Distributed Lab, Changsha, Hunan, Peoples R China

[2] Natl Univ Def Technol, Coll Informat Commun, Wuhan, Hubei, Peoples R China

[3] Natl Univ Def Technol, Coll Meteorol & Oceanog, Changsha, Hunan, Peoples R China

[4] Sun Yat Sen Univ, Coll Engn, Guangzhou, Guangdong, Peoples R China

[5] Nanjing Univ Technol, Coll Comp Sci, Nanjing, Jiangsu, Peoples R China

来源：

ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III | 2018年 / 11166卷

关键词：

D O I：

10.1007/978-3-030-00764-5_2

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Audio scene classification, the problem of predicting class labels of audio scenes, has drawn lots of attention during the last several years. However, it remains challenging and falls short of accuracy and efficiency. Recently, Convolutional Neural Network (CNN)-based methods have achieved better performance with comparison to the traditional methods. Nevertheless, conventional single channel CNN may fail to consider the fact that additional cues may be embedded in the multi-channel recordings. In this paper, we explore the use of Multi-channel CNN for the classification task, which aims to extract features from different channels in an end-to-end manner. We conduct the evaluation compared with the conventional CNN and traditional Gaussian Mixture Model-based methods. Moreover, to improve the classification accuracy further, this paper explores the using of mixup method. In brief, mixup trains the neural network on linear combinations of pairs of the representation of audio scene examples and their labels. By employing the mixup approach for data augmentation, the novel model can provide higher prediction accuracy and robustness in contrast with previous models, while the generalization error can also be reduced on the evaluation data.

引用

页码：14 / 23

页数：10

共 50 条

[1] Acoustic Scene Classification Based on Dense Convolutional Networks Incorporating Multi-channel Features
Wang, Dezhi
Zhang, Lilun
Xu, Kele
Wang, Yongxian
2018 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION, IMAGE AND SIGNAL PROCESSING, 2019, 1169
[2] Classification of Hyperspectral Data Using a Multi-Channel Convolutional Neural Network
Chen, Chen
Zhang, Jing-Jing
Zheng, Chun-Hou
Yan, Qing
Xun, Li-Na
INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2018, PT III, 2018, 10956 : 81 - 92
[3] A Hybrid Approach with Multi-channel I-Vectors and Convolutional Neural Networks for Acoustic Scene Classification
Eghbal-zadeh, Hamid
Lehner, Bernhard
Dorfer, Matthias
Widmer, Gerhard
2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 2749 - 2753
[4] Multi-channel Convolutional Neural Network for Precise Meme Classification
Sherratt, Victoria
Pimbblet, Kevin
Dethlefs, Nina
PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023, : 190 - 198
[5] SVD-BASED CHANNEL PRUNING FOR CONVOLUTIONAL NEURAL NETWORK IN ACOUSTIC SCENE CLASSIFICATION MODEL
Wang, Jun
Li, Shengchen
Wang, Wenwu
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 390 - 395
[6] A multi-channel attention graph convolutional neural network for node classification
Zhai, Rui
Zhang, Libo
Wang, Yingqi
Song, Yalin
Yu, Junyang
JOURNAL OF SUPERCOMPUTING, 2023, 79 (04): : 3561 - 3579
[7] Multi-channel Convolutional Neural Network with Sentiment Information for Sentiment Classification
Yan, Hao
Li, Huixin
Yi, Benshun
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (08) : 10551 - 10561
[8] Multi-channel Convolutional Neural Network with Sentiment Information for Sentiment Classification
Hao Yan
Huixin Li
Benshun Yi
Arabian Journal for Science and Engineering, 2023, 48 : 10551 - 10561
[9] A multi-channel attention graph convolutional neural network for node classification
Rui Zhai
Libo Zhang
Yingqi Wang
Yalin Song
Junyang Yu
The Journal of Supercomputing, 2023, 79 : 3561 - 3579
[10] A Convolutional Neural Network Approach for Acoustic Scene Classification
Valenti, Michele
Squartini, Stefano
Diment, Aleksandr
Parascandolo, Giambattista
Virtanen, Tuomas
2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1547 - 1554

← 1 2 3 4 5 →