Audio Scene Classification Under Convolutional Neural Network

被引：0

作者：

Chen, Shengbo ^{[1
]}

Tan, Aoshuai ^{[2
]}

Liu, Ximin ^{[3
]}

Tang, Pengjie ^{[4
]}

Lv, Jingxiang ^{[4
]}

Guo, Chen ^{[4
]}

机构：

[1] Henan Univ, Sch Comp & Informat Engn, Zhengzhou 475001, Henan, Peoples R China

[2] Henan Univ, Sch Software, Zhengzhou 475001, Henan, Peoples R China

[3] Shenzhen Ruier Elect Co Ltd, Shenzhen 518000, Guangdong, Peoples R China

[4] Jinggangshan Univ, Dept Comp Sci, Jian 343009, Jiangxi, Peoples R China

来源：

PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ALGORITHMS, SOFTWARE ENGINEERING, AND NETWORK SECURITY, ASENS 2024 | 2024年

基金：

中国国家自然科学基金;

关键词：

Signal processing; Audio classification; Convolutional neural networks; Feature extraction;

D O I：

10.1145/3677182.3677241

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Audio scene classification is a way of supporting security monitoring applications such as audio surveillance, anomaly detection, and risk management by recognizing and categorizing environmental labels in audio data. With the significant increase in the volume of audio data generated by audio-video surveillance systems, the limitations of traditional classification methods are becoming increasingly apparent. In contrast, deep learning techniques, leveraging their advantages in data feature processing and pattern recognition, have become key technologies for solving such problems. Building upon this, this paper focuses on optimizing the audio scene classification system using a Convolutional Neural Network model and delving deeper into existing dataset information without increasing additional data volume. Additionally, the network structure is adjusted without increasing the computational burden. This model approach effectively improves the recognition accuracy in specific scenarios, as evidenced by comparison analysis with a human baseline system.

引用

页码：329 / 334

页数：6

共 50 条

[1] Deep Convolutional Neural Network with Scalogram for Audio Scene Modeling
Chen, Hangting
Zhang, Pengyuan
Bai, Haichuan
Yuan, Qingsheng
Bao, Xiuguo
Yan, Yonghong
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3304 - 3308
[2] Convolutional Neural Network based Audio Event Classification
Lim, Minkyu
Lee, Donghyun
Park, Hosung
Kang, Yoseb
Oh, Junseok
Park, Jeong-Sik
Jang, Gil-Jin
Kim, Ji-Hwan
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (06): : 2748 - 2760
[3] Scene Classification Based on Multiscale Convolutional Neural Network
Liu, Yanfei
Zhong, Yanfei
Qin, Qianqing
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (12): : 7109 - 7121
[4] Farmland scene classification based on convolutional neural network
Zhu Deli
Chen Bingqi
Zhu Deli
Yang Yunong
2016 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2016, : 159 - 162
[5] A Convolutional Neural Network Approach for Acoustic Scene Classification
Valenti, Michele
Squartini, Stefano
Diment, Aleksandr
Parascandolo, Giambattista
Virtanen, Tuomas
2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1547 - 1554
[6] Sample Dropout for Audio Scene Classification Using Multi-scale Dense Connected Convolutional Neural Network
Feng, Dawei
Xu, Kele
Mi, Haibo
Liao, Feifan
Zhou, Yan
KNOWLEDGE MANAGEMENT AND ACQUISITION FOR INTELLIGENT SYSTEMS (PKAW 2018), 2018, 11016 : 114 - 123
[7] Scene Classification with Simple Machine Learning and Convolutional Neural Network
Yosboon, Simon
2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, : 616 - 619
[8] A Time Delay Convolutional Neural Network for Acoustic Scene Classification
Lee, Younglo
Park, Sangwook
Ko, Hanseok
2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2018,
[9] A Convolutional Neural Network Based on Grouping Structure for Scene Classification
Wu, Xuan
Zhang, Zhijie
Zhang, Wanchang
Yi, Yaning
Zhang, Chuanrong
Xu, Qiang
REMOTE SENSING, 2021, 13 (13)
[10] Sparsity Through Spiking Convolutional Neural Network for Audio Classification at the Edge
Leow, Cong Sheng
Goh, Wang Ling
Gao, Yuan
2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,

← 1 2 3 4 5 →