Audio Scene Classification Under Convolutional Neural Network

被引:0
|
作者
Chen, Shengbo [1 ]
Tan, Aoshuai [2 ]
Liu, Ximin [3 ]
Tang, Pengjie [4 ]
Lv, Jingxiang [4 ]
Guo, Chen [4 ]
机构
[1] Henan Univ, Sch Comp & Informat Engn, Zhengzhou 475001, Henan, Peoples R China
[2] Henan Univ, Sch Software, Zhengzhou 475001, Henan, Peoples R China
[3] Shenzhen Ruier Elect Co Ltd, Shenzhen 518000, Guangdong, Peoples R China
[4] Jinggangshan Univ, Dept Comp Sci, Jian 343009, Jiangxi, Peoples R China
来源
PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ALGORITHMS, SOFTWARE ENGINEERING, AND NETWORK SECURITY, ASENS 2024 | 2024年
基金
中国国家自然科学基金;
关键词
Signal processing; Audio classification; Convolutional neural networks; Feature extraction;
D O I
10.1145/3677182.3677241
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Audio scene classification is a way of supporting security monitoring applications such as audio surveillance, anomaly detection, and risk management by recognizing and categorizing environmental labels in audio data. With the significant increase in the volume of audio data generated by audio-video surveillance systems, the limitations of traditional classification methods are becoming increasingly apparent. In contrast, deep learning techniques, leveraging their advantages in data feature processing and pattern recognition, have become key technologies for solving such problems. Building upon this, this paper focuses on optimizing the audio scene classification system using a Convolutional Neural Network model and delving deeper into existing dataset information without increasing additional data volume. Additionally, the network structure is adjusted without increasing the computational burden. This model approach effectively improves the recognition accuracy in specific scenarios, as evidenced by comparison analysis with a human baseline system.
引用
收藏
页码:329 / 334
页数:6
相关论文
共 50 条
  • [1] Deep Convolutional Neural Network with Scalogram for Audio Scene Modeling
    Chen, Hangting
    Zhang, Pengyuan
    Bai, Haichuan
    Yuan, Qingsheng
    Bao, Xiuguo
    Yan, Yonghong
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3304 - 3308
  • [2] Convolutional Neural Network based Audio Event Classification
    Lim, Minkyu
    Lee, Donghyun
    Park, Hosung
    Kang, Yoseb
    Oh, Junseok
    Park, Jeong-Sik
    Jang, Gil-Jin
    Kim, Ji-Hwan
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (06): : 2748 - 2760
  • [3] Scene Classification Based on Multiscale Convolutional Neural Network
    Liu, Yanfei
    Zhong, Yanfei
    Qin, Qianqing
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (12): : 7109 - 7121
  • [4] Farmland scene classification based on convolutional neural network
    Zhu Deli
    Chen Bingqi
    Zhu Deli
    Yang Yunong
    2016 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2016, : 159 - 162
  • [5] A Convolutional Neural Network Approach for Acoustic Scene Classification
    Valenti, Michele
    Squartini, Stefano
    Diment, Aleksandr
    Parascandolo, Giambattista
    Virtanen, Tuomas
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1547 - 1554
  • [6] Sample Dropout for Audio Scene Classification Using Multi-scale Dense Connected Convolutional Neural Network
    Feng, Dawei
    Xu, Kele
    Mi, Haibo
    Liao, Feifan
    Zhou, Yan
    KNOWLEDGE MANAGEMENT AND ACQUISITION FOR INTELLIGENT SYSTEMS (PKAW 2018), 2018, 11016 : 114 - 123
  • [7] Scene Classification with Simple Machine Learning and Convolutional Neural Network
    Yosboon, Simon
    2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, : 616 - 619
  • [8] A Time Delay Convolutional Neural Network for Acoustic Scene Classification
    Lee, Younglo
    Park, Sangwook
    Ko, Hanseok
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2018,
  • [9] A Convolutional Neural Network Based on Grouping Structure for Scene Classification
    Wu, Xuan
    Zhang, Zhijie
    Zhang, Wanchang
    Yi, Yaning
    Zhang, Chuanrong
    Xu, Qiang
    REMOTE SENSING, 2021, 13 (13)
  • [10] Sparsity Through Spiking Convolutional Neural Network for Audio Classification at the Edge
    Leow, Cong Sheng
    Goh, Wang Ling
    Gao, Yuan
    2023 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS, 2023,