Research on Acoustic Events Recognition Method With Dimensionality Reduction Combining Attention and Mutual Information

Cited by: 2
Authors
Liu, Haitao [1 ,2 ]
Zhou, Jiasheng [1 ]
Xi, Guanglei [1 ]
Peng, Bo [2 ]
Zhang, Sheng [3 ]
Xiao, Qian [1 ]
Affiliations
[1] East China Jiaotong Univ, Sch Mechanotron & Vehicle Engn, Nanchang 330013, Jiangxi, Peoples R China
[2] Tsinghua Univ, Suzhou Automot Res Inst, Suzhou 215131, Peoples R China
[3] Suzhou Acoust Technol Inst Co Ltd, Suzhou 215131, Peoples R China
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation
Keywords
Mutual information; Mel frequency cepstral coefficient; Feature extraction; Dimensionality reduction; Acoustics; Sensors; Computer architecture; Environment sound classification; Feature dimensionality reduction; LSTM; Attention mechanism; Classification
DOI
10.1109/JSEN.2022.3155706
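Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract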
Environment sound classification (ESC) is of great significance to the monitoring and control of urban noise. To address the curse of dimensionality in ESC, a feature dimensionality reduction architecture combining attention and mutual information is proposed. To match the two-dimensional MFCC (Mel Frequency Cepstral Coefficients) feature matrix, the proposed method separates and reconstructs the feature frames of different samples, and achieves dimensionality reduction by making decisions based on the information entropy between the feature frames and the labels. In addition, the method combines an LSTM (Long Short-Term Memory) model with an attention mechanism to preserve recognition accuracy after dimensionality reduction. Ten urban acoustic events from the UrbanSound8K (US8K) dataset are used to verify the performance of the proposed method in simulation experiments, and the results are compared with existing classification methods. The simulation results show that, by combining the attention mechanism and mutual information, the proposed method reaches a recognition accuracy of 95.16% on the UrbanSound8K dataset with the smallest parameter scale among the compared methods, only 0.92M. Moreover, the parameter scale is adjustable through a dynamic frame retention mechanism, which balances recognition accuracy against speed. The method therefore maintains high classification accuracy while reducing the computing power consumption and storage requirements of monitoring equipment, which makes it practical for urban acoustic event recognition.
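The frame-selection idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how a frame is compared with the labels, so the sketch assumes each frame is summarized by the mean of its coefficients, quantized into equal-width bins, and scored by discrete mutual information with the class labels. The names `mutual_information`, `quantize`, `select_frames`, and the toy data are hypothetical.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Discrete mutual information I(X;Y) in bits, estimated from paired samples."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), c in pxy.items():
        # p(x,y) * log2( p(x,y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * math.log2(c * n / (px[x] * py[y]))
    return mi


def quantize(values, n_bins=4):
    """Equal-width binning of floats into integer bin ids (assumed discretization)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0] * len(values)
    width = (hi - lo) / n_bins
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


def select_frames(samples, labels, keep, n_bins=4):
    """Score each frame index by MI with the labels and keep the top `keep`.

    samples: list of MFCC-like matrices, each a list of frames,
             each frame a list of coefficients (all samples same shape).
    Returns (sorted kept frame indices, per-frame MI scores).
    """
    n_frames = len(samples[0])
    scores = []
    for t in range(n_frames):
        # Assumption: a frame is summarized by the mean of its coefficients.
        summary = [sum(s[t]) / len(s[t]) for s in samples]
        scores.append(mutual_information(quantize(summary, n_bins), labels))
    ranked = sorted(range(n_frames), key=lambda t: scores[t], reverse=True)
    return sorted(ranked[:keep]), scores


# Toy data: 4 samples, 3 frames, 2 coefficients per frame.
# Frame 0 is constant, frame 1 tracks the label, frame 2 varies independently of it.
toy_labels = [0, 0, 1, 1]
toy_samples = [
    [[0.5, 0.5], [0.0, 0.0], [0.1, 0.1]],
    [[0.5, 0.5], [0.1, 0.1], [0.9, 0.9]],
    [[0.5, 0.5], [1.0, 1.0], [0.1, 0.1]],
    [[0.5, 0.5], [0.9, 0.9], [0.9, 0.9]],
]
kept, frame_scores = select_frames(toy_samples, toy_labels, keep=1)
print(kept, frame_scores)
```

In this sketch the `keep` argument plays the role that the abstract attributes to the dynamic frame retention mechanism: retaining fewer frames shortens the sequence a downstream model has to process, at the possible cost of accuracy.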
Pages: 8622-8632
Number of pages: 11