Research on Acoustic Events Recognition Method With Dimensionality Reduction Combining Attention and Mutual Information

被引:2
作者
Liu, Haitao [1 ,2 ]
Zhou, Jiasheng [1 ]
Xi, Guanglei [1 ]
Peng, Bo [2 ]
Zhang, Sheng [3 ]
Xiao, Qian [1 ]
机构
[1] East China Jiaotong Univ, Sch Mechanotron & Vehicle Engn, Nanchang 330013, Jiangxi, Peoples R China
[2] Tsinghua Univ, Suzhou Automot Res Inst, Suzhou 215131, Peoples R China
[3] Suzhou Acoust Technol Inst Co Ltd, Suzhou 215131, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Mutual information; Mel frequency cepstral coefficient; Feature extraction; Dimensionality reduction; Acoustics; Sensors; Computer architecture; Environment sound classification; mutual information; feature dimensionality reduction; LSTM; attention mechanism; CLASSIFICATION;
D O I
10.1109/JSEN.2022.3155706
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The environment sound classification(ESC) is of great significance to the monitoring and control of urban noise. Aiming at the curse of dimensionality phenomenon in ESC, a feature dimensionality reduction architecture combining attention and mutual information is proposed. In order to match the two-dimensional MFCC (Mel Frequency Cepstral Coefficients) feature matrix, the proposed method separates and reconstructs the feature frames of different samples, and achieves the effect of dimensionality reduction by making decisions on the information entropy between the feature frames and labels. In addition, the method combines LSTM (Long Short-Term Memory) model with attention mechanism to ensure the recognition accuracy of the model after dimensionality reduction. Ten urban acoustic events from UrbanSound8k (US8K) dataset are selected to verify the performances of the proposed method by simulation experiments, which are also compared with the existing classification methods. The simulation results show that by combining the attention mechanism and mutual information, the recognition accuracy of the proposed method on the UrbanSound8k dataset is 95.16%, and the parameter scale is the smallest, only 0.92M. Moreover, the model parameter scale is adjustable by dynamic frame retention mechanism to balance the recognition accuracy and speed. This method not only ensures a high classification accuracy, but also can reduce computing power consumption and storage space of monitoring equipment, which shows a better practical performance for urban acoustic events recognition.
引用
收藏
页码:8622 / 8632
页数:11
相关论文
共 48 条
  • [41] Combining Prediction Models and Dimensionality Reduction Technology for Water Resources Management Under Incomplete Information and Dynamic Change
    Yang, Xi
    Chen, Zhihe
    WATER RESOURCES MANAGEMENT, 2024, 38 (14) : 5629 - 5644
  • [42] A Novel SAR Target Recognition Method Combining Electromagnetic Scattering Information and GCN
    Li, Chen
    Du, Lan
    Li, Yi
    Song, Jialun
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [43] Research on named entity recognition method of marine natural products based on attention mechanism
    Ma, Xiaodong
    Yu, Rilei
    Gao, Chunxiao
    Wei, Zhiqiang
    Xia, Yimin
    Wang, Xiaowei
    Liu, Hao
    FRONTIERS IN CHEMISTRY, 2023, 11
  • [44] Deep learning method with auditory passive attention for underwater acoustic target recognition under the condition of ship interference
    Li, Junhao
    Yang, Honghui
    OCEAN ENGINEERING, 2024, 302
  • [45] Research on dimensionality reduction method of a high-dimensional nonlinear gear dynamics model and its dynamic characteristic analysis
    Li, Zhibin
    Zheng, Jinde
    Wang, Sanmin
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART K-JOURNAL OF MULTI-BODY DYNAMICS, 2025,
  • [46] Research on Re-recognition Method of Multi-branch Fusion Attention Mechanism for Occluded Pedestrian
    Zhao, Haiyan
    Xu, Yan
    2023 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYTICS, ICCCBDA, 2023, : 477 - 480
  • [47] Remote Sensing Image Recognition of Dust Cover Net Construction Waste: A Method Combining Convolutional Block Attention Module and U-Net
    Lv, Shangwei
    Liu, Xiaoyu
    Cao, Yifei
    SENSORS AND MATERIALS, 2024, 36 (07) : 3131 - 3145
  • [48] Research on feature extraction method of ship radiated noise with K-nearest neighbor mutual information variational mode decomposition, neural network estimation time entropy and self-organizing map neural network
    Li, Guohui
    Liu, Feng
    Yang, Hong
    MEASUREMENT, 2022, 199