Research on Acoustic Events Recognition Method With Dimensionality Reduction Combining Attention and Mutual Information

Cited by: 2
Authors
Liu, Haitao [1 ,2 ]
Zhou, Jiasheng [1 ]
Xi, Guanglei [1 ]
Peng, Bo [2 ]
Zhang, Sheng [3 ]
Xiao, Qian [1 ]
Affiliations
[1] East China Jiaotong Univ, Sch Mechanotron & Vehicle Engn, Nanchang 330013, Jiangxi, Peoples R China
[2] Tsinghua Univ, Suzhou Automot Res Inst, Suzhou 215131, Peoples R China
[3] Suzhou Acoust Technol Inst Co Ltd, Suzhou 215131, Peoples R China
Funding
National Natural Science Foundation of China; China Postdoctoral Science Foundation;
Keywords
Mutual information; Mel frequency cepstral coefficient; Feature extraction; Dimensionality reduction; Acoustics; Sensors; Computer architecture; Environment sound classification; mutual information; feature dimensionality reduction; LSTM; attention mechanism; CLASSIFICATION;
DOI
10.1109/JSEN.2022.3155706
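Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes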
0808 ; 0809 ;
Abstract
Environmental sound classification (ESC) is of great significance to the monitoring and control of urban noise. To address the curse of dimensionality in ESC, a feature dimensionality reduction architecture combining attention and mutual information is proposed. To match the two-dimensional MFCC (Mel Frequency Cepstral Coefficients) feature matrix, the proposed method separates and reconstructs the feature frames of different samples, and achieves dimensionality reduction by making decisions based on the information entropy between the feature frames and the labels. In addition, the method combines an LSTM (Long Short-Term Memory) model with an attention mechanism to preserve the recognition accuracy of the model after dimensionality reduction. Ten urban acoustic events from the UrbanSound8k (US8K) dataset are selected to verify the performance of the proposed method in simulation experiments, and the results are compared with existing classification methods. The simulation results show that, by combining the attention mechanism and mutual information, the proposed method reaches a recognition accuracy of 95.16% on the UrbanSound8k dataset with the smallest parameter scale among the compared methods, only 0.92M. Moreover, the model parameter scale can be adjusted through a dynamic frame retention mechanism to balance recognition accuracy and speed. The method therefore not only ensures high classification accuracy but also reduces the computing power consumption and storage space required by monitoring equipment, which makes it more practical for urban acoustic event recognition.
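The frame-selection idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the histogram-based mutual information estimate, the quantile binning, the averaging over coefficients, and the top-k retention rule are all assumptions made for illustration, and the function names (`discrete_mutual_information`, `quantize`, `frame_scores`, `select_frames`) are hypothetical. The LSTM and attention stages are not shown. Features are assumed to be stacked as an array of shape (samples, frames, coefficients).

```python
import numpy as np


def discrete_mutual_information(x, y):
    """Mutual information (in nats) between two 1-D integer arrays,
    estimated from their joint histogram (an assumed, simple estimator)."""
    x_vals, x_idx = np.unique(np.ravel(x), return_inverse=True)
    y_vals, y_idx = np.unique(np.ravel(y), return_inverse=True)
    joint = np.zeros((x_vals.size, y_vals.size))
    np.add.at(joint, (x_idx.ravel(), y_idx.ravel()), 1.0)
    joint /= joint.sum()
    px = joint.sum(axis=1, keepdims=True)   # marginal of x
    py = joint.sum(axis=0, keepdims=True)   # marginal of y
    nz = joint > 0                          # skip empty cells (0 * log 0 = 0)
    return float(np.sum(joint[nz] * np.log(joint[nz] / (px @ py)[nz])))


def quantize(values, n_bins):
    """Map continuous values to integer bins using quantile edges."""
    edges = np.quantile(values, np.linspace(0.0, 1.0, n_bins + 1)[1:-1])
    return np.digitize(values, edges)


def frame_scores(features, labels, n_bins=8):
    """Score each frame index by the mean MI between its coefficients
    (taken across samples) and the class labels."""
    _, n_frames, n_coeffs = features.shape
    scores = np.zeros(n_frames)
    for t in range(n_frames):
        mi = [
            discrete_mutual_information(quantize(features[:, t, c], n_bins), labels)
            for c in range(n_coeffs)
        ]
        scores[t] = np.mean(mi)
    return scores


def select_frames(features, labels, keep, n_bins=8):
    """Return the indices of the `keep` highest-scoring frames, in temporal
    order, together with all frame scores."""
    scores = frame_scores(features, labels, n_bins)
    top = np.argsort(scores)[::-1][:keep]
    return np.sort(top), scores


# Synthetic stand-in for MFCC features: 200 samples, 6 frames, 4 coefficients.
rng = np.random.default_rng(0)
labels = np.repeat([0, 1], 100)
features = rng.normal(size=(200, 6, 4))
features[:, 3, :] += 10.0 * labels[:, None]   # only frame 3 depends on the label

kept, scores = select_frames(features, labels, keep=2)
reduced = features[:, kept, :]                # reduced input for a downstream model
```

Changing `keep` trades input size against retained information, which is one simple way to picture the adjustable frame retention that the abstract mentions. The exact decision rule used in the paper may differ.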
Pages: 8622-8632
Page count: 11
Related papers
48 in total
  • [31] A novel filter based on three variables mutual information for dimensionality reduction and classification of hyperspectral images
    Elmaizi, Asma
    Sarhrouni, Elkebir
    Hammouch, Ahmed
    Nacir, Chafik
    2016 INTERNATIONAL CONFERENCE ON ELECTRICAL AND INFORMATION TECHNOLOGIES (ICEIT), 2016, : 368 - 373
  • [32] A dimensionality reduction method based on structured sparse representation for face recognition
    Gu, Guanghua
    Hou, Zhichao
    Chen, Chunxia
    Zhao, Yao
    ARTIFICIAL INTELLIGENCE REVIEW, 2016, 46 (04) : 431 - 443
  • [33] An analysis of evaluation information: A method based on SVD & dimensionality reduction model
    Qi, H
    Liu, YF
    Wang, XP
    Xiao, HG
    Wang, SS
    ICIA 2004: Proceedings of 2004 International Conference on Information Acquisition, 2004, : 40 - 45
  • [34] A Noise Reduction Method for Photoacoustic Imaging In Vivo Based on EMD and Conditional Mutual Information
    Zhou, Meng
    Xia, Haibo
    Zhong, Hongtao
    Zhang, Jiayao
    Gao, Fei
    IEEE PHOTONICS JOURNAL, 2019, 11 (01):
  • [35] Feature selection method based on mutual information and class separability for dimension reduction in multidimensional time series for clinical data
    Fang, Liying
    Zhao, Han
    Wang, Pu
    Yu, Mingwei
    Yan, Jianzhuo
    Cheng, Wenshuai
    Chen, Peiyu
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2015, 21 : 82 - 89
  • [36] A supervised dimensionality reduction method-based sparse representation for face recognition
    Zhang, Xinxin
    Peng, Yali
    Liu, Shigang
    Wu, Jie
    Ren, Pingan
    JOURNAL OF MODERN OPTICS, 2017, 64 (08) : 799 - 806
  • [37] Research on the Method of Nonlinear Dimensionality Reduction for the Text of Laws and Regulations of construction
    Su, Bian-ping
    Wang, Yi-ping
    Zhi, Hui
    2009 5TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-8, 2009, : 5371 - +
  • [38] PCA-BASED DIMENSIONALITY REDUCTION METHOD FOR USER INFORMATION IN UNIVERSAL NETWORK
    Dai, Yu
    Guan, Jianfeng
    Quan, Wei
    Xu, Changqiao
    Zhang, Hongke
    2012 IEEE 2ND INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENT SYSTEMS (CCIS) VOLS 1-3, 2012, : 70 - 74
  • [39] Open-set Pig Face Recognition Method Combining Attention Mechanism
    Wang R.
    Gao R.
    Li Q.
    Liu S.
    Yu Q.
    Feng L.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2023, 54 (02) : 256 - 264
  • [40] Channel Selection Method for EEG Emotion Recognition Using Normalized Mutual Information
    Wang, Zhong-Min
    Hu, Shu-Yuan
    Song, Hui
    IEEE ACCESS, 2019, 7 : 143303 - 143311