Research on Acoustic Events Recognition Method With Dimensionality Reduction Combining Attention and Mutual Information

被引:2
作者
Liu, Haitao [1 ,2 ]
Zhou, Jiasheng [1 ]
Xi, Guanglei [1 ]
Peng, Bo [2 ]
Zhang, Sheng [3 ]
Xiao, Qian [1 ]
机构
[1] East China Jiaotong Univ, Sch Mechanotron & Vehicle Engn, Nanchang 330013, Jiangxi, Peoples R China
[2] Tsinghua Univ, Suzhou Automot Res Inst, Suzhou 215131, Peoples R China
[3] Suzhou Acoust Technol Inst Co Ltd, Suzhou 215131, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Mutual information; Mel frequency cepstral coefficient; Feature extraction; Dimensionality reduction; Acoustics; Sensors; Computer architecture; Environment sound classification; mutual information; feature dimensionality reduction; LSTM; attention mechanism; CLASSIFICATION;
D O I
10.1109/JSEN.2022.3155706
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The environment sound classification(ESC) is of great significance to the monitoring and control of urban noise. Aiming at the curse of dimensionality phenomenon in ESC, a feature dimensionality reduction architecture combining attention and mutual information is proposed. In order to match the two-dimensional MFCC (Mel Frequency Cepstral Coefficients) feature matrix, the proposed method separates and reconstructs the feature frames of different samples, and achieves the effect of dimensionality reduction by making decisions on the information entropy between the feature frames and labels. In addition, the method combines LSTM (Long Short-Term Memory) model with attention mechanism to ensure the recognition accuracy of the model after dimensionality reduction. Ten urban acoustic events from UrbanSound8k (US8K) dataset are selected to verify the performances of the proposed method by simulation experiments, which are also compared with the existing classification methods. The simulation results show that by combining the attention mechanism and mutual information, the recognition accuracy of the proposed method on the UrbanSound8k dataset is 95.16%, and the parameter scale is the smallest, only 0.92M. Moreover, the model parameter scale is adjustable by dynamic frame retention mechanism to balance the recognition accuracy and speed. This method not only ensures a high classification accuracy, but also can reduce computing power consumption and storage space of monitoring equipment, which shows a better practical performance for urban acoustic events recognition.
引用
收藏
页码:8622 / 8632
页数:11
相关论文
共 48 条
  • [1] Stochastic mutual information gradient estimation for dimensionality reduction networks
    Oezdenizci, Ozan
    Erdogmus, Deniz
    INFORMATION SCIENCES, 2021, 570 : 298 - 305
  • [2] Dimensionality Reduction by Mutual Information for Text Classification
    刘丽珍
    宋瀚涛
    陆玉昌
    Journal of Beijing Institute of Technology, 2005, (01) : 32 - 36
  • [3] Band Elimination for Dimensionality Reduction of Hyperspectral Images using Mutual Information
    Dey, Abhishek
    Ghosh, Susmita
    Ghosh, Ashish, Sr.
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 2025 - 2028
  • [4] Quadratic mutual information for dimensionality reduction and classification
    Gray, David M.
    Principe, Jose C.
    AUTOMATIC TARGET RECOGNITION XX; ACQUISITION, TRACKING, POINTING, AND LASER SYSTEMS TECHNOLOGIES XXIV; AND OPTICAL PATTERN RECOGNITION XXI, 2010, 7696
  • [5] DIMENSIONALITY REDUCTION FOR IMAGE CLASSIFICATION VIA MUTUAL INFORMATION MAXIMIZATION
    Huang, Shuai
    Tran, Trac D.
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 509 - 513
  • [6] Hyperspectral images classification and Dimensionality Reduction using Homogeneity feature and mutual information
    Nhaila, Hasna
    Merzouqi, Maria
    Sarhrouni, Elkebir
    Hammouch, Ahmed
    2015 INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV), 2015,
  • [7] Graph Embedded Nonparametric Mutual Information For Supervised Dimensionality Reduction
    Bouzas, Dimitrios
    Arvanitopoulos, Nikolaos
    Tefas, Anastasios
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (05) : 951 - 963
  • [8] Dimensionality reduction based on non-parametric mutual information
    Faivishevsky, Lev
    Goldberger, Jacob
    NEUROCOMPUTING, 2012, 80 : 31 - 37
  • [9] Text Dimensionality Reduction with Mutual Information Preserving Mapping
    Yang Zhen
    Yao Fei
    Fan Kefeng
    Huang Jian
    CHINESE JOURNAL OF ELECTRONICS, 2017, 26 (05) : 919 - 925
  • [10] Text Dimensionality Reduction with Mutual Information Preserving Mapping
    YANG Zhen
    YAO Fei
    FAN Kefeng
    HUANG Jian
    Chinese Journal of Electronics, 2017, 26 (05) : 919 - 925