SASEGAN-TCN: Speech enhancement algorithm based on self-attention generative adversarial network and temporal convolutional network

被引:1
|
作者
Lv R. [1 ]
Chen N. [1 ]
Cheng S. [1 ]
Fan G. [1 ]
Rao L. [1 ]
Song X. [1 ]
Lv W. [2 ]
Yang D. [3 ]
机构
[1] School of Electronic Information Engineering, Shanghai Dianji University, Shanghai
[2] School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai
[3] Alibaba Group, Shanghai
基金
中国国家自然科学基金;
关键词
autoencoder; deep learning; generative adversarial network; speech enhancement;
D O I
10.3934/mbe.2024172
中图分类号
学科分类号
摘要
Traditional unsupervised speech enhancement models often have problems such as non-aggregation of input feature information, which will introduce additional noise during training, thereby reducing the quality of the speech signal. In order to solve the above problems, this paper analyzed the impact of problems such as non-aggregation of input speech feature information on its performance. Moreover, this article introduced a temporal convolutional neural network and proposed a SASEGAN-TCN speech enhancement model, which captured local features information and aggregated global feature information to improve model effect and training stability. The simulation experiment results showed that the model can achieve 2.1636 and 92.78% in perceptual evaluation of speech quality (PESQ) score and short-time objective intelligibility (STOI) on the Valentini dataset, and can accordingly reach 1.8077 and 83.54% on the THCHS30 dataset. In addition, this article used the enhanced speech data for the acoustic model to verify the recognition accuracy. The speech recognition error rate was reduced by 17.4%, which was a significant improvement compared to the baseline model experimental results. © 2024 the Author(s).
引用
收藏
页码:3860 / 3875
页数:15
相关论文
共 50 条
  • [31] BaMSGAN: Self-Attention Generative Adversarial Network with Blur and Memory for Anime Face Generation
    Li, Xu
    Li, Bowei
    Fang, Minghao
    Huang, Rui
    Huang, Xiaoran
    MATHEMATICS, 2023, 11 (20)
  • [32] Multi-scale self-attention generative adversarial network for pathology image restoration
    Meiyan Liang
    Qiannan Zhang
    Guogang Wang
    Na Xu
    Lin Wang
    Haishun Liu
    Cunlin Zhang
    The Visual Computer, 2023, 39 : 4305 - 4321
  • [33] Application of Self-Attention Generative Adversarial Network for Electromagnetic Imaging in Half-Space
    Chiu, Chien-Ching
    Lee, Yang-Han
    Chen, Po-Hsiang
    Shih, Ying-Chen
    Hao, Jiang
    SENSORS, 2024, 24 (07)
  • [34] Distorted underwater image reconstruction for an autonomous underwater vehicle based on a self-attention generative adversarial network
    Li, Tengyue
    Yang, Qianqian
    Rong, Shenghui
    Chen, Long
    He, Bo
    APPLIED OPTICS, 2020, 59 (32) : 10049 - 10060
  • [35] Image Super-Resolution Reconstruction Based on Self-Attention Mechanism and Deep Generative Adversarial Network
    Zhao, Yu-Feng
    He, Jie
    Journal of Network Intelligence, 2024, 9 (04): : 1936 - 1950
  • [36] Image Classification based on Self-attention Convolutional Neural Network
    Cai, Xiaohong
    Li, Ming
    Cao, Hui
    Ma, Jingang
    Wang, Xiaoyan
    Zhuang, Xuqiang
    SIXTH INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2021, 11913
  • [37] VSEGAN: VISUAL SPEECH ENHANCEMENT GENERATIVE ADVERSARIAL NETWORK
    Xu, Xinmeng
    Wang, Yang
    Xu, Dongxiang
    Peng, Yiyuan
    Zhang, Cong
    Jia, Jie
    Chen, Binbin
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7307 - 7311
  • [38] Speech Enhancement Using Generative Adversarial Network (GAN)
    Huq, Mahmudul
    Maskeliunas, Rytis
    HYBRID INTELLIGENT SYSTEMS, HIS 2021, 2022, 420 : 273 - 282
  • [39] TCN-SA: A Social Attention Network Based on Temporal Convolutional Network for Vehicle Trajectory Prediction
    Li, Qin
    Ou, Bingguang
    Liang, Yifa
    Wang, Yong
    Yang, Xuan
    Li, Linchao
    JOURNAL OF ADVANCED TRANSPORTATION, 2023, 2023
  • [40] AEGANB3: An Efficient Framework with Self-attention Mechanism and Deep Convolutional Generative Adversarial Network for Breast Cancer Classification
    Huong Hoang Luong
    Hai Thanh Nguyen
    Thai-Nghe, Nguyen
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (05) : 1386 - 1398