SASEGAN-TCN: Speech enhancement algorithm based on self-attention generative adversarial network and temporal convolutional network

Cited by: 1
Authors
Lv R. [1]
Chen N. [1]
Cheng S. [1]
Fan G. [1]
Rao L. [1]
Song X. [1]
Lv W. [2]
Yang D. [3]
Affiliations
[1] School of Electronic Information Engineering, Shanghai Dianji University, Shanghai
[2] School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai
[3] Alibaba Group, Shanghai
Funding
National Natural Science Foundation of China
Keywords
autoencoder; deep learning; generative adversarial network; speech enhancement
DOI
10.3934/mbe.2024172
Abstract
Traditional unsupervised speech enhancement models often fail to aggregate input feature information, which introduces additional noise during training and degrades the quality of the enhanced speech signal. To address this, the paper analyzes how the non-aggregation of input speech feature information affects model performance. It then introduces a temporal convolutional network (TCN) and proposes the SASEGAN-TCN speech enhancement model, which captures local feature information and aggregates global feature information to improve both enhancement performance and training stability. In simulation experiments, the model achieves a perceptual evaluation of speech quality (PESQ) score of 2.1636 and a short-time objective intelligibility (STOI) of 92.78% on the Valentini dataset, and 1.8077 and 83.54%, respectively, on the THCHS30 dataset. The enhanced speech was also fed to an acoustic model to verify recognition accuracy: the speech recognition error rate fell by 17.4%, a significant improvement over the baseline model. © 2024 the Author(s).
Pages: 3860 - 3875
Page count: 15
Related papers
50 in total
  • [41] Self-Attention-Based Convolutional GRU for Enhancement of Adversarial Speech Examples
    Jannu, Chaitanya
    Vanambathina, Sunny Dayal
    INTERNATIONAL JOURNAL OF IMAGE AND GRAPHICS, 2024, 24 (06)
  • [42] Self-attention based progressive generative adversarial network optimized with momentum search optimization algorithm for classification of brain tumor on MRI image
    Nagarani, N.
    Karthick, R.
    Sophia, M. Sandra Carmel
    Binda, M. B.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 88
  • [44] Self-attention generative adversarial capsule network optimized with atomic orbital search algorithm based sentiment analysis for online product recommendation
    Periakaruppan, Sudhakaran
    Shanmugapriya, N.
    Sivan, Rajeswari
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (06) : 9347 - 9362
  • [45] Self-Attention conditional generative adversarial network optimised with crayfish optimization algorithm for improving cyber security in cloud computing
    Jose, G. Sahaya Stalin
    Sugitha, G.
    Lakshmi, S. Ayshwarya
    Chaluvaraj, Preethi Bangalore
    COMPUTERS & SECURITY, 2024, 140
  • [46] Self-attention driven adversarial similarity learning network
    Gao, Xinjian
    Zhang, Zhao
    Mu, Tingting
    Zhang, Xudong
    Cui, Chaoran
    Wang, Meng
    PATTERN RECOGNITION, 2020, 105
  • [47] Lung disease detection using Self-Attention Generative Adversarial Capsule network optimized with sun flower Optimization Algorithm
    Kumar, N. B. Mahesh
    Premalatha, K.
    Suvitha, S.
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
  • [48] Research on clothing patterns generation based on multi-scales self-attention improved generative adversarial network
    Yu, Zi-yan
    Luo, Tian-jian
    INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS, 2021, 14 (04) : 647 - 663
  • [49] Noise Prior Knowledge Learning for Speech Enhancement via Gated Convolutional Generative Adversarial Network
    Fan, Cunhang
    Liu, Bin
    Tao, Jianhua
    Yi, Jiangyan
    Wen, Zhengqi
    Bai, Ye
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019 : 662 - 666
  • [50] Defense method of smart grid GPS spoofing attack based on improved self-attention generative adversarial network
    Li Y.
    Yang S.
    Dianli Zidonghua Shebei/Electric Power Automation Equipment, 2021, 41 (11) : 100 - 106