SASEGAN-TCN: Speech enhancement algorithm based on self-attention generative adversarial network and temporal convolutional network

被引:1
|
作者
Lv R. [1 ]
Chen N. [1 ]
Cheng S. [1 ]
Fan G. [1 ]
Rao L. [1 ]
Song X. [1 ]
Lv W. [2 ]
Yang D. [3 ]
机构
[1] School of Electronic Information Engineering, Shanghai Dianji University, Shanghai
[2] School of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai
[3] Alibaba Group, Shanghai
基金
中国国家自然科学基金;
关键词
autoencoder; deep learning; generative adversarial network; speech enhancement;
D O I
10.3934/mbe.2024172
中图分类号
学科分类号
摘要
Traditional unsupervised speech enhancement models often have problems such as non-aggregation of input feature information, which will introduce additional noise during training, thereby reducing the quality of the speech signal. In order to solve the above problems, this paper analyzed the impact of problems such as non-aggregation of input speech feature information on its performance. Moreover, this article introduced a temporal convolutional neural network and proposed a SASEGAN-TCN speech enhancement model, which captured local features information and aggregated global feature information to improve model effect and training stability. The simulation experiment results showed that the model can achieve 2.1636 and 92.78% in perceptual evaluation of speech quality (PESQ) score and short-time objective intelligibility (STOI) on the Valentini dataset, and can accordingly reach 1.8077 and 83.54% on the THCHS30 dataset. In addition, this article used the enhanced speech data for the acoustic model to verify the recognition accuracy. The speech recognition error rate was reduced by 17.4%, which was a significant improvement compared to the baseline model experimental results. © 2024 the Author(s).
引用
收藏
页码:3860 / 3875
页数:15
相关论文
共 50 条
  • [1] SELF-ATTENTION GENERATIVE ADVERSARIAL NETWORK FOR SPEECH ENHANCEMENT
    Huy Phan
    Nguyen, Huy Le
    Chen, Oliver Y.
    Koch, Philipp
    Duong, Ngoc Q. K.
    McLoughlin, Ian
    Mertins, Alfred
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7103 - 7107
  • [2] Self-attention generative adversarial network with the conditional constraint
    Jia Y.
    Ma L.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2019, 46 (06): : 163 - 170
  • [3] Attribute Network Representation Learning Based on Generative Adversarial Network and Self-attention Mechanism
    Li, Shanshan
    Tang, Meiling
    Dong, Yingnan
    International Journal of Network Security, 2024, 26 (01) : 51 - 58
  • [4] SAPCGAN: Self-Attention based Generative Adversarial Network for Point Clouds
    Li, Yushi
    Baciu, George
    PROCEEDINGS OF 2020 IEEE 19TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC 2020), 2020, : 52 - 59
  • [5] Dialogue Generation Using Self-Attention Generative Adversarial Network
    Hatua, Amartya
    Nguyen, Trung T.
    Sung, Andrew H.
    2019 IEEE INTERNATIONAL CONFERENCE ON CONVERSATIONAL DATA & KNOWLEDGE ENGINEERING (CDKE), 2019, : 33 - 38
  • [6] GSC Based Speech Enhancement with Generative Adversarial Network
    Zhou, Yao
    Bao, Changchun
    Cheng, Rui
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 901 - 906
  • [7] Conditional self-attention generative adversarial network with differential evolution algorithm for imbalanced data classification
    Niu, Jiawei
    Liu, Zhunga
    Pan, Quan
    Yang, Yanbo
    LI, Yang
    CHINESE JOURNAL OF AERONAUTICS, 2023, 36 (03) : 303 - 315
  • [8] Conditional self-attention generative adversarial network with differential evolution algorithm for imbalanced data classification
    Jiawei NIU
    Zhunga LIU
    Quan PAN
    Yanbo YANG
    Yang LI
    Chinese Journal of Aeronautics , 2023, (03) : 303 - 315
  • [9] Generative Adversarial Network Based on Self-Attention Mechanism for Automatic Page Layout Generation
    Sun, Peng
    Liu, Xiaomei
    Weng, Liguo
    Liu, Ziheng
    APPLIED SCIENCES-BASEL, 2025, 15 (05):
  • [10] Conditional self-attention generative adversarial network with differential evolution algorithm for imbalanced data classification
    Jiawei NIU
    Zhunga LIU
    Quan PAN
    Yanbo YANG
    Yang LI
    Chinese Journal of Aeronautics, 2023, 36 (03) : 303 - 315