RESEARCH ON ENGLISH SPEECH ENHANCEMENT ALGORITHM BASED ON IMPROVED SPECTRAL SUBTRACTION AND DEEP NEURAL NETWORK

被引:1
作者
Zhou, Qiaoling [1 ]
机构
[1] Fujian Agr & Forestry Univ, Int Coll, 15 Shangxiadian Rd, Fuzhou 350002, Peoples R China
来源
INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL | 2020年 / 16卷 / 05期
关键词
Improved spectrum subtraction; Deep neural network; Speech enhancement; Amplitude spectrum; English communication; NOISE;
D O I
10.24507/ijicic.16.05.1711
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In order to solve the introduced unstructured voiceless problems of conventional spectrum subtraction in English speech signals enhancement, this paper proposes a novel English speech signals enhancement algorithm. This algorithm uses an improved minimal controlled recursive averaging (IMCRA) method to estimate noise spectrum, and tracks the estimated noise spectrum in real time. Then, the deep neural network (DNN) is used to construct the nonlinear mapping function of log amplitude spectrum between speech with noises and ideal pure speech for English speech enhancement. To validate the feasibility and effectiveness of the proposed algorithm, the standard IEEE speech signals and Noise-91 noise signals are used for experiments. Experimental results have shown that the proposed IMCRA method has stronger ability to avoid noises in speech signals, and the DNN method can well recover the speech components and spectrum structure polluted by noises. To enhance English speech in daily international speech communication, the proposed combination method has strong robustness to various real noise environments, and brings significant improvement to interpersonal communication and human computer communication.
引用
收藏
页码:1711 / 1723
页数:13
相关论文
共 50 条
  • [21] A Speech Enhancement Algorithm Using Computational Auditory Scene Analysis with Spectral Subtraction
    Guo, Cong
    Hui, Like
    Zhang, Wei-Qiang
    Liu, Jia
    2016 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2016, : 6 - 10
  • [22] Deep neural network and noise classification-based speech enhancement
    Shi, Wenhua
    Zhang, Xiongwei
    Zou, Xia
    Han, Wei
    MODERN PHYSICS LETTERS B, 2017, 31 (19-21):
  • [23] Speech Enhancement Algorithm Combining Cochlear Features and Deep Neural Network with Skip Connections
    Lan, Chaofeng
    Wang, Yuqiao
    Zhang, Lei
    Yu, Zelong
    Liu, Chundong
    Guo, Xiaoxia
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2023, 95 (08): : 979 - 989
  • [24] Speech enhancement by spectral subtraction based on subspace decomposition
    Murakami, T
    Hoya, T
    Ishida, Y
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (03): : 690 - 701
  • [25] Speech Enhancement Based on a Modified Spectral Subtraction Method
    Islam, Md. T.
    Shahnaz, C.
    Fattah, S. A.
    2014 IEEE 57TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2014, : 1085 - 1088
  • [26] Deep Neural Network for Supervised Single-Channel Speech Enhancement
    Saleem, Nasir
    Irfan Khattak, Muhammad
    Ali, Muhammad Yousaf
    Shafi, Muhammad
    ARCHIVES OF ACOUSTICS, 2019, 44 (01) : 3 - 12
  • [27] Enhancement of speech using deep neural network with discrete cosine transform
    Ram, Rashmirekha
    Mohanty, Mihir Narayan
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 35 (01) : 141 - 148
  • [28] Speech Enhancement Based on Improved Deep Neural Networks with MMSE Pretreatment Features
    Han, Wei
    Wu, Congming
    Zhang, Xiongwei
    Sun, Meng
    Min, Gang
    PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 1140 - 1145
  • [29] A Maximum Likelihood Approach to Masking-based Speech Enhancement Using Deep Neural Network
    Wang, Qing
    Du, Jun
    Chai, Li
    Dai, Li-Rong
    Lee, Chin-Hui
    2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 295 - 299
  • [30] A Novel Single Channel Speech Enhancement Based on Joint Deep Neural Network and Wiener Filter
    Han, Wei
    Zhang, Xiongwei
    Min, Gang
    Zhou, Xingyu
    PROCEEDINGS OF 2015 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATCS AND COMPUTING (IEEE PIC), 2015, : 163 - 167