RESEARCH ON ENGLISH SPEECH ENHANCEMENT ALGORITHM BASED ON IMPROVED SPECTRAL SUBTRACTION AND DEEP NEURAL NETWORK

被引：1

作者：

Zhou, Qiaoling ^{[1
]}

机构：

[1] Fujian Agr & Forestry Univ, Int Coll, 15 Shangxiadian Rd, Fuzhou 350002, Peoples R China

来源：

INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL | 2020年 / 16卷 / 05期

关键词：

Improved spectrum subtraction; Deep neural network; Speech enhancement; Amplitude spectrum; English communication; NOISE;

D O I：

10.24507/ijicic.16.05.1711

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In order to solve the introduced unstructured voiceless problems of conventional spectrum subtraction in English speech signals enhancement, this paper proposes a novel English speech signals enhancement algorithm. This algorithm uses an improved minimal controlled recursive averaging (IMCRA) method to estimate noise spectrum, and tracks the estimated noise spectrum in real time. Then, the deep neural network (DNN) is used to construct the nonlinear mapping function of log amplitude spectrum between speech with noises and ideal pure speech for English speech enhancement. To validate the feasibility and effectiveness of the proposed algorithm, the standard IEEE speech signals and Noise-91 noise signals are used for experiments. Experimental results have shown that the proposed IMCRA method has stronger ability to avoid noises in speech signals, and the DNN method can well recover the speech components and spectrum structure polluted by noises. To enhance English speech in daily international speech communication, the proposed combination method has strong robustness to various real noise environments, and brings significant improvement to interpersonal communication and human computer communication.

引用

页码：1711 / 1723

页数：13

共 50 条

[21] A Speech Enhancement Algorithm Using Computational Auditory Scene Analysis with Spectral Subtraction
Guo, Cong
Hui, Like
Zhang, Wei-Qiang
Liu, Jia
2016 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2016, : 6 - 10
[22] Deep neural network and noise classification-based speech enhancement
Shi, Wenhua
Zhang, Xiongwei
Zou, Xia
Han, Wei
MODERN PHYSICS LETTERS B, 2017, 31 (19-21):
[23] Speech Enhancement Algorithm Combining Cochlear Features and Deep Neural Network with Skip Connections
Lan, Chaofeng
Wang, Yuqiao
Zhang, Lei
Yu, Zelong
Liu, Chundong
Guo, Xiaoxia
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2023, 95 (08): : 979 - 989
[24] Speech enhancement by spectral subtraction based on subspace decomposition
Murakami, T
Hoya, T
Ishida, Y
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (03): : 690 - 701
[25] Speech Enhancement Based on a Modified Spectral Subtraction Method
Islam, Md. T.
Shahnaz, C.
Fattah, S. A.
2014 IEEE 57TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2014, : 1085 - 1088
[26] Deep Neural Network for Supervised Single-Channel Speech Enhancement
Saleem, Nasir
Irfan Khattak, Muhammad
Ali, Muhammad Yousaf
Shafi, Muhammad
ARCHIVES OF ACOUSTICS, 2019, 44 (01) : 3 - 12
[27] Enhancement of speech using deep neural network with discrete cosine transform
Ram, Rashmirekha
Mohanty, Mihir Narayan
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 35 (01) : 141 - 148
[28] Speech Enhancement Based on Improved Deep Neural Networks with MMSE Pretreatment Features
Han, Wei
Wu, Congming
Zhang, Xiongwei
Sun, Meng
Min, Gang
PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 1140 - 1145
[29] A Maximum Likelihood Approach to Masking-based Speech Enhancement Using Deep Neural Network
Wang, Qing
Du, Jun
Chai, Li
Dai, Li-Rong
Lee, Chin-Hui
2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 295 - 299
[30] A Novel Single Channel Speech Enhancement Based on Joint Deep Neural Network and Wiener Filter
Han, Wei
Zhang, Xiongwei
Min, Gang
Zhou, Xingyu
PROCEEDINGS OF 2015 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATCS AND COMPUTING (IEEE PIC), 2015, : 163 - 167

← 1 2 3 4 5 →