RESEARCH ON ENGLISH SPEECH ENHANCEMENT ALGORITHM BASED ON IMPROVED SPECTRAL SUBTRACTION AND DEEP NEURAL NETWORK

被引：1

作者：

Zhou, Qiaoling ^{[1
]}

机构：

[1] Fujian Agr & Forestry Univ, Int Coll, 15 Shangxiadian Rd, Fuzhou 350002, Peoples R China

来源：

INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL | 2020年 / 16卷 / 05期

关键词：

Improved spectrum subtraction; Deep neural network; Speech enhancement; Amplitude spectrum; English communication; NOISE;

D O I：

10.24507/ijicic.16.05.1711

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In order to solve the introduced unstructured voiceless problems of conventional spectrum subtraction in English speech signals enhancement, this paper proposes a novel English speech signals enhancement algorithm. This algorithm uses an improved minimal controlled recursive averaging (IMCRA) method to estimate noise spectrum, and tracks the estimated noise spectrum in real time. Then, the deep neural network (DNN) is used to construct the nonlinear mapping function of log amplitude spectrum between speech with noises and ideal pure speech for English speech enhancement. To validate the feasibility and effectiveness of the proposed algorithm, the standard IEEE speech signals and Noise-91 noise signals are used for experiments. Experimental results have shown that the proposed IMCRA method has stronger ability to avoid noises in speech signals, and the DNN method can well recover the speech components and spectrum structure polluted by noises. To enhance English speech in daily international speech communication, the proposed combination method has strong robustness to various real noise environments, and brings significant improvement to interpersonal communication and human computer communication.

引用

页码：1711 / 1723

页数：13

共 50 条

[31] A Maximum Likelihood Approach to Masking-based Speech Enhancement Using Deep Neural Network
Wang, Qing
Du, Jun
Chai, Li
Dai, Li-Rong
Lee, Chin-Hui
2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 295 - 299
[32] Speech Intelligibility Potential of General and Specialized Deep Neural Network Based Speech Enhancement Systems
Kolbaek, Morten
Tan, Zheng-Hua
Jensen, Jesper
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (01) : 153 - 167
[33] Improving Deep Neural Network Based Speech Enhancement in Low SNR Environments
Gao, Tian
Du, Jun
Xu, Yong
Liu, Cong
Dai, Li-Rong
Lee, Chin-Hui
LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, LVA/ICA 2015, 2015, 9237 : 75 - 82
[34] SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement
Gao, Tian
Du, Jun
Dai, Li-Rong
Lee, Chin-Hui
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3713 - 3717
[35] Speech enhancement method based on the perceptual joint optimization deep neural network
Yuan W.
Lou Y.
Liang C.
Wang Z.
Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2019, 46 (02): : 90 - 94
[36] Deep neural network based speech enhancement using mono channel mask
Ingale, Pallavi P.
Nalbalwar, Sanjay L.
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (03) : 841 - 850
[37] Improved multi-band spectral subtraction method for speech enhancement
Ghanbari, Y
Karami-Mollaei, MR
Amelifard, B
PROCEEDINGS OF THE SIXTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, 2004, : 225 - 230
[38] Spectral Phase Estimation Based on Deep Neural Networks for Single Channel Speech Enhancement
N. Saleem
M. I. Khattak
E. V. Perez
Journal of Communications Technology and Electronics, 2019, 64 : 1372 - 1382
[39] A Perceptually Motivated Multi-Band Spectral Subtraction Algorithm for Enhancement of Degraded Speech
Upadhyay, Navneet
Karmakar, Abhijit
2012 THIRD INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGY (ICCCT), 2012, : 340 - 345
[40] An Iterative Graph Spectral Subtraction Method for Speech Enhancement
Yan, Xue
Yang, Zhen
Wang, Tingting
Guo, Haiyan
SPEECH COMMUNICATION, 2020, 123 : 35 - 42

← 1 2 3 4 5 →