SHO based Deep Residual network and hierarchical speech features for speech enhancement

被引：0

作者：

Bhosle M.R. ^{[1
,2
]}

Narayaswamy N.K. ^{[2
,3
]}

机构：

[1] Electronics and Communication Engineering, Government Engineering College, Raichur

[2] Visvesvaraya Technological University, Karnataka, Belagavi

[3] Department of ECE, Nagarjuna College of Engineering and Technology, Bangalore

来源：

International Journal of Speech Technology | 2023年 / 26卷 / 02期

关键词：

Bark Frequency Cepstral Coefficients; Deep residual network; Harmony search optimization algorithm; Shuffled Shepherd Optimization Algorithm; Speech enhancement;

D O I：

10.1007/s10772-022-09972-x

中图分类号：

学科分类号：

摘要：

The human frequently finds difficulty in understanding the speech due to the real-world noises. The presence of external noises corrupts the listening comfort of user. Hence there is a need for the enhancement of speech. In this paper, the Shepherd Harmony Optimization (SHO)-based Deep Residual network (DRN) is developed for speech enhancement. Here, the developed SHO-based DRN is the combination of the Shuffled Shepherd Optimization Algorithm (SSOA) and Harmony Search optimization (HS). The Hanning window is used for the pre-processing of the input data. In this method, the Bark Frequency Cepstral Coefficients (BFCC) and Fractional Delta amplitude modulation spectrogram (FD-AMS) are used for the feature extraction. Moreover, the noises present in speech signals are predicted for eliminating the distorted noises and external calamities. Besides, the DRN classifier is utilized to improve the speech signal. The classifier is trained by newly devised optimization algorithm. Besides, the developed speech enhancement technique obtained better performance in terms of Perceptual Evaluation of Speech Quality (PESQ) with 2.646 and Root Mean Square Error (RMSE) with 0.0067. © 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.

引用

页码：355 / 370

页数：15

共 50 条

[41] Microphone Array Speech Enhancement Via Beamforming Based Deep Learning Network
Pathrose, Jeyasingh
Ismail, M. Mohamed
Mohan, P. Madhan
INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (07) : 781 - 790
[42] Speech enhancement method based on the perceptual joint optimization deep neural network
Yuan W.
Lou Y.
Liang C.
Wang Z.
Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2019, 46 (02): : 90 - 94
[43] SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement
Gao, Tian
Du, Jun
Dai, Li-Rong
Lee, Chin-Hui
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3713 - 3717
[44] Improving Deep Neural Network Based Speech Enhancement in Low SNR Environments
Gao, Tian
Du, Jun
Xu, Yong
Liu, Cong
Dai, Li-Rong
Lee, Chin-Hui
LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, LVA/ICA 2015, 2015, 9237 : 75 - 82
[45] A Deep Neural Network Based Kalman Filter for Time Domain Speech Enhancement
Yu, Hongjiang
Ouyang, Zhiheng
Zhu, Wei-Ping
Champagne, Benoit
Ji, Yunyun
2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
[46] Deep neural network based speech enhancement using mono channel mask
Pallavi P. Ingale
Sanjay L. Nalbalwar
International Journal of Speech Technology, 2019, 22 : 841 - 850
[47] GLOBAL VARIANCE EQUALIZATION FOR IMPROVING DEEP NEURAL NETWORK BASED SPEECH ENHANCEMENT
Xu, Yong
Du, Jun
Dai, Li-Rong
Lee, Chin-Hui
2014 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (CHINASIP), 2014, : 71 - 75
[48] Deep neural network based speech enhancement using mono channel mask
Ingale, Pallavi P.
Nalbalwar, Sanjay L.
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (03) : 841 - 850
[49] Improved Sparse NMF based Speech Enhancement Method with Deep Neural Network
Zou, Xia
Zhang, Xiongwei
Shi, Wenhua
Wang, Fupeng
Zhang, Jingtao
Gao, Mingyue
PROCEEDINGS OF THE 2ND INTERNATIONAL FORUM ON MANAGEMENT, EDUCATION AND INFORMATION TECHNOLOGY APPLICATION (IFMEITA 2017), 2017, 130 : 231 - 234
[50] A Novel Adversarial Training Scheme for Deep Neural Network based Speech Enhancement
Cornell, Samuele
Principi, Emanuele
Squartini, Stefano
2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,

← 1 2 3 4 5 →