SHO based Deep Residual network and hierarchical speech features for speech enhancement

被引：0

作者：

Bhosle M.R. ^{[1
,2
]}

Narayaswamy N.K. ^{[2
,3
]}

机构：

[1] Electronics and Communication Engineering, Government Engineering College, Raichur

[2] Visvesvaraya Technological University, Karnataka, Belagavi

[3] Department of ECE, Nagarjuna College of Engineering and Technology, Bangalore

来源：

International Journal of Speech Technology | 2023年 / 26卷 / 02期

关键词：

Bark Frequency Cepstral Coefficients; Deep residual network; Harmony search optimization algorithm; Shuffled Shepherd Optimization Algorithm; Speech enhancement;

D O I：

10.1007/s10772-022-09972-x

中图分类号：

学科分类号：

摘要：

The human frequently finds difficulty in understanding the speech due to the real-world noises. The presence of external noises corrupts the listening comfort of user. Hence there is a need for the enhancement of speech. In this paper, the Shepherd Harmony Optimization (SHO)-based Deep Residual network (DRN) is developed for speech enhancement. Here, the developed SHO-based DRN is the combination of the Shuffled Shepherd Optimization Algorithm (SSOA) and Harmony Search optimization (HS). The Hanning window is used for the pre-processing of the input data. In this method, the Bark Frequency Cepstral Coefficients (BFCC) and Fractional Delta amplitude modulation spectrogram (FD-AMS) are used for the feature extraction. Moreover, the noises present in speech signals are predicted for eliminating the distorted noises and external calamities. Besides, the DRN classifier is utilized to improve the speech signal. The classifier is trained by newly devised optimization algorithm. Besides, the developed speech enhancement technique obtained better performance in terms of Perceptual Evaluation of Speech Quality (PESQ) with 2.646 and Root Mean Square Error (RMSE) with 0.0067. © 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.

引用

页码：355 / 370

页数：15

共 50 条

[21] Fractional feature-based speech enhancement with deep neural network
Xu, Liyun
Zhang, Tong
SPEECH COMMUNICATION, 2023, 153
[22] Subjective intelligibility of deep neural network-based speech enhancement
Gelderblom, Femke B.
Tronstad, Tron V.
Viggen, Erlend Magnus
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1968 - 1972
[23] Speech Enhancement via Mask-Mapping Based Residual Dense Network
Zhou, Lin
Chen, Xijin
Wu, Chaoyan
Zhong, Qiuyue
Cheng, Xu
Tang, Yibin
CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (01): : 1259 - 1277
[24] Speech enhancement with MAP estimation and ICA-based speech features
Lee, JH
Jung, HY
Lee, TW
Lee, SY
ELECTRONICS LETTERS, 2000, 36 (17) : 1506 - 1507
[25] Speech Enhancement Algorithm Combining Cochlear Features and Deep Neural Network with Skip Connections
Lan, Chaofeng
Wang, Yuqiao
Zhang, Lei
Yu, Zelong
Liu, Chundong
Guo, Xiaoxia
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2023, 95 (08): : 979 - 989
[26] Speech Enhancement Algorithm Combining Cochlear Features and Deep Neural Network with Skip Connections
Chaofeng Lan
Yuqiao Wang
Lei Zhang
Zelong Yu
Chundong Liu
Xiaoxia Guo
Journal of Signal Processing Systems, 2023, 95 : 979 - 989
[27] Improvement of Speech Emotion Recognition by Deep Convolutional Neural Network and Speech Features
Mohanty, Aniruddha
Cherukuri, Ravindranath C.
Prusty, Alok Ranjan
THIRD CONGRESS ON INTELLIGENT SYSTEMS, CIS 2022, VOL 1, 2023, 608 : 117 - 129
[28] HIERARCHICAL NETWORK BASED ON THE FUSION OF STATIC AND DYNAMIC FEATURES FOR SPEECH EMOTION RECOGNITION
Cao, Qi
Hou, Mixiao
Chen, Bingzhi
Zhang, Zheng
Lu, Guangming
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6334 - 6338
[29] Speech Enhancement Based on Improved Deep Neural Networks with MMSE Pretreatment Features
Han, Wei
Wu, Congming
Zhang, Xiongwei
Sun, Meng
Min, Gang
PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 1140 - 1145
[30] Binaural Deep Neural Network for Robust Speech Enhancement
Jiang, Yi
Liu, Runsheng
2014 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2014, : 692 - 695

← 1 2 3 4 5 →