SHO based Deep Residual network and hierarchical speech features for speech enhancement

Cited by: 0
Authors
Bhosle M.R. [1 ,2 ]
Narayaswamy N.K. [2 ,3 ]
Affiliations
[1] Electronics and Communication Engineering, Government Engineering College, Raichur
[2] Visvesvaraya Technological University, Karnataka, Belagavi
[3] Department of ECE, Nagarjuna College of Engineering and Technology, Bangalore
Keywords
Bark Frequency Cepstral Coefficients; Deep residual network; Harmony search optimization algorithm; Shuffled Shepherd Optimization Algorithm; Speech enhancement
DOI
10.1007/s10772-022-09972-x
Abstract
Listeners often find speech difficult to understand in the presence of real-world noise, and external noise degrades listening comfort. Speech enhancement is therefore needed. In this paper, a Shepherd Harmony Optimization (SHO)-based Deep Residual Network (DRN) is developed for speech enhancement. SHO combines the Shuffled Shepherd Optimization Algorithm (SSOA) with Harmony Search (HS) optimization. The input data are pre-processed with a Hanning window. Bark Frequency Cepstral Coefficients (BFCC) and the Fractional Delta amplitude modulation spectrogram (FD-AMS) are used for feature extraction. The noise present in the speech signal is then predicted so that distortion and external disturbances can be removed. The DRN classifier is used to enhance the speech signal and is trained with the newly devised SHO algorithm. The developed speech enhancement technique achieved a Perceptual Evaluation of Speech Quality (PESQ) score of 2.646 and a Root Mean Square Error (RMSE) of 0.0067. © 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
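As a rough illustration of two pieces named in the abstract, the sketch below applies a Hanning window to overlapping frames and computes an RMSE between a clean and a degraded signal. It is not the authors' implementation: the frame length, hop size, toy sine signal, and the function names `hann_window`, `frame_signal`, and `rmse` are all assumptions made for this example, and the abstract does not state the settings actually used.

```python
import math

def hann_window(n):
    """Symmetric Hann (Hanning) window of length n."""
    if n == 1:
        return [1.0]
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / (n - 1)) for i in range(n)]

def frame_signal(signal, frame_len, hop):
    """Split a signal into overlapping frames and window each frame."""
    window = hann_window(frame_len)
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        frames.append([s * w for s, w in zip(frame, window)])
    return frames

def rmse(reference, estimate):
    """Root mean square error between two equal-length sequences."""
    if len(reference) != len(estimate):
        raise ValueError("sequences must have the same length")
    squared = sum((r - e) ** 2 for r, e in zip(reference, estimate))
    return math.sqrt(squared / len(reference))

# Toy data (assumed for illustration): a sine wave plus a constant offset.
clean = [math.sin(2.0 * math.pi * 5.0 * t / 100.0) for t in range(100)]
noisy = [c + 0.01 for c in clean]

frames = frame_signal(noisy, frame_len=20, hop=10)
print(len(frames), rmse(clean, noisy))
```

In a full pipeline, features such as BFCC would be computed from each windowed frame, and the RMSE would compare the enhanced output against the clean reference rather than the noisy input.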
Pages: 355-370
Number of pages: 15
Related papers
50 in total
  • [21] Fractional feature-based speech enhancement with deep neural network
    Xu, Liyun
    Zhang, Tong
    SPEECH COMMUNICATION, 2023, 153
  • [22] Subjective intelligibility of deep neural network-based speech enhancement
    Gelderblom, Femke B.
    Tronstad, Tron V.
    Viggen, Erlend Magnus
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1968 - 1972
  • [23] Speech Enhancement via Mask-Mapping Based Residual Dense Network
    Zhou, Lin
    Chen, Xijin
    Wu, Chaoyan
    Zhong, Qiuyue
    Cheng, Xu
    Tang, Yibin
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (01): : 1259 - 1277
  • [24] Speech enhancement with MAP estimation and ICA-based speech features
    Lee, JH
    Jung, HY
    Lee, TW
    Lee, SY
    ELECTRONICS LETTERS, 2000, 36 (17) : 1506 - 1507
  • [25] Speech Enhancement Algorithm Combining Cochlear Features and Deep Neural Network with Skip Connections
    Lan, Chaofeng
    Wang, Yuqiao
    Zhang, Lei
    Yu, Zelong
    Liu, Chundong
    Guo, Xiaoxia
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2023, 95 (08): : 979 - 989
  • [26] Speech Enhancement Algorithm Combining Cochlear Features and Deep Neural Network with Skip Connections
    Chaofeng Lan
    Yuqiao Wang
    Lei Zhang
    Zelong Yu
    Chundong Liu
    Xiaoxia Guo
    Journal of Signal Processing Systems, 2023, 95 : 979 - 989
  • [27] Improvement of Speech Emotion Recognition by Deep Convolutional Neural Network and Speech Features
    Mohanty, Aniruddha
    Cherukuri, Ravindranath C.
    Prusty, Alok Ranjan
    THIRD CONGRESS ON INTELLIGENT SYSTEMS, CIS 2022, VOL 1, 2023, 608 : 117 - 129
  • [28] HIERARCHICAL NETWORK BASED ON THE FUSION OF STATIC AND DYNAMIC FEATURES FOR SPEECH EMOTION RECOGNITION
    Cao, Qi
    Hou, Mixiao
    Chen, Bingzhi
    Zhang, Zheng
    Lu, Guangming
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6334 - 6338
  • [29] Speech Enhancement Based on Improved Deep Neural Networks with MMSE Pretreatment Features
    Han, Wei
    Wu, Congming
    Zhang, Xiongwei
    Sun, Meng
    Min, Gang
    PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 1140 - 1145
  • [30] Binaural Deep Neural Network for Robust Speech Enhancement
    Jiang, Yi
    Liu, Runsheng
    2014 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2014, : 692 - 695