SHO based Deep Residual network and hierarchical speech features for speech enhancement

被引:0
|
作者
Bhosle M.R. [1 ,2 ]
Narayaswamy N.K. [2 ,3 ]
机构
[1] Electronics and Communication Engineering, Government Engineering College, Raichur
[2] Visvesvaraya Technological University, Karnataka, Belagavi
[3] Department of ECE, Nagarjuna College of Engineering and Technology, Bangalore
关键词
Bark Frequency Cepstral Coefficients; Deep residual network; Harmony search optimization algorithm; Shuffled Shepherd Optimization Algorithm; Speech enhancement;
D O I
10.1007/s10772-022-09972-x
中图分类号
学科分类号
摘要
The human frequently finds difficulty in understanding the speech due to the real-world noises. The presence of external noises corrupts the listening comfort of user. Hence there is a need for the enhancement of speech. In this paper, the Shepherd Harmony Optimization (SHO)-based Deep Residual network (DRN) is developed for speech enhancement. Here, the developed SHO-based DRN is the combination of the Shuffled Shepherd Optimization Algorithm (SSOA) and Harmony Search optimization (HS). The Hanning window is used for the pre-processing of the input data. In this method, the Bark Frequency Cepstral Coefficients (BFCC) and Fractional Delta amplitude modulation spectrogram (FD-AMS) are used for the feature extraction. Moreover, the noises present in speech signals are predicted for eliminating the distorted noises and external calamities. Besides, the DRN classifier is utilized to improve the speech signal. The classifier is trained by newly devised optimization algorithm. Besides, the developed speech enhancement technique obtained better performance in terms of Perceptual Evaluation of Speech Quality (PESQ) with 2.646 and Root Mean Square Error (RMSE) with 0.0067. © 2022, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
引用
收藏
页码:355 / 370
页数:15
相关论文
共 50 条
  • [41] Microphone Array Speech Enhancement Via Beamforming Based Deep Learning Network
    Pathrose, Jeyasingh
    Ismail, M. Mohamed
    Mohan, P. Madhan
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (07) : 781 - 790
  • [42] Speech enhancement method based on the perceptual joint optimization deep neural network
    Yuan W.
    Lou Y.
    Liang C.
    Wang Z.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2019, 46 (02): : 90 - 94
  • [43] SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement
    Gao, Tian
    Du, Jun
    Dai, Li-Rong
    Lee, Chin-Hui
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3713 - 3717
  • [44] Improving Deep Neural Network Based Speech Enhancement in Low SNR Environments
    Gao, Tian
    Du, Jun
    Xu, Yong
    Liu, Cong
    Dai, Li-Rong
    Lee, Chin-Hui
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, LVA/ICA 2015, 2015, 9237 : 75 - 82
  • [45] A Deep Neural Network Based Kalman Filter for Time Domain Speech Enhancement
    Yu, Hongjiang
    Ouyang, Zhiheng
    Zhu, Wei-Ping
    Champagne, Benoit
    Ji, Yunyun
    2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2019,
  • [46] Deep neural network based speech enhancement using mono channel mask
    Pallavi P. Ingale
    Sanjay L. Nalbalwar
    International Journal of Speech Technology, 2019, 22 : 841 - 850
  • [47] GLOBAL VARIANCE EQUALIZATION FOR IMPROVING DEEP NEURAL NETWORK BASED SPEECH ENHANCEMENT
    Xu, Yong
    Du, Jun
    Dai, Li-Rong
    Lee, Chin-Hui
    2014 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (CHINASIP), 2014, : 71 - 75
  • [48] Deep neural network based speech enhancement using mono channel mask
    Ingale, Pallavi P.
    Nalbalwar, Sanjay L.
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (03) : 841 - 850
  • [49] Improved Sparse NMF based Speech Enhancement Method with Deep Neural Network
    Zou, Xia
    Zhang, Xiongwei
    Shi, Wenhua
    Wang, Fupeng
    Zhang, Jingtao
    Gao, Mingyue
    PROCEEDINGS OF THE 2ND INTERNATIONAL FORUM ON MANAGEMENT, EDUCATION AND INFORMATION TECHNOLOGY APPLICATION (IFMEITA 2017), 2017, 130 : 231 - 234
  • [50] A Novel Adversarial Training Scheme for Deep Neural Network based Speech Enhancement
    Cornell, Samuele
    Principi, Emanuele
    Squartini, Stefano
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,