Recalling-Enhanced Recurrent Neural Network optimized with Chimp Optimization Algorithm based speech enhancement for hearing aids

被引:1
作者
Rai, Rahul R. [1 ]
Mathivanan, M. [2 ]
机构
[1] VTU, SJB Inst Technol, Dept Elect & Commun Engn, Belagavi, India
[2] ACS Coll Engn, Dept Elect & Commun Engn, Bengaluru, Karnataka, India
来源
INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS | 2024年 / 18卷 / 01期
关键词
Speech enhancement; hearing aids; MS-SNSD dataset; ternary pattern and discrete wavelet transforms; Recalling-Enhanced Recurrent Neural Network; chimp optimization algorithm;
D O I
10.3233/IDT-230211
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Background noise often distorts the speech signals obtained in a real-world environment. This deterioration occurs in certain applications, like speech recognition, hearing aids. The aim of Speech enhancement (SE) is to suppress the unnecessary background noise in the obtained speech signal. The existing approaches for speech enhancement (SE) face more challenges like low Source-distortion ratio and memory requirements. In this manuscript, Recalling-Enhanced Recurrent Neural Network (R-ERNN) optimized with Chimp Optimization Algorithm based speech enhancement is proposed for hearing aids (R-ERNN-COA-SE-HA). Initially, the clean speech and noisy speech are amassed from MS-SNSD dataset. The input speech signals are encoded using vocoder analysis, and then the Sample RNN decode the bit stream into samples. The input speech signals are extracted using Ternary pattern and discrete wavelet transforms (TP-DWT) in the training phase. In the enhancement stage, R-ERNN forecasts the associated clean speech spectra from noisy speech spectra, then reconstructs a clean speech waveform. Chimp Optimization Algorithm (COA) is considered for optimizing the R-ERNN which enhances speech. The proposed method is implemented in MATLAB, and its efficiency is evaluated under some metrics. The R-ERNN-COA-SE-HA method provides 23.74%, 24.81%, and 19.33% higher PESQ compared with existing methods, such as RGRNN-SE-HA, PACDNN-SE-HA, ARN-SE-HA respectively.
引用
收藏
页码:123 / 134
页数:12
相关论文
共 31 条
  • [1] Cantu MA, 2023, ICASSP 2023, P1
  • [2] Chen X, 2021, P CLAR WORKSH MACH L
  • [3] dagshub, About us
  • [4] A recalling-enhanced recurrent neural network: Conjugate gradient learning algorithm and its convergence analysis
    Gao, Tao
    Gong, Xiaoling
    Zhang, Kai
    Lin, Feng
    Wang, Jian
    Huang, Tingwen
    Zurada, Jacek M.
    [J]. INFORMATION SCIENCES, 2020, 519 (273-288) : 273 - 288
  • [5] Speech enhancement using long short term memory with trained speech features and adaptive wiener filter
    Garg, Anil
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 3647 - 3675
  • [6] Real-Time Speech Enhancement Based on Convolutional Recurrent Neural Network
    Girirajan, S.
    Pandian, A.
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 35 (02) : 1987 - 2001
  • [7] CochleaNet: A robust language-independent audio-visual model for real-time speech enhancement
    Gogate, Mandar
    Dashtipour, Kia
    Adeel, Ahsan
    Hussain, Amir
    [J]. INFORMATION FUSION, 2020, 63 : 273 - 285
  • [8] Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement
    Green, Tim
    Hilkhuysen, Gaston
    Huckvale, Mark
    Rosen, Stuart
    Brookes, Mike
    Moore, Alastair
    Naylor, Patrick
    Lightburn, Leo
    Xue, Wei
    [J]. TRENDS IN HEARING, 2022, 26
  • [9] PACDNN: A phase-aware composite deep neural network for speech enhancement
    Hasannezhad, Mojtaba
    Yu, Hongjiang
    Zhu, Wei-Ping
    Champagne, Benoit
    [J]. SPEECH COMMUNICATION, 2022, 136 : 1 - 13
  • [10] The Minimum Overlap-Gap Algorithm for Speech Enhancement
    Hoang, Poul
    Tan, Zheng-Hua
    De Haan, Jan Mark
    Jensen, Jesper
    [J]. IEEE ACCESS, 2022, 10 : 14698 - 14716