Recalling-Enhanced Recurrent Neural Network optimized with Chimp Optimization Algorithm based speech enhancement for hearing aids

被引：1

作者：

Rai, Rahul R. ^{[1
]}

Mathivanan, M. ^{[2
]}

机构：

[1] VTU, SJB Inst Technol, Dept Elect & Commun Engn, Belagavi, India

[2] ACS Coll Engn, Dept Elect & Commun Engn, Bengaluru, Karnataka, India

来源：

INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS | 2024年 / 18卷 / 01期

关键词：

Speech enhancement; hearing aids; MS-SNSD dataset; ternary pattern and discrete wavelet transforms; Recalling-Enhanced Recurrent Neural Network; chimp optimization algorithm;

D O I：

10.3233/IDT-230211

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Background noise often distorts the speech signals obtained in a real-world environment. This deterioration occurs in certain applications, like speech recognition, hearing aids. The aim of Speech enhancement (SE) is to suppress the unnecessary background noise in the obtained speech signal. The existing approaches for speech enhancement (SE) face more challenges like low Source-distortion ratio and memory requirements. In this manuscript, Recalling-Enhanced Recurrent Neural Network (R-ERNN) optimized with Chimp Optimization Algorithm based speech enhancement is proposed for hearing aids (R-ERNN-COA-SE-HA). Initially, the clean speech and noisy speech are amassed from MS-SNSD dataset. The input speech signals are encoded using vocoder analysis, and then the Sample RNN decode the bit stream into samples. The input speech signals are extracted using Ternary pattern and discrete wavelet transforms (TP-DWT) in the training phase. In the enhancement stage, R-ERNN forecasts the associated clean speech spectra from noisy speech spectra, then reconstructs a clean speech waveform. Chimp Optimization Algorithm (COA) is considered for optimizing the R-ERNN which enhances speech. The proposed method is implemented in MATLAB, and its efficiency is evaluated under some metrics. The R-ERNN-COA-SE-HA method provides 23.74%, 24.81%, and 19.33% higher PESQ compared with existing methods, such as RGRNN-SE-HA, PACDNN-SE-HA, ARN-SE-HA respectively.

引用

页码：123 / 134

页数：12

共 31 条

[1] Cantu MA, 2023, ICASSP 2023, P1
[2] Chen X, 2021, P CLAR WORKSH MACH L
[3] dagshub, About us
[4] A recalling-enhanced recurrent neural network: Conjugate gradient learning algorithm and its convergence analysis
Gao, Tao
Gong, Xiaoling
Zhang, Kai
Lin, Feng
Wang, Jian
Huang, Tingwen
Zurada, Jacek M.
[J]. INFORMATION SCIENCES, 2020, 519 (273-288) : 273 - 288
[5] Speech enhancement using long short term memory with trained speech features and adaptive wiener filter
Garg, Anil
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 3647 - 3675
[6] Real-Time Speech Enhancement Based on Convolutional Recurrent Neural Network
Girirajan, S.
Pandian, A.
[J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 35 (02) : 1987 - 2001
[7] CochleaNet: A robust language-independent audio-visual model for real-time speech enhancement
Gogate, Mandar
Dashtipour, Kia
Adeel, Ahsan
Hussain, Amir
[J]. INFORMATION FUSION, 2020, 63 : 273 - 285
[8] Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement
Green, Tim
Hilkhuysen, Gaston
Huckvale, Mark
Rosen, Stuart
Brookes, Mike
Moore, Alastair
Naylor, Patrick
Lightburn, Leo
Xue, Wei
[J]. TRENDS IN HEARING, 2022, 26
[9] PACDNN: A phase-aware composite deep neural network for speech enhancement
Hasannezhad, Mojtaba
Yu, Hongjiang
Zhu, Wei-Ping
Champagne, Benoit
[J]. SPEECH COMMUNICATION, 2022, 136 : 1 - 13
[10] The Minimum Overlap-Gap Algorithm for Speech Enhancement
Hoang, Poul
Tan, Zheng-Hua
De Haan, Jan Mark
Jensen, Jesper
[J]. IEEE ACCESS, 2022, 10 : 14698 - 14716

← 1 2 3 4 →