Multichannel speech enhancement algorithm based on hybrid reverberation model

Cited by: 0

Authors:

Xie, Yuan [1]

Zou, Tao [1]

Sun, Weijun [2]

Xie, Shengli [2]

Affiliations:

[1] School of Mechanical and Electrical Engineering, Guangzhou University, Guangzhou, 510006, China

[2] Key Laboratory of Intelligent Information Processing and System Integration of Internet of Things, Ministry of Education, Guangdong University of Technology, Guangzhou, 510006, China

Source:

Tongxin Xuebao/Journal on Communications | 2024, Vol. 45, No. 11

Funding:

National Natural Science Foundation of China

Keywords:

Eigenvalues and eigenfunctions - Matrix algebra - Noise abatement - Polynomials - Reverberation - Speech enhancement - Wiener filtering

DOI:

10.11959/j.issn.1000-436x.2024197

Abstract:

To address speech enhancement in reverberant, noisy conditions, a new speech enhancement model was constructed by integrating a multichannel linear prediction model with a spatial coherence model, and a multichannel speech enhancement algorithm based on this hybrid reverberation model was designed. The late reverberation was divided into two components, modeled by the multichannel linear prediction model and the spatial coherence model, respectively. A Kalman filter was used to update the model parameters, and polynomial matrix eigenvalue decomposition was used for spatial, temporal, and frequency decorrelation to reduce reverberation and noise. Experimental results show that the proposed algorithm enhances speech in both high- and low-reverberation noisy environments and outperforms popular speech enhancement algorithms: the perceptual evaluation of speech quality (PESQ) score and the short-time objective intelligibility (STOI) score increased by 30% and 20%, respectively. © 2024 Editorial Board of Journal on Communications. All rights reserved.
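The abstract gives no equations, so the following is only a minimal sketch of one ingredient it names: a Kalman filter that updates linear prediction coefficients. It assumes a single STFT frequency bin, a random-walk model for the coefficients, and a known variance `lam` for the desired signal. The function names, the process-noise level `q`, and the synthetic test signal are all hypothetical. The spatial coherence model and the polynomial matrix eigenvalue decomposition are not covered here.

```python
import numpy as np


def kalman_lp_step(g, P, x, y, lam, q=1e-4):
    """One Kalman update of prediction coefficients g in a single frequency bin.

    Assumed model (not taken from the paper): y = x @ g + d, where x stacks
    delayed multichannel observations, d is the desired signal with variance
    lam, and g follows a random walk with process noise q * I.
    """
    P_pred = P + q * np.eye(len(g))              # covariance after random-walk step
    e = y - x @ g                                # residual: estimate of the desired signal
    s = np.real(x @ P_pred @ x.conj()) + lam     # innovation variance (real scalar)
    k = P_pred @ x.conj() / s                    # Kalman gain
    g_new = g + k * e                            # coefficient update
    P_new = P_pred - np.outer(k, x @ P_pred)     # covariance update
    return g_new, P_new, e


def demo(num_frames=400, order=6, seed=0):
    """Run the filter on synthetic data and return the relative coefficient error."""
    rng = np.random.default_rng(seed)

    def cgauss(n=None):
        # Unit-variance circular complex Gaussian samples.
        return (rng.standard_normal(n) + 1j * rng.standard_normal(n)) / np.sqrt(2)

    g_true = cgauss(order)                       # hypothetical "true" coefficients
    g = np.zeros(order, dtype=complex)
    P = np.eye(order, dtype=complex)
    lam = 0.01                                   # assumed desired-signal variance
    for _ in range(num_frames):
        x = cgauss(order)                        # stand-in for delayed observations
        y = x @ g_true + np.sqrt(lam) * cgauss() # synthetic observation
        g, P, _ = kalman_lp_step(g, P, x, y, lam)
    return np.linalg.norm(g - g_true) / np.linalg.norm(g_true)
```

Calling `demo()` returns the relative error between the estimated and the synthetic coefficients. In a real system `lam` is unknown and time-varying and would have to be estimated per frame.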

Pages: 15 / 26

50 records in total

[31] A Speech Enhancement Algorithm for Speech Reconstruction Based on Laser Speckle Images
Hao, Xueying
Zhu, Dali
Wang, Xianlan
Yang, Long
Zeng, Hualin
SENSORS, 2023, 23 (01)
[32] Online Speech Dereverberation Algorithm Based on Adaptive Multichannel Linear Prediction
Yang, Jae-Mo
Kang, Hong-Goo
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (03) : 608 - 619
[33] A NEW UNCERTAINTY DECODING SCHEME FOR DNN-HMM HYBRID SYSTEMS WITH MULTICHANNEL SPEECH ENHANCEMENT
Huemmer, Christian
Schwarz, Andreas
Maas, Roland
Barfuss, Hendrik
Astudillo, Ramon Fernandez
Kellermann, Walter
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5760 - 5764
[34] A complex-valued multichannel speech enhancement learning algorithm for optimal tradeoff between noise reduction and speech distortion
Tu, Jingxian
Xia, Youshen
Zhang, Songchuan
NEUROCOMPUTING, 2017, 267 : 333 - 343
[35] A SUPERVISED MULTI-CHANNEL SPEECH ENHANCEMENT ALGORITHM BASED ON BAYESIAN NMF MODEL
Chung, Hanwook
Plourde, Eric
Champagne, Benoit
2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 221 - 225
[36] A NOVEL NMF-HMM SPEECH ENHANCEMENT ALGORITHM BASED ON POISSON MIXTURE MODEL
Xiang, Yang
Shi, Liming
Hojvang, Jesper Lisby
Rasmussen, Morten Hojfeldt
Christensen, Mads Grasboll
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 721 - 725
[37] Speech Enhancement Based on Modulation-Domain Parametric Multichannel Kalman Filtering
Xue, Wei
Moore, Alastair H.
Brookes, Mike
Naylor, Patrick A.
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 393 - 405
[38] Speech Data Enhancement Based on Hybrid Neural Network
Cao, Xinyue
Sun, Xiao
Ren, Fuji
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 362 - 372
[39] Multichannel Speech Enhancement With Own Voice-Based Interfering Speech Suppression for Hearing Assistive Devices
Hoang, Poul
de Haan, Jan Mark
Tan, Zheng-Hua
Jensen, Jesper
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 706 - 720
[40] Multichannel speech enhancement based on speech spectral magnitude estimation using generalized gamma prior distribution
Dat, Tran Huy
Takeda, Kazuya
Itakura, Fumitada
2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 4819 - 4822