Multichannel speech enhancement algorithm based on hybrid reverberation model

被引:0
|
作者
Xie, Yuan [1 ]
Zou, Tao [1 ]
Sun, Weijun [2 ]
Xie, Shengli [2 ]
机构
[1] School of Mechanical and Electrical Engineering, Guangzhou University, Guangzhou,510006, China
[2] Key Laboratory of Intelligent Information Processing and System Integration of Internet of Things, Ministry of Education, Guangdong University of Technology, Guangzhou,510006, China
来源
基金
中国国家自然科学基金;
关键词
Eigenvalues and eigenfunctions - Matrix algebra - Noise abatement - Polynomials - Reverberation - Speech enhancement - Wiener filtering;
D O I
10.11959/j.issn.1000-436x.2024197
中图分类号
学科分类号
摘要
To solve the speech enhancement problem in reverberation and noise scenarios, a new speech enhancement model was constructed integrating multichannel linear prediction model and spatial coherence model, and then a multichannel speech enhancement algorithm based on a hybrid reverberation model was designed. The post-reverberation was divided into two components, which were modeled using a multichannel linear prediction model and a spatial coherence model, respectively. To optimize the model parameters, a Kalman filter was used to update the model parameters and polynomial matrix eigenvalue decomposition was used for spatial, temporal, and frequency decorrelation to achieve reverberation and noise reduction. Experimental results show that the proposed algorithm can enhance speech in high and low-reverberation noise environments, and its enhancement effect is superior to popular speech enhancement algorithms, the performance indicators of speech enhancement, perceptual evaluation of speech quality score (PESQ) value and short-time objective intelligibility (STOI) value, have increased by 30% and 20%, respectively. © 2024 Editorial Board of Journal on Communications. All rights reserved.
引用
收藏
页码:15 / 26
相关论文
共 50 条
  • [31] A Speech Enhancement Algorithm for Speech Reconstruction Based on Laser Speckle Images
    Hao, Xueying
    Zhu, Dali
    Wang, Xianlan
    Yang, Long
    Zeng, Hualin
    SENSORS, 2023, 23 (01)
  • [32] Online Speech Dereverberation Algorithm Based on Adaptive Multichannel Linear Prediction
    Yang, Jae-Mo
    Kang, Hong-Goo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (03) : 608 - 619
  • [33] A NEW UNCERTAINTY DECODING SCHEME FOR DNN-HMM HYBRID SYSTEMS WITH MULTICHANNEL SPEECH ENHANCEMENT
    Huemmer, Christian
    Schwarz, Andreas
    Maas, Roland
    Barfuss, Hendrik
    Astudillo, Ramon Fernandez
    Kellermann, Walter
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5760 - 5764
  • [34] A complex-valued multichannel speech enhancement learning algorithm for optimal tradeoff between noise reduction and speech distortion
    Tu, Jingxian
    Xia, Youshen
    Zhang, Songchuan
    NEUROCOMPUTING, 2017, 267 : 333 - 343
  • [35] A SUPERVISED MULTI-CHANNEL SPEECH ENHANCEMENT ALGORITHM BASED ON BAYESIAN NMF MODEL
    Chung, Hanwook
    Plourde, Eric
    Champagne, Benoit
    2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 221 - 225
  • [36] A NOVEL NMF-HMM SPEECH ENHANCEMENT ALGORITHM BASED ON POISSON MIXTURE MODEL
    Xiang, Yang
    Shi, Liming
    Hojvang, Jesper Lisby
    Rasmussen, Morten Hojfeldt
    Christensen, Mads Grasboll
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 721 - 725
  • [37] Speech Enhancement Based on Modulation-Domain Parametric Multichannel Kalman Filtering
    Xue, Wei
    Moore, Alastair H.
    Brookes, Mike
    Naylor, Patrick A.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 393 - 405
  • [38] Speech Data Enhancement Based on Hybrid Neural Network
    Cao, Xinyue
    Sun, Xiao
    Ren, Fuji
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 362 - 372
  • [39] Multichannel Speech Enhancement With Own Voice-Based Interfering Speech Suppression for Hearing Assistive Devices
    Hoang, Poul
    de Haan, Jan Mark
    Tan, Zheng-Hua
    Jensen, Jesper
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 706 - 720
  • [40] Multichannel speech enhancement based on speech spectral magnitude estimation using generalized gamma prior distribution
    Dat, Tran Huy
    Takeda, Kazuya
    Itakura, Fumitada
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 4819 - 4822