Orthogonalization of the Sensing Matrix Through Dominant Columns in Compressive Sensing for Speech Enhancement

被引:0
作者
Shukla, Vasundhara [1 ]
Swami, Preety D. D. [1 ]
机构
[1] Univ Inst Technol RGPV, Dept Elect & Commun Engn, Bhopal 462033, India
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 15期
关键词
compressive sensing (CS); orthogonal matching pursuit (OMP); sensing matrix optimization; voice activity detection (VAD); speech enhancement; particle swarm optimization (PSO); SIGNAL RECOVERY; UNDERDETERMINED SYSTEMS; UNCERTAINTY PRINCIPLES; LINEAR-EQUATIONS; PERFORMANCE; PURSUIT; NOISE;
D O I
10.3390/app13158954
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper introduces a novel speech enhancement approach called dominant columns group orthogonalization of the sensing matrix (DCGOSM) in compressive sensing (CS). DCGOSM optimizes the sensing matrix using particle swarm optimization (PSO), ensuring separate basis vectors for speech and noise signals. By utilizing an orthogonal matching pursuit (OMP) based CS signal reconstruction with this optimized matrix, noise components are effectively avoided, resulting in lower noise in the reconstructed signal. The reconstruction process is accelerated by iterating only through the known speech-contributing columns. DCGOSM is evaluated against various noise types using speech quality measures such as SNR, SSNR, STOI, and PESQ. Compared to other OMP-based CS algorithms and deep neural network (DNN)-based speech enhancement techniques, DCGOSM demonstrates significant improvements, with maximum enhancements of 42.54%, 62.97%, 27.48%, and 8.72% for SNR, SSNR, PESQ, and STOI, respectively. Additionally, DCGOSM outperforms DNN-based techniques by 20.32% for PESQ and 8.29% for STOI. Furthermore, it reduces recovery time by at least 13.2% compared to other OMP-based CS algorithms.
引用
收藏
页数:23
相关论文
共 67 条
[41]   MATCHING PURSUITS WITH TIME-FREQUENCY DICTIONARIES [J].
MALLAT, SG ;
ZHANG, ZF .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1993, 41 (12) :3397-3415
[42]   A Deep Learning Loss Function Based on the Perceptual Evaluation of the Speech Quality [J].
Manuel Martin-Donas, Juan ;
Manuel Gomez, Angel ;
Gonzalez, Jose A. ;
Peinado, Antonio M. .
IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (11) :1680-1684
[43]   A Review of Sparse Recovery Algorithms [J].
Marques, Elaine Crespo ;
Maciel, Nilson ;
Naviner, Lirida ;
Cai, Hao ;
Yang, Jun .
IEEE ACCESS, 2019, 7 :1300-1322
[44]   CoSaMP: Iterative signal recovery from incomplete and inaccurate samples [J].
Needell, D. ;
Tropp, J. A. .
APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2009, 26 (03) :301-321
[45]   Uniform Uncertainty Principle and Signal Recovery via Regularized Orthogonal Matching Pursuit [J].
Needell, Deanna ;
Vershynin, Roman .
FOUNDATIONS OF COMPUTATIONAL MATHEMATICS, 2009, 9 (03) :317-334
[46]  
Pilastri A., 2016, 11 EDITION DOCTORAL, DOI DOI 10.1109/ICACCM50413.2020.9212838
[47]  
Prabhu KMM, 2014, WINDOW FUNCTIONS AND THEIR APPLICATIONS IN SIGNAL PROCESSING, P1
[48]  
Quackenbush S. R., 1988, Objective measures of speech quality
[49]   A Systematic Review of Compressive Sensing: Concepts, Implementations and Applications [J].
Rani, Meenu ;
Dhok, S. B. ;
Deshmukh, R. B. .
IEEE ACCESS, 2018, 6 :4875-4894
[50]  
Reznik YA, 2013, INT CONF ACOUST SPEE, P5642, DOI 10.1109/ICASSP.2013.6638744