Star DGT: a robust Gabor transform for speech denoising

被引:0
作者
Kouni, Vicky [1 ]
Rauhut, Holger [2 ]
Theoharis, Theoharis [1 ]
机构
[1] Natl & Kapodistrian Univ Athens, Dept Informat & Telecommun, Athens, Greece
[2] Rhein Westfal TH Aachen, Chair Math Informat Proc, Aachen, Germany
来源
SAMPLING THEORY SIGNAL PROCESSING AND DATA ANALYSIS | 2023年 / 21卷 / 01期
关键词
Denoising; Speech signal; Gabor transform; Window vector; Spark deficient Gabor frame; SIGNALS; NOISE; DEREVERBERATION;
D O I
10.1007/s43670-023-00053-x
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In this paper, we address the speech denoising problem, where Gaussian and coloured additive noises are to be removed from a given speech signal. Our approach is based on a redundant, analysis-sparse representation of the original speech signal. We pick an eigenvector of the Zauner unitary matrix and-under certain assumptions on the ambient dimension-we use it as window vector to generate a spark deficient Gabor frame. The analysis operator associated with such a frame, is a (highly) redundant Gabor transform, which we use as a sparsifying transform in the denoising procedure. We conduct computational experiments on real-world speech data, using as baseline three Gabor transforms generated by state-of-the-art window vectors in time-frequency analysis and compare their performance to the proposed Gabor transform. The results show that the proposed redundant Gabor transform outperforms previous ones consistently for all types of examined signals of noise.
引用
收藏
页数:20
相关论文
共 59 条
[1]  
Abdulatif S, 2021, EUR SIGNAL PR CONF, P451, DOI 10.23919/Eusipco47968.2020.9287606
[2]  
Attias H, 2001, ADV NEUR IN, V13, P758
[3]   Templates for convex cone problems with applications to sparse signal recovery [J].
Becker S.R. ;
Candès E.J. ;
Grant M.C. .
Mathematical Programming Computation, 2011, 3 (3) :165-218
[4]  
Bhattacharya Gautam, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P2898, DOI 10.1109/ICASSP.2014.6854130
[5]   Sampling Theorems for Signals From the Union of Finite-Dimensional Linear Subspaces [J].
Blumensath, Thomas ;
Davies, Mike E. .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2009, 55 (04) :1872-1882
[6]   Power iteration method for the several largest eigenvalues and eigenfunctions [J].
Booth, Thomas E. .
NUCLEAR SCIENCE AND ENGINEERING, 2006, 154 (01) :48-62
[7]  
Brajovic M., 2022, 26 INT C INF TECHN I, P1
[8]  
Chardon Gilles, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P3102, DOI 10.1109/ICASSP.2014.6854171
[9]   A new pre-whitening transform domain LMS algorithm and its application to speech denoising [J].
Chergui, Laid ;
Bouguezel, Saad .
SIGNAL PROCESSING, 2017, 130 :118-128
[10]   Time-Frequency Analysis, Denoising, Compression, Segmentation, and Classification of PCG Signals [J].
Chowdhury, Md Tanzil Hoque ;
Poudel, Khem Narayan ;
Hu, Yating .
IEEE ACCESS, 2020, 8 :160882-160890