Single-Channel Speech Enhancement Using Single Dimension Change Accelerated Particle Swarm Optimization for Subspace Partitioning

被引：4

作者：

Ghorpade, Kalpana ^{[1
]}

Khaparde, Arti ^{[2
]}

机构：

[1] Cummins Coll Engn Women, Dept Elect & Telecommun, Pune, Maharashtra, India

[2] MIT World Peace Univ, Dept ECE, Pune, Maharashtra, India

来源：

CIRCUITS SYSTEMS AND SIGNAL PROCESSING | 2023年 / 42卷 / 07期

关键词：

Eigenvalue decomposition; Modified accelerated particle swarm optimization; Speech enhancement; Subspace method; Voice activity detection; DESIGN; NOISE;

D O I：

10.1007/s00034-023-02324-3

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Speech signal gets contaminated by background noise affecting its quality and intelligibility. There are different sources of additive noise. This additive noise, either stationary or non-stationary, has a distinct distribution of noise energy in the frequency domain. Degraded speech affects the performance of speech-operated systems. Speech enhancement can reduce this additive noise. Here, we propose a subspace-based single-channel speech enhancement method using modified accelerated particle swarm optimization to optimize subspace partitioning. Principal components of noisy speech are partitioned into speech, speech plus noise, and noise only based on the signal-to-noise ratio of principal components. Voice activity detection is implemented to find the variance of additive noise. Modified accelerated particle swarm optimization optimizes the number of principal components in each partition and the weights of the components in each class. The proposed speech enhancement method gives better results for the quality and intelligibility measures of enhanced speech compared with conventional speech enhancement methods. We got 18.8% improvement in STOI for 0 dB restaurant noise, 20.5% improvement for 0 dB train noise, and 11.55% improvement for 0 dB exhibition noise. We got an improvement of 39.15% in PESQ for 0 dB babble noise, 41.57% for 0 dB car noise, and 31.79% increase for 0 dB airport noise. The average improvement in the segmental SNR of the enhanced speech is 8.32 dB for 0 dB noise. There is 4.4 dB improvement in SDR for the airport noise and 5.54 dB improvement for the station noise. We got this improvement with minimum speech distortion.

引用

页码：4343 / 4361

页数：19

共 50 条

[41] A Comparative Study on Single-Channel Noise Estimation Methods for Speech Enhancement [J].

Veisi, Hadi ;

Sameti, Hossein .

2012 12TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2012, :645-650

[42] Perceptual Weighting Deep Neural Networks for Single-channel Speech Enhancement [J].

Han, Wei ;

Zhang, Xiongwei ;

Min, Gang ;

Zhou, Xingyu ;

Zhang, Wei .

PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, :446-450

[43] Biophysically-inspired single-channel speech enhancement in the time domain [J].

Wen, Chuan ;

Verhulst, Sarah .

INTERSPEECH 2023, 2023, :775-779

[44] Two-Stage Single-Channel Speech Enhancement with Multi-Frame Filtering [J].

Lin, Shaoxiong ;

Zhang, Wangyou ;

Qian, Yanmin .

APPLIED SCIENCES-BASEL, 2023, 13 (08)

[45] Non-negative Matrix Factorization with Linear Constraints for Single-Channel Speech Enhancement [J].

Lyubimov, Nikolay ;

Kotov, Mikhail .

14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, :446-450

[46] A COMPUTATIONALLY-EFFICIENT SINGLE-CHANNEL SPEECH ENHANCEMENT ALGORITHM FOR MONAURAL HEARING AIDS [J].

Ayllon, David ;

Gil-Pita, Roberto ;

Utrilla-Manso, Manuel ;

Rosa-Zurera, Manuel .

2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, :2050-2054

[47] Multi-Level Single-Channel Speech Enhancement Using a Unified Framework for Estimating Magnitude and Phase Spectra [J].

Lavanya, T. ;

Nagarajan, T. ;

Vijayalakshmi, P. .

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 :1315-1327

[48] Robust Speaker Recognition Based on Single-Channel and Multi-Channel Speech Enhancement [J].

Taherian, Hassan ;

Wang, Zhong-Qiu ;

Chang, Jorge ;

Wang, DeLiang .

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 :1293-1302

[49] INCORPORATING MULTI-CHANNEL WIENER FILTER WITH SINGLE-CHANNEL SPEECH ENHANCEMENT ALGORITHM [J].

Yong, Pei Chee ;

Nordholm, Sven ;

Dam, Hai Huyen ;

Leung, Yee Hong ;

Lai, Chiong Ching .

2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, :7284-7288

[50] Single-channel speech separation using soft mask filtering [J].

Radfar, Mohammad H. ;

Dansereau, Richard M. .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08) :2299-2310

← 1 2 3 4 5 →