Single-Channel Speech Enhancement Based on Psychoacoustic Masking

Cited by: 3

Authors:

Zhou, Tingting [1]

Zeng, Yumin [1]

Wang, Rongrong [1]

Affiliation:

[1] Nanjing Normal Univ, Sch Phys & Technol, Nanjing, Jiangsu, Peoples R China

Source:

JOURNAL OF THE AUDIO ENGINEERING SOCIETY | 2017 / Vol. 65 / No. 04

Keywords:

NOISE; SUPPRESSION; BACKWARD;

DOI:

10.17743/jaes.2017.0003

Chinese Library Classification:

O42 [Acoustics];

Discipline classification codes:

070206; 082403;

Abstract:

This paper addresses single-channel speech enhancement based on the masking properties of the human auditory system. It presents a complete implementation of speech enhancement using psychoacoustic masking, based on a suggestion of Virag [1]. The proposed algorithm improves essential features by modifying simultaneous masking and introducing temporal masking. The noise masking threshold is calculated by considering both simultaneous and temporal masking, and is then used to adapt the subtraction parameters to obtain the best tradeoff, in a perceptual sense, between the amount of noise reduction, the speech distortion, and the level of residual noise. Objective measures and subjective listening tests demonstrate that the proposed algorithm outperforms comparable speech enhancement algorithms.
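
The abstract states that the noise masking threshold adapts the subtraction parameters. The record does not give the paper's actual formulas, so what follows is only a minimal sketch of one common way this is done in masking-based generalized spectral subtraction, and it should not be read as the authors' implementation. It assumes a per-bin masking threshold is already available (the simultaneous and temporal masking computation is not shown). The function names, the linear interpolation rule, and the parameter ranges (`alpha` in 1 to 6, `beta` in 0 to 0.02, exponents `gamma1 = 2`, `gamma2 = 0.5`) are illustrative assumptions.

```python
def adapt_parameter(threshold, t_min, t_max, p_min, p_max):
    """Map a masking threshold to a subtraction parameter (assumed linear rule).

    A low threshold (noise poorly masked) yields p_max, i.e. stronger
    suppression; a high threshold (noise well masked) yields p_min.
    """
    if t_max <= t_min:
        return p_max
    frac = (threshold - t_min) / (t_max - t_min)
    frac = min(1.0, max(0.0, frac))  # clamp to [0, 1]
    return p_max - frac * (p_max - p_min)


def subtraction_gain(noisy_mag, noise_mag, alpha, beta, gamma1=2.0, gamma2=0.5):
    """Generalized spectral subtraction gain for one frequency bin.

    alpha is the over-subtraction factor and beta the spectral floor.
    """
    if noisy_mag <= 0.0:
        return 0.0
    ratio = (noise_mag / noisy_mag) ** gamma1
    if ratio < 1.0 / (alpha + beta):
        return (1.0 - alpha * ratio) ** gamma2
    return (beta * ratio) ** gamma2  # fall back to the spectral floor


def enhance_frame(noisy_mags, noise_mags, thresholds,
                  alpha_range=(1.0, 6.0), beta_range=(0.0, 0.02)):
    """Apply threshold-adapted gains to one frame of magnitude spectra."""
    t_min, t_max = min(thresholds), max(thresholds)
    enhanced = []
    for y, d, t in zip(noisy_mags, noise_mags, thresholds):
        alpha = adapt_parameter(t, t_min, t_max, *alpha_range)
        beta = adapt_parameter(t, t_min, t_max, *beta_range)
        enhanced.append(subtraction_gain(y, d, alpha, beta) * y)
    return enhanced
```

In this sketch, bins where the threshold is high (noise is likely inaudible) are attenuated less, which limits speech distortion, while bins with a low threshold receive stronger suppression.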


Pages: 272-284

Page count: 13

