Single-Channel Speech Enhancement Based on Psychoacoustic Masking

被引:3
作者
Zhou, Tingting [1 ]
Zeng, Yumin [1 ]
Wang, Rongrong [1 ]
机构
[1] Nanjing Normal Univ, Sch Phys & Technol, Nanjing, Jiangsu, Peoples R China
来源
JOURNAL OF THE AUDIO ENGINEERING SOCIETY | 2017年 / 65卷 / 04期
关键词
NOISE; SUPPRESSION; BACKWARD;
D O I
10.17743/jaes.2017.0003
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper addresses the problem of single-channel speech enhancement based on masking properties of the human auditory system. A complete implementation of speech enhancement using psychoacoustic masking based on a suggestion of Virag [1] is presented in this paper. Essential features have been improved by modifying simultaneous masking and introducing temporal masking in the proposed algorithm. The noise masking threshold is calculated by considering simultaneous masking as well as temporal masking, and then it is used to adapt the subtraction parameters to obtain the best tradeoff between the amount of noise reduction, the speech distortion, and the level of residual noise in a perceptual sense. The application of objective measures and subjective listening tests demonstrate that the proposed algorithm outperforms comparable speech enhancement algorithms.
引用
收藏
页码:272 / 284
页数:13
相关论文
共 50 条
  • [21] New Results in Modulation-Domain Single-Channel Speech Enhancement
    Mowlaee, Pejman
    Blass, Martin
    Kleijn, W. Bastiaan
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (11) : 2125 - 2137
  • [22] Modulation-domain Kalman filtering for single-channel speech enhancement
    So, Stephen
    Paliwal, Kuldip K.
    SPEECH COMMUNICATION, 2011, 53 (06) : 818 - 829
  • [23] Deep Learning with Augmented Kalman Filter for Single-Channel Speech Enhancement
    Roy, Sujan Kumar
    Nicolson, Aaron
    Paliwal, Kuldip K.
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [24] A HYBRID APPROACH TO COMBINING CONVENTIONAL AND DEEP LEARNING TECHNIQUES FOR SINGLE-CHANNEL SPEECH ENHANCEMENT AND RECOGNITION
    Tu, Yan-Hui
    Tashev, Ivan
    Zarar, Shuayb
    Lee, Chin-Hui
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2531 - 2535
  • [25] Phase-Sensitive Decision-Directed SNR Estimator for Single-Channel Speech Enhancement
    Ou, Shifeng
    Song, Peng
    Gao, Ying
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2017, 31 (08)
  • [26] Single-Channel Multitalker Speech Recognition
    Rennie, Steven J.
    Hershey, John R.
    Olsen, Peder A.
    IEEE SIGNAL PROCESSING MAGAZINE, 2010, 27 (06) : 66 - 80
  • [27] An evaluation of the perceptual quality of phase-aware single-channel speech enhancement
    Krawczyk-Becker, Martin
    Gerkmann, Timo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (04) : EL364 - EL369
  • [28] A GENERALIZED LOG-SPECTRAL AMPLITUDE ESTIMATOR FOR SINGLE-CHANNEL SPEECH ENHANCEMENT
    Chinaev, Aleksej
    Haeb-Umbach, Reinhold
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 4980 - 4984
  • [29] Single-channel speech enhancement with correlated spectral components: Limits-potential
    Mowlaee, Pejman
    Stahl, Johannes K. W.
    SPEECH COMMUNICATION, 2020, 121 (121) : 58 - 69
  • [30] FPGA Implementation of a Phase-Aware Single-Channel Speech Enhancement System
    Samui, Suman
    Sahu, Pragya
    Chakrabarti, Indrajit
    Ghosh, Soumya K.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2017, 36 (11) : 4688 - 4715