A CONVEX OPTIMIZATION APPROACH FOR TIME-FREQUENCY MASK ESTIMATION

Cited by: 0
Authors
Bao, Feng [1]
Abdulla, Waleed H. [1]
Affiliations
[1] Univ Auckland, Elect & Comp Engn Dept, 20 Symond St, Auckland 1010, New Zealand
Keywords
Computational auditory scene analysis (CASA); Ideal binary mask (IBM); Convex optimization; Speech enhancement; SPEECH; NOISE; ENHANCEMENT;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
In this paper, we propose a new time-frequency mask estimation method for computational auditory scene analysis (CASA) based on convex optimization of the binary mask. In the proposed method, the pitch estimation and segment segregation of conventional CASA are replaced entirely by convex optimization of the speech power. The objective function for the speech power is built from the cross-correlation between the power spectra of the noisy speech and the noise in each channel of a Gammatone filterbank, and the speech power is estimated by gradient descent. The time-frequency units dominated by speech and by noise are then labeled by comparing the powers of the noisy speech, the estimated speech, and the noise. Erroneous local masks are removed using the Teager energy of the estimated speech and time-frequency unit smoothing. Results for the average segmental signal-to-noise ratio improvement, the HIT-False Alarm rate, and a subjective test show that the proposed method outperforms the reference methods.
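The pipeline described in the abstract (estimate speech power by gradient descent on a convex objective, label each time-frequency unit, score the mask with HIT minus False Alarm) can be illustrated with a minimal Python sketch. The abstract does not give the paper's objective function, so the quadratic objective below is an assumed stand-in and not the authors' formulation. The function names, the regularization weight `lam`, the step size, and the toy power values are all hypothetical, and the sketch covers a single filterbank channel only.

```python
def estimate_speech_power(noisy, noise, lam=0.1, step=0.5, iters=200):
    """Projected gradient descent on an ASSUMED convex objective (not the paper's):
        f(s) = 0.5 * sum((noisy - noise - s)^2) + 0.5 * lam * sum(s^2),  s >= 0
    """
    s = [0.0] * len(noisy)
    for _ in range(iters):
        for i, (y, n) in enumerate(zip(noisy, noise)):
            grad = (1.0 + lam) * s[i] - (y - n)   # derivative of f w.r.t. s[i]
            s[i] = max(0.0, s[i] - step * grad)   # step, then project onto s >= 0
    return s


def binary_mask(speech, noise):
    """Label a unit 1 (speech-dominated) when speech power exceeds noise power."""
    return [1 if s > n else 0 for s, n in zip(speech, noise)]


def hit_minus_fa(estimated, ideal):
    """HIT: fraction of speech-dominated units labeled 1.
    FA: fraction of noise-dominated units wrongly labeled 1."""
    hits = sum(1 for e, i in zip(estimated, ideal) if i == 1 and e == 1)
    false_alarms = sum(1 for e, i in zip(estimated, ideal) if i == 0 and e == 1)
    n_target = sum(ideal)
    n_noise = len(ideal) - n_target
    hit = hits / n_target if n_target else 0.0
    fa = false_alarms / n_noise if n_noise else 0.0
    return hit - fa


# Toy powers for six frames of one channel (made-up numbers for illustration).
clean = [4.0, 0.1, 3.0, 0.2, 5.0, 0.0]
noise = [1.0] * 6
noisy = [c + n for c, n in zip(clean, noise)]  # assumes speech and noise powers add

estimated = estimate_speech_power(noisy, noise)
mask = binary_mask(estimated, noise)
ideal = binary_mask(clean, noise)              # ideal binary mask from the true powers
score = hit_minus_fa(mask, ideal)
```

With this stand-in objective the minimizer has the closed form max((noisy - noise) / (1 + lam), 0), which makes the gradient descent result easy to check. A real system would use the paper's own objective and would add the Teager-energy and smoothing stages, which are not sketched here.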
Pages: 31-35
Number of pages: 5