A CONVEX OPTIMIZATION APPROACH FOR TIME-FREQUENCY MASK ESTIMATION

Cited by: 0
Authors
Bao, Feng [1]
Abdulla, Waleed H. [1]
Affiliations
[1] Univ Auckland, Elect & Comp Engn Dept, 20 Symond St, Auckland 1010, New Zealand
Keywords
Computational auditory scene analysis (CASA); Ideal binary mask (IBM); Convex optimization; Speech enhancement; SPEECH; NOISE; ENHANCEMENT
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper, we propose a new time-frequency mask method for computational auditory scene analysis (CASA) based on convex optimization of the binary mask. In the proposed method, the pitch estimation and segment segregation of conventional CASA are replaced entirely by convex optimization of the speech power. The objective function for the speech power is built from the cross-correlation between the power spectra of the noisy speech and the noise in each channel of a Gammatone filterbank, and the speech power is estimated by gradient descent. The time-frequency units dominated by speech and by noise are then labeled by comparing the powers of the noisy speech, the estimated speech, and the noise. Erroneous local masks are removed using the Teager energy of the estimated speech and time-frequency unit smoothing. Results for the average segmental signal-to-noise ratio improvement, the HIT-False Alarm rate, and a subjective test show that the proposed method outperforms the reference methods.
Pages: 31-35
Number of pages: 5
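The pipeline summarized in the abstract (estimate the speech power by gradient descent on a convex objective, label each time-frequency unit, then use Teager energy to clean up the mask) can be illustrated with a rough sketch. This is not the authors' formulation: the record does not give their objective function, so the quadratic objective below is a hypothetical stand-in that ignores the cross-correlation term they describe. The function names, step size, iteration count, and the 0 dB local criterion are likewise assumptions made for illustration. Only the discrete Teager energy operator, psi[n] = x[n]^2 - x[n-1]*x[n+1], is the standard definition.

```python
import numpy as np


def estimate_speech_power(noisy_pow, noise_pow, steps=200, lr=0.1):
    """Projected gradient descent on a HYPOTHETICAL convex objective.

    Assumed objective per time-frequency unit (not from the paper):
        f(s) = 0.5 * (noisy_pow - noise_pow - s)**2,  subject to s >= 0
    """
    noisy_pow = np.asarray(noisy_pow, dtype=float)
    noise_pow = np.asarray(noise_pow, dtype=float)
    s = np.zeros_like(noisy_pow)
    for _ in range(steps):
        grad = s - (noisy_pow - noise_pow)   # derivative of the quadratic
        s = np.maximum(s - lr * grad, 0.0)   # projection keeps power non-negative
    return s


def binary_mask(speech_pow, noise_pow, lc_db=0.0):
    """Label a unit 1 (speech-dominated) when the estimated local SNR
    exceeds the local criterion lc_db (assumed 0 dB), else 0."""
    eps = 1e-12  # avoids log of zero
    snr_db = 10.0 * np.log10((speech_pow + eps) / (noise_pow + eps))
    return (snr_db > lc_db).astype(np.uint8)


def teager_energy(x):
    """Discrete Teager energy for the interior samples of x:
    psi[n] = x[n]**2 - x[n-1] * x[n+1]."""
    x = np.asarray(x, dtype=float)
    return x[1:-1] ** 2 - x[:-2] * x[2:]


# Toy power values for a 2-channel, 2-frame grid (made-up numbers).
noisy = np.array([[4.0, 1.0], [2.0, 5.0]])
noise = np.array([[1.0, 1.5], [1.0, 1.0]])

speech = estimate_speech_power(noisy, noise)
mask = binary_mask(speech, noise)
energy = teager_energy([1.0, 2.0, 3.0, 4.0])
```

In a fuller system the power values would come from the Gammatone filterbank channels, and the Teager energy of the estimated speech would be thresholded to discard isolated mask units. How the authors set that threshold, and how they smooth the mask, is not stated in this record.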