The optimal ratio time-frequency mask for speech separation in terms of the signal-to-noise ratio

被引:22
|
作者
Liang, Shan [1 ]
Liu, Wenju [1 ]
Jiang, Wei [1 ]
Xue, Wei [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, NLPR, Beijing 100190, Peoples R China
来源
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2013年 / 134卷 / 05期
关键词
ENHANCEMENT; RECOGNITION; BINARY;
D O I
10.1121/1.4824632
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a computational goal for a monaural speech separation system is proposed. Since this goal is derived by maximizing the signal-to-noise ratio (SNR), it is called the optimal ratio mask (ORM). Under the approximate W-Disjoint Orthogonality assumption which almost always holds due to the sparse nature of speech, theoretical analysis shows that the ORM can improve the SNR about 10log(10)2 dB over the ideal ratio mask. With three kinds of real-world interference, the speech separation results of SNR gain and objective quality evaluation demonstrate the correctness of the theoretical analysis, and imply that the ORMachieves a better separation performance. (C) 2013 Acoustical Society of America
引用
收藏
页码:EL452 / EL458
页数:7
相关论文
共 50 条
  • [21] Optimal signal-to-noise ratio in stochastic time-delayed bistable systems
    Shilong Gao
    The European Physical Journal B, 2016, 89
  • [22] Optimal signal-to-noise ratio in stochastic time-delayed bistable systems
    Gao, Shilong
    EUROPEAN PHYSICAL JOURNAL B, 2016, 89 (04):
  • [23] ESTIMATION OF SIGNAL-TO-NOISE RATIO
    RAUCH, S
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1969, 15 (1P1) : 166 - +
  • [24] Signal-to-noise ratio in MRI
    Redpath, TW
    BRITISH JOURNAL OF RADIOLOGY, 1998, 71 (847): : 704 - 707
  • [25] On Signal-to-Noise Ratio Estimation
    Papic, Veljko
    Djurovic, Zeljko
    Kvascev, Goran
    Tadic, Predrag
    MELECON 2010: THE 15TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, 2010, : 160 - 165
  • [26] Enhancement of Signal-to-Noise Ratio
    Dhara, A. K.
    Journal of Statistical Physics, 87 (1-2):
  • [27] Enhancement of signal-to-noise ratio
    Asish K. Dhara
    Journal of Statistical Physics, 1997, 87 : 251 - 271
  • [28] The estimation of signal-to-noise ratio in continuous speech for disordered voices
    Qi, YY
    Hillman, RE
    Milstein, C
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 105 (04): : 2532 - 2535
  • [29] LEARNING TO MAXIMIZE SIGNAL-TO-NOISE RATIO FOR REVERBERANT SPEECH SEGREGATION
    Jin, Zhaozhang
    Wang, DeLiang
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4689 - +
  • [30] The concept of signal-to-noise ratio in the modulation domain and speech intelligibility
    Dubbelboer, Finn
    Houtgast, Tarnmo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 124 (06): : 3937 - 3946