The optimal ratio time-frequency mask for speech separation in terms of the signal-to-noise ratio

被引:22
|
作者
Liang, Shan [1 ]
Liu, Wenju [1 ]
Jiang, Wei [1 ]
Xue, Wei [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, NLPR, Beijing 100190, Peoples R China
来源
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2013年 / 134卷 / 05期
关键词
ENHANCEMENT; RECOGNITION; BINARY;
D O I
10.1121/1.4824632
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a computational goal for a monaural speech separation system is proposed. Since this goal is derived by maximizing the signal-to-noise ratio (SNR), it is called the optimal ratio mask (ORM). Under the approximate W-Disjoint Orthogonality assumption which almost always holds due to the sparse nature of speech, theoretical analysis shows that the ORM can improve the SNR about 10log(10)2 dB over the ideal ratio mask. With three kinds of real-world interference, the speech separation results of SNR gain and objective quality evaluation demonstrate the correctness of the theoretical analysis, and imply that the ORMachieves a better separation performance. (C) 2013 Acoustical Society of America
引用
收藏
页码:EL452 / EL458
页数:7
相关论文
共 50 条
  • [31] The concept of signal-to-noise ratio in the modulation domain and speech intelligibility
    Dubbelboer, Finn
    Houtgast, Tammo
    Journal of the Acoustical Society of America, 2009, 124 (06): : 3937 - 3946
  • [32] A Signal-to-Noise Ratio Enhancer
    Lu, Ning H.
    2011 IEEE SENSORS APPLICATIONS SYMPOSIUM (SAS), 2011, : 34 - 38
  • [33] SIGNAL-TO-NOISE RATIO DEBATED
    HUNTER, JS
    QUALITY PROGRESS, 1987, 20 (05) : 7 - &
  • [34] Enhancement of signal-to-noise ratio
    Dhara, AK
    JOURNAL OF STATISTICAL PHYSICS, 1997, 87 (1-2) : 251 - 271
  • [35] Accuracy of speech transmission index predictions based on the reverberation time and signal-to-noise ratio
    Galbrun, Laurent
    Kitapci, Kivanc
    APPLIED ACOUSTICS, 2014, 81 : 1 - 14
  • [36] Signal-to-noise ratio and signal-to-noise efficiency in SMASH imaging
    Sodickson, DK
    Griswold, MA
    Jakob, PM
    Edelman, RR
    Manning, WJ
    MAGNETIC RESONANCE IN MEDICINE, 1999, 41 (05) : 1009 - 1022
  • [37] A FEATURE STUDY FOR CLASSIFICATION-BASED SPEECH SEPARATION AT VERY LOW SIGNAL-TO-NOISE RATIO
    Chen, Jitong
    Wang, Yuxuan
    Wang, DeLiang
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [38] Frequency Hopping Signal Detection in Low Signal-to-Noise Ratio Regimes
    Hasan, Md. Zoheb
    Couto, David J.
    Abdel-Malek, Mai A.
    Reed, Jeffrey H.
    2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC, 2023,
  • [39] Enhancement of Signal-to-noise Ratio of Peroneal Somatosensory Evoked Potential Using Independent Component Analysis and Time-frequency Template
    Hung, Chih-I
    Yang, Yea-Ru
    Wang, Ray-Yau
    Chou, Wen-Ling
    Hsieh, Jen-Chuen
    Wu, Yu-Te
    JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2008, 28 (03) : 161 - 166
  • [40] Optimal Linear Control for Channels with Signal-to-Noise Ratio Constraints
    Johannesson, Erik
    Rantzer, Anders
    Bernhardsson, Bo
    2011 AMERICAN CONTROL CONFERENCE, 2011, : 521 - 526