The optimal ratio time-frequency mask for speech separation in terms of the signal-to-noise ratio

被引：22

作者：

Liang, Shan ^{[1
]}

Liu, Wenju ^{[1
]}

Jiang, Wei ^{[1
]}

Xue, Wei ^{[1
]}

机构：

[1] Chinese Acad Sci, Inst Automat, NLPR, Beijing 100190, Peoples R China

来源：

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2013年 / 134卷 / 05期

关键词：

ENHANCEMENT; RECOGNITION; BINARY;

D O I：

10.1121/1.4824632

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, a computational goal for a monaural speech separation system is proposed. Since this goal is derived by maximizing the signal-to-noise ratio (SNR), it is called the optimal ratio mask (ORM). Under the approximate W-Disjoint Orthogonality assumption which almost always holds due to the sparse nature of speech, theoretical analysis shows that the ORM can improve the SNR about 10log(10)2 dB over the ideal ratio mask. With three kinds of real-world interference, the speech separation results of SNR gain and objective quality evaluation demonstrate the correctness of the theoretical analysis, and imply that the ORMachieves a better separation performance. (C) 2013 Acoustical Society of America

引用

页码：EL452 / EL458

页数：7

共 50 条

[21] Optimal signal-to-noise ratio in stochastic time-delayed bistable systems
Shilong Gao
The European Physical Journal B, 2016, 89
[22] Optimal signal-to-noise ratio in stochastic time-delayed bistable systems
Gao, Shilong
EUROPEAN PHYSICAL JOURNAL B, 2016, 89 (04):
[23] ESTIMATION OF SIGNAL-TO-NOISE RATIO
RAUCH, S
IEEE TRANSACTIONS ON INFORMATION THEORY, 1969, 15 (1P1) : 166 - +
[24] Signal-to-noise ratio in MRI
Redpath, TW
BRITISH JOURNAL OF RADIOLOGY, 1998, 71 (847): : 704 - 707
[25] On Signal-to-Noise Ratio Estimation
Papic, Veljko
Djurovic, Zeljko
Kvascev, Goran
Tadic, Predrag
MELECON 2010: THE 15TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, 2010, : 160 - 165
[26] Enhancement of Signal-to-Noise Ratio
Dhara, A. K.
Journal of Statistical Physics, 87 (1-2):
[27] Enhancement of signal-to-noise ratio
Asish K. Dhara
Journal of Statistical Physics, 1997, 87 : 251 - 271
[28] The estimation of signal-to-noise ratio in continuous speech for disordered voices
Qi, YY
Hillman, RE
Milstein, C
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 105 (04): : 2532 - 2535
[29] LEARNING TO MAXIMIZE SIGNAL-TO-NOISE RATIO FOR REVERBERANT SPEECH SEGREGATION
Jin, Zhaozhang
Wang, DeLiang
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4689 - +
[30] The concept of signal-to-noise ratio in the modulation domain and speech intelligibility
Dubbelboer, Finn
Houtgast, Tarnmo
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2008, 124 (06): : 3937 - 3946

← 1 2 3 4 5 →