The optimal ratio time-frequency mask for speech separation in terms of the signal-to-noise ratio

被引：22

作者：

Liang, Shan ^{[1
]}

Liu, Wenju ^{[1
]}

Jiang, Wei ^{[1
]}

Xue, Wei ^{[1
]}

机构：

[1] Chinese Acad Sci, Inst Automat, NLPR, Beijing 100190, Peoples R China

来源：

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2013年 / 134卷 / 05期

关键词：

ENHANCEMENT; RECOGNITION; BINARY;

D O I：

10.1121/1.4824632

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, a computational goal for a monaural speech separation system is proposed. Since this goal is derived by maximizing the signal-to-noise ratio (SNR), it is called the optimal ratio mask (ORM). Under the approximate W-Disjoint Orthogonality assumption which almost always holds due to the sparse nature of speech, theoretical analysis shows that the ORM can improve the SNR about 10log(10)2 dB over the ideal ratio mask. With three kinds of real-world interference, the speech separation results of SNR gain and objective quality evaluation demonstrate the correctness of the theoretical analysis, and imply that the ORMachieves a better separation performance. (C) 2013 Acoustical Society of America

引用

页码：EL452 / EL458

页数：7

共 50 条

[41] The optimal threshold for removing noise from speech is similar across normal and impaired hearing-a time-frequency masking study
Healy, Eric W.
Vasko, Jordan L.
Wang, DeLiang
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 145 (06) : EL581 - EL586
[42] Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Luo, Yi
Mesgarani, Nima
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (08) : 1256 - 1266
[43] Speech Understanding Performance of Cochlear Implant Subjects Using Time-Frequency Masking-Based Noise Reduction
Qazi, Obaid Ur Rehman
van Dijk, Bas
Moonen, Marc
Wouters, Jan
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2012, 59 (05) : 1364 - 1373
[44] Increased sensitivity and signal-to-noise ratio in diffusion-weighted MRI using multi-echo acquisitions
Eichner, Cornelius
Paquette, Michael
Mildner, Toralf
Schlumm, Torsten
Pleh, Kamilla
Samuni, Liran
Crockford, Catherine
Wittig, Roman M.
Jaeger, Carsten
Moeller, Harald E.
Friederici, Angela D.
Anwander, Alfred
NEUROIMAGE, 2020, 221
[45] Modified segmental signal-to-noise ratio reflecting spectral masking effect for evaluating the performance of hearing aid algorithms
Yook, Sunhyun
Nam, Kyoung Won
Kim, Heepyung
Kwon, See Youn
Kim, Dongwook
Lee, Sangmin
Hong, Sung Hwa
Jang, Dong Pyo
Kim, In Young
SPEECH COMMUNICATION, 2013, 55 (10) : 1003 - 1010
[46] Comparison of Cooled and Uncooled IR Sensors by Means of Signal-to-Noise Ratio for NDT Diagnostics of Aerospace Grade Composites
Deane, Shakeb
Avdelidis, Nicolas P.
Ibarra-Castanedo, Clemente
Zhang, Hai
Nezhad, Hamed Yazdani
Williamson, Alex A.
Mackley, Tim
Maldague, Xavier
Tsourdos, Antonios
Nooralishahi, Parham
SENSORS, 2020, 20 (12) : 1 - 29
[47] Performance of compressed sensing for fluorine-19 magnetic resonance imaging at low signal-to-noise ratio conditions
Starke, Ludger
Pohlmann, Andreas
Prinz, Christian
Niendorf, Thoralf
Waiczies, Sonia
MAGNETIC RESONANCE IN MEDICINE, 2020, 84 (02) : 592 - 608
[48] A MLE-based blind signal separation method for time-frequency overlapped signal using neural network
Pang, Lihui
Tang, Yilong
Tan, Qingyi
Liu, Yulang
Yang, Bin
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2022, 2022 (01)
[49] Distributed Vibration Sensing System with High Signal-to-Noise Ratio Based on Ultra-Weak Fiber Bragg Grating
Tang Jianguan
Liu Yuzhe
Li Chengli
Guo Huiyong
Yang Minghong
ACTA OPTICA SINICA, 2021, 41 (13)
[50] Intelligent identification technology for high-order digital modulation signals under low signal-to-noise ratio conditions
Zha, Yanping
Wang, Hongjun
Shen, Zhexian
Shi, Yingchun
Shu, Feng
IET SIGNAL PROCESSING, 2023, 17 (02)

← 1 2 3 4 5 →