Gain Adapted Optimum Mixture Estimation Scheme for Single Channel Speech Separation

被引:2
作者
Kapoor, Divneet Singh [1 ]
Kohli, Amit Kumar [2 ]
机构
[1] Chandigarh Grp Coll, Dept Elect & Commun Engn, Gharuan, Mohali, India
[2] Thapar Univ, Dept Elect & Commun Engn, Patiala 147004, Punjab, India
关键词
Single channel speech separation (SCSS); Optimum mixture estimator; Mixture-maximization (MixMax); Quadratic estimator; Gain adaptation; BLIND SOURCE SEPARATION; SEGREGATION; RECOGNITION; DRIVEN; SOUND;
D O I
10.1007/s00034-013-9566-7
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents the proof of an Optimum mixture estimator for the single channel speech separation problem, which is a technique for separating two speech signals from a single recording of their mixture. The presented work is an attempt to solve a fundamental limitation in the current single channel speech separation techniques, in which it is assumed that the data used in the training as well as test phases of the separation model have the same energy levels. To overcome this limitation, a gain adapted Optimum mixture estimator is derived, which estimates the mixture of speech signals under the different signal-to-signal ratios (SSRs). Specifically, the speakers' gains are incorporated as unknown parameters into the separation model, and then the estimator is derived in terms of the source distributions and SSR. It is demonstrated that the use of the Optimum mixture estimator results in the lower estimation error than the non-linear mapping (log and inverse-log operations)-based Mixture-Maximization (MixMax) or Quadratic estimators. The experimental results based on the real speech data also depict that the proposed estimator improves the mixture estimation performance significantly when compared with MixMax or Quadratic estimators with the gain adaptation.
引用
收藏
页码:2335 / 2351
页数:17
相关论文
共 28 条
  • [21] Hybrid Approach to Single-Channel Speech Separation Based on Coherent-Incoherent Modulation Filtering
    Mahmoodzadeh, Azar
    Abutalebi, Hamid Reza
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2017, 36 (05) : 1970 - 1988
  • [22] LOGATOM ARTICULATION INDEX EVALUATION OF SPEECH ENHANCED BY BLIND SOURCE SEPARATION AND SINGLE-CHANNEL NOISE REDUCTION
    Drgas, Szymon
    Kocinski, Jedrzej
    Sek, Aleksander P.
    ARCHIVES OF ACOUSTICS, 2008, 33 (04) : 455 - 474
  • [23] Single-Channel Blind Separation Using Pseudo-Stereo Mixture and Complex 2-D Histogram
    Tengtrairat, N.
    Gao, Bin
    Woo, W. L.
    Dlay, S. S.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (11) : 1722 - 1735
  • [24] Single-channel Speech Separation Using Dictionary-updated Orthogonal Matching Pursuit and Temporal Structure Information
    Guo, Haiyan
    Li, Xiaoxiong
    Zhou, Lin
    Wu, Zhenyang
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2015, 34 (12) : 3861 - 3882
  • [25] A Maximum Likelihood Approach to Deep Neural Network Based Nonlinear Spectral Mapping for Single-Channel Speech Separation
    Wang, Yannan
    Du, Jun
    Dai, Li-Rong
    Lee, Chin-Hui
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1178 - 1182
  • [26] Deep neural networks based binary classification for single channel speaker independent multi-talker speech separation
    Saleem, Nasir
    Khattak, Muhammad Irfan
    APPLIED ACOUSTICS, 2020, 167
  • [27] An adaptive speech source separation algorithm under overcomplete-cases using Laplacian mixture modeling for mixture matrix estimation by adaptive EM-type algorithm in wavelet packet domain
    Mozaffari B.
    Tinati M.A.
    International Journal of Speech Technology, 2008, 11 (01) : 33 - 42
  • [28] The Single-channel blind source separation based on VMD and Tukey M estimation for rolling bearing composite fault diagnosis
    Wang, Yaping
    Zhang, Qisong
    Cao, Ruofan
    Zhang, Sheng
    Li, Shisong
    Xu, Di
    MEASUREMENT & CONTROL, 2023, 56 (9-10) : 1599 - 1612