Cochannel Speech Segregation with Sparse Coding

被引:0
|
作者
Ingale, Pallavi P. [1 ]
Nalbalwar, S. L. [1 ]
机构
[1] Dr Babasaheb Ambedkar Technol Univ, Dept Elect & Telecommun Engn, Lonere, India
来源
2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT) | 2016年
关键词
Speech segregation; Sparse coding; Computational auditory scene analysis (CASA); ALGORITHM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Most of the computational auditory scene analysis (CASA) based systems rely on pitch based features. When we go for cochannel speech segregation, two speakers are involved. Pitch ranges for male speech and female speech overlap to a large extent. Therefore multi-pitch tracking becomes a nontrivial task. In case of same gender mixtures, again pitch tracking becomes harder. Considering this fact, we should go for some reliable features. Here we propose a cochannel speech segregation system with sparsity based features. Sparse coding is applied on the cochleagram of the signal to get sparse approximation coefficients using pre-trained dictionaries for speakers. We treat sparse approximation coefficients the features because these are selected from the speaker specific dictionaries to represent an input signal. Sparse approximation coefficients are good choice for finding binary masks. Speech waveform is resynthesized from the masked cochleagram of the mixture. Experimental results show that the proposed method produces better objective intelligibility scores than the baseline system.
引用
收藏
页码:4589 / 4592
页数:4
相关论文
共 50 条
  • [31] Sparse coding and dictionary learning for electron hologram denoising
    Anada, Satoshi
    Nomura, Yuki
    Hirayama, Tsukasa
    Yamamoto, Kazuo
    ULTRAMICROSCOPY, 2019, 206
  • [32] Adaptive sparse coding on PCA dictionary for image denoising
    Liu, Qian
    Zhang, Caiming
    Guo, Qiang
    Xu, Hui
    Zhou, Yuanfeng
    VISUAL COMPUTER, 2016, 32 (04) : 535 - 549
  • [33] Image encryption using sparse coding and compressive sensing
    Ponuma, R.
    Amutha, R.
    MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2019, 30 (04) : 1895 - 1909
  • [34] Suppression of clutter by rank adaptive reweighted sparse coding
    Sathyanarayna, Sushanth G.
    Acton, Scott T.
    Hossack, John A.
    2017 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IUS), 2017,
  • [35] Constructing Deep Sparse Coding Network for image classification
    Zhang, Shizhou
    Wang, Jinjun
    Tao, Xiaoyu
    Gong, Yihong
    Zheng, Nanning
    PATTERN RECOGNITION, 2017, 64 : 130 - 140
  • [36] Detecting shot boundary with sparse coding for video summarization
    Li, Jiatong
    Yao, Ting
    Ling, Qiang
    Mei, Tao
    NEUROCOMPUTING, 2017, 266 : 66 - 78
  • [37] Underdetermined Blind Source Separation Using Sparse Coding
    Zhen, Liangli
    Peng, Dezhong
    Yi, Zhang
    Xiang, Yong
    Chen, Peng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (12) : 3102 - 3108
  • [38] AN ALGORITHM FOR SPEECH SEGREGATION OF CO-CHANNEL SPEECH
    Vishnubhotla, Srikanth
    Espy-Wilson, Carol Y.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 109 - +
  • [39] LEARNED CONVOLUTIONAL SPARSE CODING
    Sreter, Hillel
    Giryes, Raja
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2191 - 2195
  • [40] Sparse Coding of Visual Context
    Miao, Jun
    Qing, Laiyun
    Duan, Lijuan
    Chen, Xilin
    Gao, Wen
    ADVANCES IN COGNITIVE NEURODYNAMICS, PROCEEDINGS, 2008, : 891 - +