Cochannel Speech Segregation with Sparse Coding

Cited by: 0
Authors
Ingale, Pallavi P. [1 ]
Nalbalwar, S. L. [1 ]
Affiliations
[1] Dr Babasaheb Ambedkar Technol Univ, Dept Elect & Telecommun Engn, Lonere, India
Source
2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT) | 2016
Keywords
Speech segregation; Sparse coding; Computational auditory scene analysis (CASA); ALGORITHM;
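DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification codes
0808 ; 0809 ;
Abstract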
Most computational auditory scene analysis (CASA) systems rely on pitch-based features. Cochannel speech segregation involves two speakers, and the pitch ranges of male and female speech overlap to a large extent, so multi-pitch tracking becomes a nontrivial task. For same-gender mixtures, pitch tracking becomes harder still. This calls for more reliable features. Here we propose a cochannel speech segregation system with sparsity-based features. Sparse coding is applied to the cochleagram of the signal to obtain sparse approximation coefficients using pre-trained dictionaries for the speakers. We treat the sparse approximation coefficients as the features because they are selected from the speaker-specific dictionaries to represent an input signal. Sparse approximation coefficients are a good choice for finding binary masks. The speech waveform is resynthesized from the masked cochleagram of the mixture. Experimental results show that the proposed method produces better objective intelligibility scores than the baseline system.
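
The pipeline described in the abstract (sparse coding over speaker dictionaries, then a binary mask) can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the abstract does not name a solver or a mask rule, so the nonnegative ISTA solver, the "larger per-speaker reconstruction wins" mask rule, and all names (`sparse_code`, `binary_mask`, `D1`, `D2`, `X`) are assumptions. Random matrices stand in for the pre-trained dictionaries and for the cochleagram magnitudes (channels x frames); the gammatone front end and the resynthesis step are omitted.

```python
import numpy as np

def sparse_code(D, X, lam=1e-3, n_iter=200):
    """Nonnegative ISTA for min_H 0.5*||X - D H||_F^2 + lam*sum(H), H >= 0.
    (Assumed solver; the abstract does not specify one.)"""
    L = np.linalg.norm(D, 2) ** 2            # Lipschitz constant of the gradient
    H = np.zeros((D.shape[1], X.shape[1]))
    for _ in range(n_iter):
        grad = D.T @ (D @ H - X)             # gradient of the quadratic term
        H = np.maximum(H - (grad + lam) / L, 0.0)  # shrink, then clip at zero
    return H

def binary_mask(D1, D2, X, lam=1e-3):
    """Code X over the concatenated dictionaries and mark the time-frequency
    units where speaker 1's reconstruction exceeds speaker 2's (assumed rule)."""
    H = sparse_code(np.hstack([D1, D2]), X, lam)
    k1 = D1.shape[1]
    S1 = D1 @ H[:k1]                         # speaker-1 estimate
    S2 = D2 @ H[k1:]                         # speaker-2 estimate
    return S1 > S2

# Toy data (hypothetical): speaker 1 occupies channels 0-7, speaker 2 channels 8-15.
rng = np.random.default_rng(0)
D1 = np.zeros((16, 4)); D1[:8] = rng.random((8, 4)) + 0.1
D2 = np.zeros((16, 4)); D2[8:] = rng.random((8, 4)) + 0.1
D1 /= np.linalg.norm(D1, axis=0)             # unit-norm atoms
D2 /= np.linalg.norm(D2, axis=0)
X = D1 @ (rng.random((4, 10)) + 0.1) + D2 @ (rng.random((4, 10)) + 0.1)

mask = binary_mask(D1, D2, X)                # True where speaker 1 dominates
masked = X * mask                            # masked "cochleagram" for speaker 1
```

In this toy setup the two speakers occupy disjoint channels, so the mask simply selects speaker 1's channels. With real, overlapping dictionaries the mask would vary per time-frequency unit.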
Pages: 4589 / 4592
Number of pages: 4