Cochannel Speech Segregation with Sparse Coding

Authors
Ingale, Pallavi P. [1]
Nalbalwar, S. L. [1]
Affiliations
[1] Dr Babasaheb Ambedkar Technol Univ, Dept Elect & Telecommun Engn, Lonere, India
Source
2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT), 2016
Keywords
Speech segregation; Sparse coding; Computational auditory scene analysis (CASA); Algorithm
DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract
Most computational auditory scene analysis (CASA) systems rely on pitch-based features. Cochannel speech segregation involves two speakers, and the pitch ranges of male and female speech overlap to a large extent, so multi-pitch tracking becomes a nontrivial task. For same-gender mixtures, pitch tracking is harder still. More reliable features are therefore needed. Here we propose a cochannel speech segregation system with sparsity-based features. Sparse coding is applied to the cochleagram of the signal to obtain sparse approximation coefficients using pre-trained, speaker-specific dictionaries. We treat the sparse approximation coefficients as the features because they are selected from the speaker-specific dictionaries to represent the input signal. These coefficients are a good choice for estimating binary masks. The speech waveform is resynthesized from the masked cochleagram of the mixture. Experimental results show that the proposed method produces better objective intelligibility scores than the baseline system.
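The pipeline in the abstract (sparse-code each cochleagram frame over speaker dictionaries, then derive a binary mask from the coefficients) can be illustrated with a minimal sketch. The record does not give the paper's solver, dictionary training procedure, or mask rule, so everything below is an assumption made for illustration: orthogonal matching pursuit as the sparse solver, a concatenated dictionary [D1, D2], and a mask that assigns a time-frequency unit to speaker 1 when that speaker's reconstructed energy is larger. The names `omp`, `binary_mask`, `D1`, `D2`, and `n_nonzero` are hypothetical.

```python
import numpy as np

def omp(D, y, n_nonzero):
    """Greedy orthogonal matching pursuit (assumed solver, not confirmed
    by the source): approximate y using at most n_nonzero columns of D."""
    residual = y.astype(float).copy()
    support = []
    sol = np.zeros(0)
    coef = np.zeros(D.shape[1])
    for _ in range(n_nonzero):
        # Pick the atom most correlated with the current residual.
        idx = int(np.argmax(np.abs(D.T @ residual)))
        if idx in support:
            break
        support.append(idx)
        # Refit all selected atoms jointly by least squares.
        sol, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ sol
        if np.linalg.norm(residual) < 1e-10:
            break
    if support:
        coef[support] = sol
    return coef

def binary_mask(mixture, D1, D2, n_nonzero=4):
    """Illustrative mask rule: a unit belongs to speaker 1 (True) when the
    part of the frame explained by D1 exceeds the part explained by D2.

    mixture: (channels, frames) cochleagram of the two-speaker mixture.
    D1, D2:  (channels, atoms) pre-trained speaker dictionaries.
    """
    D = np.hstack([D1, D2])
    k1 = D1.shape[1]
    mask = np.zeros(mixture.shape, dtype=bool)
    for t in range(mixture.shape[1]):
        a = omp(D, mixture[:, t], n_nonzero)
        est1 = np.abs(D1 @ a[:k1])   # speaker-1 share of the frame
        est2 = np.abs(D2 @ a[k1:])   # speaker-2 share of the frame
        mask[:, t] = est1 > est2
    return mask

# Toy usage: 8 channels, each speaker "owns" 4 channels.
D1 = np.eye(8)[:, :4]
D2 = np.eye(8)[:, 4:]
mixture = np.array([[1.0, 2.0, 0.0, 0.0, 0.0, 3.0, 0.0, 0.0]]).T
mask = binary_mask(mixture, D1, D2)
masked = mixture * mask  # speaker-1 estimate of the cochleagram
```

In a real system the dictionaries would be learned from each speaker's training cochleagrams, and the masked cochleagram would then be passed to a resynthesis stage; both steps are outside this sketch.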
Pages: 4589-4592
Number of pages: 4