Cochannel Speech Segregation with Sparse Coding

被引：0

作者：

Ingale, Pallavi P. ^{[1
]}

Nalbalwar, S. L. ^{[1
]}

机构：

[1] Dr Babasaheb Ambedkar Technol Univ, Dept Elect & Telecommun Engn, Lonere, India

来源：

2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, AND OPTIMIZATION TECHNIQUES (ICEEOT) | 2016年

关键词：

Speech segregation; Sparse coding; Computational auditory scene analysis (CASA); ALGORITHM;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Most of the computational auditory scene analysis (CASA) based systems rely on pitch based features. When we go for cochannel speech segregation, two speakers are involved. Pitch ranges for male speech and female speech overlap to a large extent. Therefore multi-pitch tracking becomes a nontrivial task. In case of same gender mixtures, again pitch tracking becomes harder. Considering this fact, we should go for some reliable features. Here we propose a cochannel speech segregation system with sparsity based features. Sparse coding is applied on the cochleagram of the signal to get sparse approximation coefficients using pre-trained dictionaries for speakers. We treat sparse approximation coefficients the features because these are selected from the speaker specific dictionaries to represent an input signal. Sparse approximation coefficients are good choice for finding binary masks. Speech waveform is resynthesized from the masked cochleagram of the mixture. Experimental results show that the proposed method produces better objective intelligibility scores than the baseline system.

引用

页码：4589 / 4592

页数：4

共 50 条

[31] Sparse coding and dictionary learning for electron hologram denoising
Anada, Satoshi
Nomura, Yuki
Hirayama, Tsukasa
Yamamoto, Kazuo
ULTRAMICROSCOPY, 2019, 206
[32] Adaptive sparse coding on PCA dictionary for image denoising
Liu, Qian
Zhang, Caiming
Guo, Qiang
Xu, Hui
Zhou, Yuanfeng
VISUAL COMPUTER, 2016, 32 (04) : 535 - 549
[33] Image encryption using sparse coding and compressive sensing
Ponuma, R.
Amutha, R.
MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2019, 30 (04) : 1895 - 1909
[34] Suppression of clutter by rank adaptive reweighted sparse coding
Sathyanarayna, Sushanth G.
Acton, Scott T.
Hossack, John A.
2017 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IUS), 2017,
[35] Constructing Deep Sparse Coding Network for image classification
Zhang, Shizhou
Wang, Jinjun
Tao, Xiaoyu
Gong, Yihong
Zheng, Nanning
PATTERN RECOGNITION, 2017, 64 : 130 - 140
[36] Detecting shot boundary with sparse coding for video summarization
Li, Jiatong
Yao, Ting
Ling, Qiang
Mei, Tao
NEUROCOMPUTING, 2017, 266 : 66 - 78
[37] Underdetermined Blind Source Separation Using Sparse Coding
Zhen, Liangli
Peng, Dezhong
Yi, Zhang
Xiang, Yong
Chen, Peng
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (12) : 3102 - 3108
[38] AN ALGORITHM FOR SPEECH SEGREGATION OF CO-CHANNEL SPEECH
Vishnubhotla, Srikanth
Espy-Wilson, Carol Y.
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 109 - +
[39] LEARNED CONVOLUTIONAL SPARSE CODING
Sreter, Hillel
Giryes, Raja
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2191 - 2195
[40] Sparse Coding of Visual Context
Miao, Jun
Qing, Laiyun
Duan, Lijuan
Chen, Xilin
Gao, Wen
ADVANCES IN COGNITIVE NEURODYNAMICS, PROCEEDINGS, 2008, : 891 - +

← 1 2 3 4 5 →