SVM-BASED SEPARATION OF UNVOICED-VOICED SPEECH IN COCHANNEL CONDITIONS

Authors:

Hu, Ke [1]

Wang, DeLiang [1]

Affiliation:

[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA

Source:

2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012

Keywords:

Cochannel speech separation; unvoiced speech; voiced speech; unit-level features; classification; SEGREGATION; ALGORITHM

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

Unvoiced-voiced portions of cochannel speech contain considerable amounts of both voiced and unvoiced speech and play a significant role in separation. Motivated by recent developments in separation of speech from nonspeech noise, we propose a classification-based approach for unvoiced-voiced speech separation. A new feature set consisting of pitch-based features and gammatone frequency cepstral coefficients is proposed to represent the characteristics of a time-frequency unit. The cepstral features do not rely on pitch and are thus more robust than the pitch-based features to pitch estimation errors. Speaker-independent support vector machines are trained for classification. Results based on the TIMIT corpus show that the proposed algorithm significantly improves unvoiced speech segregation compared to a recent algorithm.
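
The classification step described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the feature vectors are synthetic stand-ins (two "pitch-based" and three "cepstral" dimensions drawn from Gaussians), the label convention (+1 for a time-frequency unit assigned to one source, -1 for the other) is hypothetical, and a plain linear SVM trained by stochastic subgradient descent on the hinge loss replaces whatever kernel and training setup the paper uses. The names `make_units`, `train_linear_svm`, and `predict` are invented for this example.

```python
import random


def make_units(n, seed=0):
    """Generate synthetic time-frequency unit features (NOT real speech data).

    Each unit gets 5 features: dims 0-1 stand in for pitch-based features,
    dims 2-4 stand in for cepstral (GFCC-like) features. This is an
    assumption made purely so the example runs without audio.
    """
    rng = random.Random(seed)
    units = []
    for _ in range(n):
        label = rng.choice([-1, 1])  # hypothetical label convention
        features = [rng.gauss(2.0 * label, 0.5) for _ in range(5)]
        units.append((features, label))
    return units


def train_linear_svm(units, lam=0.01, lr=0.01, epochs=20, seed=0):
    """Train a linear SVM by subgradient descent on the regularized hinge loss."""
    rng = random.Random(seed)
    dim = len(units[0][0])
    w = [0.0] * dim
    b = 0.0
    for _ in range(epochs):
        order = list(range(len(units)))
        rng.shuffle(order)
        for i in order:
            x, y = units[i]
            margin = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
            # Gradient of the L2 regularizer shrinks the weights every step.
            w = [wi - lr * lam * wi for wi in w]
            # The hinge loss contributes only when the margin is violated.
            if margin < 1:
                w = [wi + lr * y * xi for wi, xi in zip(w, x)]
                b += lr * y
    return w, b


def predict(w, b, x):
    """Return +1 or -1 for one unit's feature vector."""
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if score >= 0 else -1


if __name__ == "__main__":
    train = make_units(200, seed=1)
    test = make_units(100, seed=2)
    w, b = train_linear_svm(train)
    correct = sum(predict(w, b, x) == y for x, y in test)
    print("accuracy on synthetic units:", correct / len(test))
```

In a real system the per-unit decisions would form a binary mask over the time-frequency representation; the synthetic classes here are far easier to separate than actual cochannel speech, so the accuracy this sketch reports says nothing about the paper's results.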

Pages: 4545-4548

Number of pages: 4
