SVM-BASED SEPARATION OF UNVOICED-VOICED SPEECH IN COCHANNEL CONDITIONS

Cited by: 0
Authors
Hu, Ke [1]
Wang, DeLiang [1]
Affiliations
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
Source
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012
Keywords
Cochannel speech separation; unvoiced speech; voiced speech; unit-level features; classification; SEGREGATION; ALGORITHM;
DOI
Not available
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206; 082403;
Abstract
Unvoiced-voiced portions of cochannel speech contain considerable amounts of both voiced and unvoiced speech and play a significant role in separation. Motivated by recent developments in separation of speech from nonspeech noise, we propose a classification-based approach for unvoiced-voiced speech separation. A new feature set consisting of pitch-based features and gammatone frequency cepstral coefficients is proposed to represent the characteristics of a time-frequency unit. The cepstral features do not rely on pitch and are thus more robust than the pitch-based features to pitch estimation errors. Speaker-independent support vector machines are trained for classification. Results based on the TIMIT corpus show that the proposed algorithm significantly improves unvoiced speech segregation compared to a recent algorithm.
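The classification step summarized in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the abstract does not state the feature dimensions, the SVM kernel, or the training procedure, so the example uses synthetic stand-ins for the pitch-based and cepstral features and a plain linear SVM trained by subgradient descent on the hinge loss. The names `unit_features`, `train_linear_svm`, and `predict` are hypothetical and do not come from the paper.

```python
import random

def unit_features(pitch_feats, cepstral_feats):
    """Concatenate both feature groups for one time-frequency unit."""
    return list(pitch_feats) + list(cepstral_feats)

def train_linear_svm(X, y, lam=0.01, eta=0.01, epochs=200, seed=0):
    """Subgradient descent on the L2-regularized hinge loss; labels are -1 or +1."""
    rng = random.Random(seed)
    w = [0.0] * len(X[0])
    b = 0.0
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            score = sum(wj * xj for wj, xj in zip(w, X[i])) + b
            # Shrink the weights (regularization term of the objective).
            w = [(1.0 - eta * lam) * wj for wj in w]
            # Hinge-loss term: update only when the margin is violated.
            if y[i] * score < 1.0:
                w = [wj + eta * y[i] * xj for wj, xj in zip(w, X[i])]
                b += eta * y[i]
    return w, b

def predict(w, b, x):
    """Return +1 (unit assigned to the target speaker) or -1 (interferer)."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0.0 else -1

# Synthetic data (assumption): 2 pitch-based and 3 cepstral values per unit,
# centered at +1 for target-dominant units and at -1 for interferer-dominant ones.
rng = random.Random(1)
X, y = [], []
for label in (1, -1):
    for _ in range(20):
        pitch = [label + rng.gauss(0.0, 0.2) for _ in range(2)]
        cepstral = [label + rng.gauss(0.0, 0.2) for _ in range(3)]
        X.append(unit_features(pitch, cepstral))
        y.append(label)

w, b = train_linear_svm(X, y)
acc = sum(predict(w, b, x) == t for x, t in zip(X, y)) / len(X)
print("training accuracy:", acc)
```

In practice the per-unit labels would form a binary time-frequency mask. A library SVM with a nonlinear kernel would be a natural replacement for the toy trainer above, but the abstract does not say which configuration the authors used.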
Pages: 4545-4548
Number of pages: 4
Related papers
50 in total
  • [1] Speech enhancement based on a voiced-unvoiced speech model
    Goh, Z
    Tan, KC
    Tan, BTG
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 401 - 404
  • [2] MODEL BASED BINAURAL ENHANCEMENT OF VOICED AND UNVOICED SPEECH
    Kavalekalam, Mathew Shaji
    Christensen, Mads Graesboll
    Boldt, Jesper B.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 666 - 670
  • [3] A novel two-step SVM classifier for voiced/unvoiced/silence classification of speech
    Qi, FY
    Bao, CC
    Liu, Y
    2004 International Symposium on Chinese Spoken Language Processing, Proceedings, 2004, : 77 - 80
  • [4] IFAS-based voiced/unvoiced classification of speech signal
    Arifianto, D
    Kobayashi, T
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 812 - 815
  • [5] Voiced/Unvoiced Classification Recovery in the Speech Decoder Based on GMM
    Wei Xuan
    Dang Xiaoyan
    Cui Huijuan
    Tang Kun
    ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 546 - 548
  • [6] VOICED UNVOICED SILENCE CLASSIFICATION OF SPEECH SIGNALS BASED ON STATISTICAL APPROACHES
    ALHASHEMY, BAR
    TAHA, SMR
    APPLIED ACOUSTICS, 1988, 25 (03) : 169 - 179
  • [7] The Complexity Analysis of Voiced and Unvoiced Speech Signal Based on Sample Entropy
    Sun, Guiqi
    Fan, Zhenyan
    Mastorakis, Nikos E.
    Kaminaris, Stavros D.
    Zhuang, Xiaodong
    2017 FOURTH INTERNATIONAL CONFERENCE ON MATHEMATICS AND COMPUTERS IN SCIENCES AND IN INDUSTRY (MCSI), 2017, : 26 - 29
  • [8] Cochannel Speech Separation Using Multi-pitch Estimation and Model Based Voiced Sequential Grouping
    Li, Ming
    Cao, Chuan
    Wang, Di
    Lu, Ping
    Fu, Qiang
    Yan, Yonghong
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 151 - 154
  • [9] Voiced/Unvoiced Decision Algorithm for HMM-based Speech Synthesis
    Kang, Shiyin
    Shuang, Zhiwei
    Duan, Quansheng
    Qin, Yong
    Cai, Lianhong
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 420 - +
  • [10] Sequential stream segregation of voiced and unvoiced speech sounds based on fundamental frequency
    David, Marion
    Lavandier, Mathieu
    Grimault, Nicolas
    Oxenham, Andrew J.
    HEARING RESEARCH, 2017, 344 : 235 - 243