Cochannel Speech Separation Using Multi-pitch Estimation and Model Based Voiced Sequential Grouping

被引:0
|
作者
Li, Ming [1 ]
Cao, Chuan [1 ]
Wang, Di [1 ]
Lu, Ping [1 ]
Fu, Qiang [1 ]
Yan, Yonghong [1 ]
机构
[1] Chinese Acad Sci, ThinkIT Speech Lab, Inst Acoust, Beijing 100190, Peoples R China
关键词
Auditory scene analysis; cochannel speech; multi-pitch estimation; sequential grouping;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a new cochannel speech separation algorithm using multi-pitch extraction and speaker model based sequential grouping is proposed. After auditory segmentation based on onset and offset analysis, robust multi-pitch estimation algorithm is performed on each segment and the corresponding voiced portions are segregated. Then speaker pair model based on support vector machine (SVM) is employed to determine the optimal sequential grouping alignments and group the speaker homogeneous segments into pure speaker streams. Systematic evaluation on the speech separation challenge database shows significant improvement over the baseline performance.
引用
收藏
页码:151 / 154
页数:4
相关论文
共 50 条
  • [21] An iterative model-based approach to cochannel speech separation
    Hu, Ke
    Wang, DeLiang
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2013,
  • [22] Multi-Pitch Estimation using NHF with Multi-Dictionary Distinguishing Attack and Reverberation of Sounds
    Fujisawa, Takanori
    Harada, Sora
    Ikehara, Masaaki
    CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 1836 - 1841
  • [23] Multi-pitch Estimation Based on Sparse Representation with Pre-screened Dictionary
    Gao, Lufei
    Lee, Tan
    2015 IEEE 17TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2015,
  • [24] Deep Neural Network for Multi-Pitch Estimation Using Weighted Cross Entropy Loss
    Stone, Samuel
    Spector, Evan
    2021 IEEE WESTERN NEW YORK IMAGE AND SIGNAL PROCESSING WORKSHOP (WNYISPW), 2021,
  • [25] HMM-based speech enhancement using pitch period information in voiced speech segments
    Oberle, S
    Kaelin, A
    ISCAS '97 - PROCEEDINGS OF 1997 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS I - IV: CIRCUITS AND SYSTEMS IN THE INFORMATION AGE, 1997, : 2645 - 2648
  • [26] Pitch estimation using a modulation model of speech
    Gopalan, K
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 786 - 791
  • [27] Multi-Pitch Estimation of Polyphonic Music Based on Pseudo Two-Dimensional Spectrum
    Zhang, Weiwei
    Chen, Zhe
    Yin, Fuliang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 (28) : 2095 - 2108
  • [28] Joint Multi-Pitch Detection Using Harmonic Envelope Estimation for Polyphonic Music Transcription
    Benetos, Emmanouil
    Dixon, Simon
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (06) : 1111 - 1123
  • [29] Pitch estimation using music algorithm based on the sinusoidal speech model
    Amirkabir University of Technology, Electrical Engineering Department, Hafez Avenue, 15914 Tehran, Iran
    不详
    Advances in Communications and Software Technologies, 2002, : 255 - 258
  • [30] Evaluation of Zero Frequency Filtering based Method for Multi-pitch Streaming of Concurrent Speech Signals
    Mansali, Mariem Bouafif
    Backstrom, Tom
    Lachiri, Zied
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 286 - 290