Cochannel Speech Separation Using Multi-pitch Estimation and Model Based Voiced Sequential Grouping

被引：0

作者：

Li, Ming ^{[1
]}

Cao, Chuan ^{[1
]}

Wang, Di ^{[1
]}

Lu, Ping ^{[1
]}

Fu, Qiang ^{[1
]}

Yan, Yonghong ^{[1
]}

机构：

[1] Chinese Acad Sci, ThinkIT Speech Lab, Inst Acoust, Beijing 100190, Peoples R China

来源：

INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | 2008年

关键词：

Auditory scene analysis; cochannel speech; multi-pitch estimation; sequential grouping;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, a new cochannel speech separation algorithm using multi-pitch extraction and speaker model based sequential grouping is proposed. After auditory segmentation based on onset and offset analysis, robust multi-pitch estimation algorithm is performed on each segment and the corresponding voiced portions are segregated. Then speaker pair model based on support vector machine (SVM) is employed to determine the optimal sequential grouping alignments and group the speaker homogeneous segments into pure speaker streams. Systematic evaluation on the speech separation challenge database shows significant improvement over the baseline performance.

引用

页码：151 / 154

页数：4

共 50 条

[21] An iterative model-based approach to cochannel speech separation
Hu, Ke
Wang, DeLiang
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2013,
[22] Multi-Pitch Estimation using NHF with Multi-Dictionary Distinguishing Attack and Reverberation of Sounds
Fujisawa, Takanori
Harada, Sora
Ikehara, Masaaki
CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 1836 - 1841
[23] Multi-pitch Estimation Based on Sparse Representation with Pre-screened Dictionary
Gao, Lufei
Lee, Tan
2015 IEEE 17TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2015,
[24] Deep Neural Network for Multi-Pitch Estimation Using Weighted Cross Entropy Loss
Stone, Samuel
Spector, Evan
2021 IEEE WESTERN NEW YORK IMAGE AND SIGNAL PROCESSING WORKSHOP (WNYISPW), 2021,
[25] HMM-based speech enhancement using pitch period information in voiced speech segments
Oberle, S
Kaelin, A
ISCAS '97 - PROCEEDINGS OF 1997 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS I - IV: CIRCUITS AND SYSTEMS IN THE INFORMATION AGE, 1997, : 2645 - 2648
[26] Pitch estimation using a modulation model of speech
Gopalan, K
2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 786 - 791
[27] Multi-Pitch Estimation of Polyphonic Music Based on Pseudo Two-Dimensional Spectrum
Zhang, Weiwei
Chen, Zhe
Yin, Fuliang
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 (28) : 2095 - 2108
[28] Joint Multi-Pitch Detection Using Harmonic Envelope Estimation for Polyphonic Music Transcription
Benetos, Emmanouil
Dixon, Simon
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (06) : 1111 - 1123
[29] Pitch estimation using music algorithm based on the sinusoidal speech model
Amirkabir University of Technology, Electrical Engineering Department, Hafez Avenue, 15914 Tehran, Iran
不详
Advances in Communications and Software Technologies, 2002, : 255 - 258
[30] Evaluation of Zero Frequency Filtering based Method for Multi-pitch Streaming of Concurrent Speech Signals
Mansali, Mariem Bouafif
Backstrom, Tom
Lachiri, Zied
28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 286 - 290

← 1 2 3 4 5 →