Online Gaussian Process for Nonstationary Speech Separation

被引：0

作者：

Hsieh, Hsin-Lung ^{[1
]}

Chien, Jen-Tzung ^{[1
]}

机构：

[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan

来源：

11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2 | 2010年

关键词：

speech enhancement; speech separation; Gaussian process; online learning; variational Bayes;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In a practical speech enhancement system, it is required to enhance speech signals from the mixed signals, which were corrupted due to the nonstationary source signals and mixing conditions. The source voices may be from different moving speakers. The speakers may abruptly appear or disappear and may be permuted continuously. To deal with these scenarios with a varying number of sources, we present a new method for nonstationary speech separation. An online Gaussian process independent component analysis (OLGP-ICA) is developed to characterize the real-time temporal structure in time-varying mixing system and to capture the evolved statistics of independent sources from online observed signals. A variational Bayes algorithm is established to estimate the evolved parameters for dynamic source separation. In the experiments, the proposed OLGP-ICA is compared with other ICA methods and is illustrated to be effective in recovering speech and music signals in a nonstationary speaking environment.

引用

页码：394 / 397

页数：4

共 50 条

[1] Online Nonstationary and Nonlinear Bandits with Recursive Weighted Gaussian Process
Miyake, Yusuke
Watanabe, Ryuji
Mine, Tsunenori
2024 IEEE 48TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC 2024, 2024, : 11 - 20
[2] NONSTATIONARY AND TEMPORALLY CORRELATED SOURCE SEPARATION USING GAUSSIAN PROCESS
Hsieh, Hsin-Lung
Chien, Jen-Tzung
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 2120 - 2123
[3] A General Nonstationary and Time-Varying Mixed Signal Blind Source Separation Method Based on Online Gaussian Process
He, Pengju
Qi, Mi
Li, Wenhui
Tang, Mengyang
Zhao, Ziwei
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2020, 34 (11)
[4] ONLINE SPEECH SOURCE SEPARATION BASED ON MAXIMUM LIKELIHOOD OF LOCAL GAUSSIAN MODELING
Togami, Masahito
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 213 - 216
[5] Single-channel Speech Separation based on Gaussian Process Regression
Le Dinh Nguyen
Chen, Sih-Huei
Tai, Tzu-Chiang
Wang, Jia-Ching
2018 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2018), 2018, : 275 - 278
[6] Nonstationary Source Separation for Underdetermined Speech Mixtures
Corey, Ryan M.
Singer, Andrew C.
2016 50TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2016, : 934 - 938
[7] Sequential Gaussian Processes for Online Learning of Nonstationary Functions
Zhang, Michael Minyi
Dumitrascu, Bianca
Williamson, Sinead A.
Engelhardt, Barbara E.
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2023, 71 : 1539 - 1550
[8] Gaussian process for nonstationary time series prediction
Brahim-Belhouari, S
Bermak, A
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2004, 47 (04) : 705 - 712
[9] Nonstationary covariance functions for Gaussian process regression
Paciorek, CJ
Schervish, MJ
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 273 - 280
[10] Simulation of cutting force using nonstationary Gaussian process
A. M. M. Sharif Ullah
Khalifa H. Harib
Journal of Intelligent Manufacturing, 2010, 21 : 681 - 691

← 1 2 3 4 5 →