Online Gaussian Process for Nonstationary Speech Separation

被引:0
|
作者
Hsieh, Hsin-Lung [1 ]
Chien, Jen-Tzung [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
来源
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2 | 2010年
关键词
speech enhancement; speech separation; Gaussian process; online learning; variational Bayes;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In a practical speech enhancement system, it is required to enhance speech signals from the mixed signals, which were corrupted due to the nonstationary source signals and mixing conditions. The source voices may be from different moving speakers. The speakers may abruptly appear or disappear and may be permuted continuously. To deal with these scenarios with a varying number of sources, we present a new method for nonstationary speech separation. An online Gaussian process independent component analysis (OLGP-ICA) is developed to characterize the real-time temporal structure in time-varying mixing system and to capture the evolved statistics of independent sources from online observed signals. A variational Bayes algorithm is established to estimate the evolved parameters for dynamic source separation. In the experiments, the proposed OLGP-ICA is compared with other ICA methods and is illustrated to be effective in recovering speech and music signals in a nonstationary speaking environment.
引用
收藏
页码:394 / 397
页数:4
相关论文
共 50 条
  • [1] Online Nonstationary and Nonlinear Bandits with Recursive Weighted Gaussian Process
    Miyake, Yusuke
    Watanabe, Ryuji
    Mine, Tsunenori
    2024 IEEE 48TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC 2024, 2024, : 11 - 20
  • [2] NONSTATIONARY AND TEMPORALLY CORRELATED SOURCE SEPARATION USING GAUSSIAN PROCESS
    Hsieh, Hsin-Lung
    Chien, Jen-Tzung
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 2120 - 2123
  • [3] A General Nonstationary and Time-Varying Mixed Signal Blind Source Separation Method Based on Online Gaussian Process
    He, Pengju
    Qi, Mi
    Li, Wenhui
    Tang, Mengyang
    Zhao, Ziwei
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2020, 34 (11)
  • [4] ONLINE SPEECH SOURCE SEPARATION BASED ON MAXIMUM LIKELIHOOD OF LOCAL GAUSSIAN MODELING
    Togami, Masahito
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 213 - 216
  • [5] Single-channel Speech Separation based on Gaussian Process Regression
    Le Dinh Nguyen
    Chen, Sih-Huei
    Tai, Tzu-Chiang
    Wang, Jia-Ching
    2018 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2018), 2018, : 275 - 278
  • [6] Nonstationary Source Separation for Underdetermined Speech Mixtures
    Corey, Ryan M.
    Singer, Andrew C.
    2016 50TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2016, : 934 - 938
  • [7] Sequential Gaussian Processes for Online Learning of Nonstationary Functions
    Zhang, Michael Minyi
    Dumitrascu, Bianca
    Williamson, Sinead A.
    Engelhardt, Barbara E.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2023, 71 : 1539 - 1550
  • [8] Gaussian process for nonstationary time series prediction
    Brahim-Belhouari, S
    Bermak, A
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2004, 47 (04) : 705 - 712
  • [9] Nonstationary covariance functions for Gaussian process regression
    Paciorek, CJ
    Schervish, MJ
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 16, 2004, 16 : 273 - 280
  • [10] Simulation of cutting force using nonstationary Gaussian process
    A. M. M. Sharif Ullah
    Khalifa H. Harib
    Journal of Intelligent Manufacturing, 2010, 21 : 681 - 691