LOW LATENCY ONLINE BLIND SOURCE SEPARATION BASED ON JOINT OPTIMIZATION WITH BLIND DEREVERBERATION

被引:11
|
作者
Ueda, Tetsuya [1 ,2 ]
Nakatani, Tomohiro [1 ]
Ikeshita, Rintaro [1 ]
Kinoshita, Keisuke [1 ]
Araki, Shoko [1 ]
Makino, Shoji [2 ]
机构
[1] NTT Corp, Tokyo, Japan
[2] Univ Tsukuba, Tsukuba, Ibaraki, Japan
关键词
Blind source separation; blind dereverberation; online; independent vector analysis; real-time; INDEPENDENT COMPONENT ANALYSIS; SPEECH;
D O I
10.1109/ICASSP39728.2021.9413700
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper presents a new low-latency online blind source separation (BSS) algorithm. Although algorithmic delay of a frequency domain online BSS can be reduced simply by shortening the short-time Fourier transform (STFT) frame length, it degrades the source separation performance in the presence of reverberation. This paper proposes a method to solve this problem by integrating BSS with Weighted Prediction Error (WPE) based dereverberation. Although a simple cascade of online BSS after online WPE upgrades the separation performance, the overall optimality is not guaranteed. Instead, this paper extends a recently proposed batch processing algorithm that can jointly optimize dereverberation and separation so that it can perform online processing with low computational cost and little processing delay (< 12 ms). The results of a source separation experiment in a noisy car environment suggest that the proposed online method has better separation performance than the simple cascaded methods.
引用
收藏
页码:506 / 510
页数:5
相关论文
共 50 条
  • [21] Guided joint diagonalization and its application to online blind source separation
    Li, Ronghua
    Zhou, Guoxu
    Wang, Jin
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE INFORMATION COMPUTING AND AUTOMATION, VOLS 1-3, 2008, : 1011 - +
  • [22] LOW ALGORITHMIC DELAY IMPLEMENTATION OF CONVOLUTIONAL BEAMFORMER FOR ONLINE JOINT SOURCE SEPARATION AND DEREVERBERATION
    Mo, Kaien
    Wang, Xianrui
    Yang, Yichen
    Makino, Shoji
    Chen, Jingdong
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 912 - 916
  • [23] Blind Separation of the Joint Algorithm Based on the Cyclostationarity Optimization
    Jing-hong, Xue
    Min, Li
    2016 IEEE INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION AND COMMUNICATION TECHNOLOGY ICEICT 2016 PROCEEDINGS, 2016, : 266 - 270
  • [24] Underdetermined Joint Blind Source Separation based on Tensor Decomposition
    Zou, Liang
    Wang, Z. Jane
    Chen, Xun
    Ji, Xiangyang
    2016 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2016,
  • [25] SEMI-BLIND SPEECH ENHANCEMENT BASED ON RECURRENT NEURAL NETWORK FOR SOURCE SEPARATION AND DEREVERBERATION
    Wake, Masaya
    Bando, Yoshiaki
    Mimura, Masato
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    2017 IEEE 27TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING, 2017,
  • [26] Joint Multichannel Deconvolution and Blind Source Separation
    Jiang, Ming
    Bobin, Jerome
    Starck, Jean-Luc
    SIAM JOURNAL ON IMAGING SCIENCES, 2017, 10 (04): : 1997 - 2021
  • [27] Blind Source Separation based on Improved Particle Swarm Optimization
    Li, Ming
    Li, Weijuan
    Wang, Yan
    Sun, Xiangfeng
    2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL III, PROCEEDINGS, 2009, : 307 - 310
  • [28] Blind Source Separation based on Collective Neurodynamic Optimization Approach
    Fan, Jianchao
    Wang, Ye
    Zhao, Jianhua
    Wang, Xiang
    Wang, Xinxin
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 2047 - 2051
  • [29] Blind source separation based on adaptive particle swarm optimization
    Zhang, Chao-Zhu
    Zhang, Jian-Pei
    Sun, Xiao-Dong
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2009, 31 (06): : 1275 - 1278
  • [30] Joint Dereverberation and Beamforming With Blind Estimation of the Shape Parameter of the Desired Source Prior
    Yadav, Shekhar Kumar
    George, Nithin V.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 779 - 793