Combined architecture of adaptive beamforming and blind source separation for speech recognition of intelligent service robots

Authors
Woo, Sungmin [1]
Lee, Sanghoon [1]
Jeong, Hong [1]
Affiliations
[1] Pohang Univ Sci & Technol, Dept Elect & Elect Engn, Pohang 790784, South Korea
DOI
10.1109/IPC.2007.72
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Successful speech recognition for intelligent robots in noisy environments depends on the performance of the preprocessing elements employed. Although acoustic signals are often corrupted in high-noise environments, speech recognition systems such as the widely used HTK do not deal with signal distortion. We propose an architecture that effectively combines adaptive beamforming (ABF) and blind source separation (BSS) algorithms in the spatial domain. To avoid permutation ambiguity and heavy computational complexity in the BSS system, an adaptive generalized sidelobe canceller is placed in front of the BSS system. We slightly modify the conventional convolutive mixture model of the BSS for fast processing in hardware implementations. Unlike conventional BSS, the proposed system does not suffer from permutation ambiguity: because the target angle of the front-end beamformer is fixed, it always provides the enhanced signal and the reference noise signal to the two predefined inputs of the BSS. The proposed system also reduces the heavy computation that BSS requires when it has more than two inputs. The proposed time-domain approach can be easily implemented in hardware for real-time operation. We evaluated the structure and assessed its performance with a DSP module. Speech recognition experiments show that the proposed combined system achieves a high recognition rate in noisy environments and performs better than the ABF and BSS systems individually.
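The front end described in the abstract can be illustrated with a minimal time-domain sketch of a two-microphone generalized sidelobe canceller. This is not the authors' implementation: the function names `gsc_front_end` and `demo`, the broadside target assumption (the target reaches both microphones with no relative delay), the NLMS update rule, the filter length, the step size, and the synthetic signals are all assumptions made for illustration. The BSS stage that would consume the two outputs is not shown.

```python
import math
import random


def gsc_front_end(x1, x2, taps=8, mu=0.05, eps=1e-8):
    """Hypothetical two-microphone generalized sidelobe canceller.

    Assumes the target arrives from broadside, so it is identical in x1 and x2.
    Returns (enhanced, noise_ref), the two signals that a two-input BSS stage
    would receive on its fixed inputs.
    """
    w = [0.0] * taps      # adaptive noise-canceller weights
    buf = [0.0] * taps    # most recent noise-reference samples
    enhanced, noise_ref = [], []
    for a, b in zip(x1, x2):
        d = 0.5 * (a + b)  # fixed beamformer: target passes unchanged
        u = a - b          # blocking branch: target cancels, noise remains
        buf = [u] + buf[:-1]
        y = sum(wi * bi for wi, bi in zip(w, buf))  # noise estimate
        e = d - y                                   # enhanced output
        norm = sum(bi * bi for bi in buf) + eps
        # Normalized LMS update (an assumed adaptation rule)
        w = [wi + mu * e * bi / norm for wi, bi in zip(w, buf)]
        enhanced.append(e)
        noise_ref.append(u)
    return enhanced, noise_ref


def demo(n=4000, seed=0):
    """Compare the fixed beamformer and the GSC on synthetic data."""
    rng = random.Random(seed)
    s = [math.sin(2 * math.pi * 0.05 * k) for k in range(n)]  # stand-in target
    v = [rng.gauss(0.0, 1.0) for _ in range(n)]               # interfering noise
    x1 = [s[k] + v[k] for k in range(n)]
    # Second microphone sees a delayed, attenuated copy of the noise
    x2 = [s[k] + (0.3 * v[k - 1] if k else 0.0) for k in range(n)]
    enhanced, _ = gsc_front_end(x1, x2)
    fixed = [0.5 * (a + b) for a, b in zip(x1, x2)]
    tail = range(n - 1000, n)  # measure after the filter has adapted

    def mse(y):
        return sum((y[k] - s[k]) ** 2 for k in tail) / len(tail)

    return mse(fixed), mse(enhanced)


if __name__ == "__main__":
    err_fixed, err_gsc = demo()
    print("fixed beamformer MSE:", err_fixed)
    print("GSC output MSE:", err_gsc)
```

Because the steering direction is fixed, the first output always carries the enhanced target and the second always carries the noise reference, which is the property the abstract relies on to avoid permutation ambiguity in the following BSS stage.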
Pages: 214 - 219
Number of pages: 6
Related papers
50 in total
  • [1] Supervised speech separation combined with adaptive beamforming
    Saric, Zoran
    Subotic, Misko
    Bilibajkic, Ruzica
    Barjaktarovic, Marko
    Stojanovic, Jasmina
    COMPUTER SPEECH AND LANGUAGE, 2022, 76
  • [2] Adaptive blind source separation with HRTFs beamforming preprocessing
    Maazaoui, Mounira
    Abed-Meraim, Karim
    Grenier, Yves
    2012 IEEE 7TH SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP (SAM), 2012, : 269 - 272
  • [3] Improvement of Speech Recognition for Robots Using Blind Signal Separation
    Bicher, Daniel
    Kroll-Peters, Olaf
    Lee, Thebin
    Tiotuico, Natascha
    Wilhelm, Mathias
    ISCGAV'08: PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMPUTATIONAL GEOMETRY AND ARTIFICIAL VISION, 2008, : 52 - 55
  • [4] Blind Adaptive Principal Eigenvector Beamforming for Acoustical Source Separation
    Warsitz, Ernst
    Haeb-Umbach, Reinhold
    Vu, Dang Hai Tran
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 461 - 464
  • [5] Detection in present of reverberation Combined with Blind Source Separation and Beamforming
    Xu, Ce
    Zhang, Xinhua
    Xu, Zhaoyan
    2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 4, 2010, : 158 - 162
  • [6] BLIND SPEECH SEPARATION FOR ROBOTS WITH INTELLIGENT HUMAN-MACHINE INTERACTION
    Huang, Yulei
    Ding, Zhizhong
    Dai, Lirong
    Chen, Xiaoping
    Journal of Electronics(China), 2012, (Z2) : 286 - 293
  • [7] BLIND SPEECH SEPARATION FOR ROBOTS WITH INTELLIGENT HUMAN-MACHINE INTERACTION
    Huang, Yulei
    Ding, Zhizhong
    Dai, Lirong
    Chen, Xiaoping
    Journal of Electronics (China), 2012, 29 (Z2) : 286 - 293
  • [8] Adaptive Blind Beamforming for Intelligent Surface
    Lai, Wenhai
    Wang, Wenyu
    Xu, Fan
    Li, Xin
    Niu, Shaobo
    Shen, Kaiming
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (02) : 907 - 923
  • [9] Research on blind source separation and blind beamforming
    Zhao, B
    Yang, JA
    Zhang, M
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 4389 - 4393
  • [10] Neural Blind Source Separation and Diarization for Distant Speech Recognition
    Bando, Yoshiaki
    Nakamura, Tomohiko
    Watanabe, Shinji
    INTERSPEECH 2024, 2024, : 722 - 726