Combined architecture of adaptive beamforming and blind source separation for speech recognition of intelligent service robots

被引：0

作者：

Woo, Sungmin ^{[1
]}

Lee, Sanghoon ^{[1
]}

Jeong, Hong ^{[1
]}

机构：

[1] Pohang Univ Sci & Technol, Dept Elect & Elect Engn, Pohang 790784, South Korea

来源：

2007 INTERNATIONAL CONFERENCE ON INTELLIGENT PERVASIVE COMPUTING, PROCEEDINGS | 2007年

关键词：

D O I：

10.1109/IPC.2007.72

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Successful speech recognition in noisy environments for intelligent robots depends on the performance of preprocessing elements employed. Even though acoustic signals are often corrupted in the high noise level environment, speech recognition systems such as the widely-used HTK do not deal with signal distortion problems. We propose an architecture that effectively combines adaptive beamforming (ABF) and blind source separation (BSS) algorithms in the spatial domain. To avoid permutation ambiguity and heavy computational complexity in the BSS system, the adaptive generalized sidelobe canceller is employed in front of the BSS system. We slightly modified the conventional convolutive mixture model of the BSS for fast processing in hardware implementations. Unlike the conventional BSS, this does not suffer from permutation ambiguity since the target angle of the front-line beamformer is fixed so it always provides enhanced and reference noise signals to the predefined two inputs of the BSS. The proposed system also reduces heavy computations in the BSS when the BSS have more than two inputs. The proposed time domain approach can be easily implemented into hardware in real-time. We evaluated the structure and assessed its performance with a DSP module. The experimental results of speech recognition test show that the proposed combined system guarantees high speech recognition rate in the noisy environment and better performance than the ABF and BSS system.

引用

页码：214 / 219

页数：6

共 50 条

[1] Supervised speech separation combined with adaptive beamforming
Saric, Zoran
Subotic, Misko
Bilibajkic, Ruzica
Barjaktarovic, Marko
Stojanovic, Jasmina
COMPUTER SPEECH AND LANGUAGE, 2022, 76
[2] Adaptive blind source separation with HRTFs beamforming preprocessing
Maazaoui, Mounira
Abed-Meraim, Karim
Grenier, Yves
2012 IEEE 7TH SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP (SAM), 2012, : 269 - 272
[3] Improvement of Speech Recognition for Robots Using Blind Signal Separation
Bicher, Daniel
Kroll-Peters, Olaf
Lee, Thebin
Tiotuico, Natascha
Wilhelm, Mathias
ISCGAV'08: PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMPUTATIONAL GEOMETRY AND ARTIFICIAL VISION, 2008, : 52 - 55
[4] Blind Adaptive Principal Eigenvector Beamforming for Acoustical Source Separation
Warsitz, Ernst
Haeb-Umbach, Reinhold
Vu, Dang Hai Tran
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 461 - 464
[5] Detection in present of reverberation Combined with Blind Source Separation and Beamforming
Xu, Ce
Zhang, Xinhua
Xu, Zhaoyan
2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 4, 2010, : 158 - 162
[6] BLIND SPEECH SEPARATION FOR ROBOTS WITH INTELLIGENT HUMAN-MACHINE INTERACTION
Huang Yulei Ding Zhizhong Dai Lirong* Chen Xiaoping* (Department of Communication Engineering
Journal of Electronics(China), 2012, (Z2) : 286 - 293
[7] BLIND SPEECH SEPARATION FOR ROBOTS WITH INTELLIGENT HUMAN-MACHINE INTERACTION
Huang Yulei Ding Zhizhong Dai Lirong Chen Xiaoping Department of Communication Engineering Hefei University of Technology Hefei China Department of Electronic Engineering and Information Science University of Science and Technology of China Hefei China
JournalofElectronics(China), 2012, 29(Z2) (China) : 286 - 293
[8] Adaptive Blind Beamforming for Intelligent Surface
Lai, Wenhai
Wang, Wenyu
Xu, Fan
Li, Xin
Niu, Shaobo
Shen, Kaiming
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (02) : 907 - 923
[9] Research on blind source separation and blind beamforming
Zhao, B
Yang, JA
Zhang, M
Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 4389 - 4393
[10] Neural Blind Source Separation and Diarization for Distant Speech Recognition
Bando, Yoshiaki
Nakamura, Tomohiko
Watanabe, Shinji
INTERSPEECH 2024, 2024, : 722 - 726

← 1 2 3 4 5 →