Processing of speech signals using a microphone array for intelligent robots

Cited by: 3
Authors
Hu, I [1]
Cheng, CC [1]
Liu, WH [1]
Affiliations
[1] Natl Chiao Tung Univ, Dept Elect & Control Engn, Hsinchu, Taiwan
Keywords
beamforming; beamformer; DOA; microphone array; robot hearing; speech enhancement
DOI
10.1243/095965105X9461
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
For intelligent robots to interact with people, an efficient human-robot communication interface, such as voice command, is very important. However, recognizing voice commands or speech is only one part of speech communication. First, the physics of speech signals carries other information, such as the direction of the speaker. Second, a basic element of processing the speech signal is recognition at the acoustic level, and recognition performance depends greatly on the quality of reception: in a noisy environment, the success rate can be very poor. It is therefore important, prior to speech recognition, to process the speech signals so that the needed content is extracted while other components, such as background noise, are rejected. This paper presents a speech purification system for robots that improves the signal-to-noise ratio of reception, together with an algorithm that uses a multidirection calibration beamformer.
Pages: 133-143
Number of pages: 11
Related papers
50 in total
[31]   Digital Hearing Aids Speech Enhancement Based on Microphone Array [J].
Wang, Chong .
2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING APPLICATIONS (CSEA 2015), 2015, :546-550
[32]   Detection and Separation of Speech Events in Meeting Recordings Using a Microphone Array [J].
Futoshi Asano ;
Kiyoshi Yamamoto ;
Jun Ogata ;
Miichi Yamada ;
Masami Nakamura .
EURASIP Journal on Audio, Speech, and Music Processing, 2007
[33]   Adaptive microphone array employing calibration signals: An analytical evaluation [J].
Nordholm, S ;
Claesson, I ;
Dahl, M .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (03) :241-252
[34]   An Optimum Microphone Array Post-Filter for Speech Applications [J].
Lefkimmiatis, Stamatis ;
Dimitriadis, Dimitrios ;
Maragos, Petros .
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, :2142-2145
[35]   A Microphone Array Beamformer for the Performance Enhancement of Speech Recognizer in Car [J].
Han, Chul-Hee ;
Kang, Hong-Goo ;
Hwang, Youngsoo ;
Youn, Dae-Hee .
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2005, 24 (07) :423-430
[36]   A Python framework for microphone array data processing [J].
Sarradj, Ennes ;
Herold, Gert .
APPLIED ACOUSTICS, 2017, 116 :50-58
[37]   Performance of an HMM speech recognizer using a real-time tracking microphone array as input [J].
Hughes, TB ;
Kim, HS ;
DiBiase, JH ;
Silverman, HF .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (03) :346-349
[38]   Feature Mapping of Multiple Beamformed Sources for Robust Overlapping Speech Recognition Using a Microphone Array [J].
Li, Weifeng ;
Wang, Longbiao ;
Zhou, Yicong ;
Dines, John ;
Magimai-Doss, Mathew ;
Bourlard, Herve ;
Liao, Qingmin .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (12) :2244-2255
[39]   Dealing with uncertainty in microphone placement in a microphone array speech recognition system [J].
Himawan, Ivan ;
Sridharan, Sridha ;
McCowan, Iain .
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :1565-+
[40]   MAPL - Microphone Array Processing Library [J].
Hilovsky, Martin ;
Gressak, Jozef ;
Lojka, Martin ;
Juhar, Jozef .
PROCEEDINGS OF ELMAR 2016 - 58TH INTERNATIONAL SYMPOSIUM ELMAR 2016, 2016, :27-30