The Concept Of Narrow Sound Channel Using Binary Time Frequency Masking For Speech Recognition Of Intelligent Service Robots

被引：0

作者：

Jang, Hyukjoon ^{[1
]}

Song, Jaiyoun ^{[1
]}

Jeong, Hong ^{[2
]}

机构：

[1] POSTECH, Dept Info Technol, Pohang, Kyungbuk, South Korea

[2] POSTECH, Dept EEE, Pohang, Kyungbuk, South Korea

来源：

11TH INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY, VOLS I-III, PROCEEDINGS,: UBIQUITOUS ICT CONVERGENCE MAKES LIFE BETTER! | 2009年

关键词：

Degenerate Unmixing Estimation Technique; Narrow Sound Channel;

D O I：

暂无

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

In this paper, we propose a speech recognition system with a narrow sound channel for intelligent service robots in noisy environments. The narrow sound channel is obtained by using time delay between two array microphone inputs used in the Degenerate Unmixing Estimation Technique (DUET). In the proposed system, the voice from a specific direction only passes through the sound channel, while unwanted voices from any other direction are removed. The recognition results showed that the performance of the proposed system, using a stereo microphone is higher than the normal voice recognizer, using a single ultra directional microphone, without this method.

引用

页码：1325 / +

页数：2

共 50 条

[41] Multi-Channel Bin-Wise Speech Separation Combining Time-Frequency Masking and Beamforming
Bella, Mostafa
Saylani, Hicham
Hosseini, Shahram
Deville, Yannick
IEEE ACCESS, 2023, 11 : 100632 - 100645
[42] Robust speaker recognition using binary time-frequency masks
Shao, Yang
Wang, DeLiang
2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 645 - 648
[43] TIME-FREQUENCY MASKING-BASED SPEECH ENHANCEMENT USING GENERATIVE ADVERSARIAL NETWORK
Soni, Meet H.
Shah, Neil
Patil, Hemant A.
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5039 - 5043
[44] Weighting Time-Frequency Representation of Speech using Auditory Saliency for Automatic Speech Recognition
Cong-Thanh Do
Stylianou, Yannis
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1591 - 1595
[45] Real-time control architecture using Xenomai for intelligent service robots in USN environments
Byoung Wook Choi
Dong Gwan Shin
Jeong Ho Park
Soo Yeong Yi
Seet Gerald
Intelligent Service Robotics, 2009, 2 : 139 - 151
[46] Real-time control architecture using Xenomai for intelligent service robots in USN environments
Choi, Byoung Wook
Shin, Dong Gwan
Park, Jeong Ho
Yi, Soo Yeong
Gerald, Seet
INTELLIGENT SERVICE ROBOTICS, 2009, 2 (03) : 139 - 151
[47] Time-Frequency Masking Based Online Multi-Channel Speech Enhancement With Convolutional Recurrent Neural Networks
Chakrabarty, Soumitro
Habets, Emanuel A. P.
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (04) : 787 - 799
[48] Time frequency masking based speech enhancement using deep encoder-decoder neural network
Shi, Wenhua
Zhang, Xiongwei
Zou, Xia
Sun, Meng
Li, Li
Shengxue Xuebao/Acta Acustica, 2020, 45 (03): : 299 - 307
[49] PHASE TIME-FREQUENCY MASKING BASED SPEECH ENHANCEMENT ALGORITHM USING CIRCULAR MICROPHONE ARRAY
He, Li
Zhou, Yi
Liu, Hongqing
2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 808 - 813
[50] TIME DIFFERENCE OF ARRIVAL ESTIMATION OF SPEECH SIGNALS USING DEEP NEURAL NETWORKS WITH INTEGRATED TIME-FREQUENCY MASKING
Pertila, Pasi
Parviainen, Mikko
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 436 - 440

← 1 2 3 4 5 →