The Concept Of Narrow Sound Channel Using Binary Time Frequency Masking For Speech Recognition Of Intelligent Service Robots

被引:0
|
作者
Jang, Hyukjoon [1 ]
Song, Jaiyoun [1 ]
Jeong, Hong [2 ]
机构
[1] POSTECH, Dept Info Technol, Pohang, Kyungbuk, South Korea
[2] POSTECH, Dept EEE, Pohang, Kyungbuk, South Korea
关键词
Degenerate Unmixing Estimation Technique; Narrow Sound Channel;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper, we propose a speech recognition system with a narrow sound channel for intelligent service robots in noisy environments. The narrow sound channel is obtained by using time delay between two array microphone inputs used in the Degenerate Unmixing Estimation Technique (DUET). In the proposed system, the voice from a specific direction only passes through the sound channel, while unwanted voices from any other direction are removed. The recognition results showed that the performance of the proposed system, using a stereo microphone is higher than the normal voice recognizer, using a single ultra directional microphone, without this method.
引用
收藏
页码:1325 / +
页数:2
相关论文
共 50 条
  • [41] Multi-Channel Bin-Wise Speech Separation Combining Time-Frequency Masking and Beamforming
    Bella, Mostafa
    Saylani, Hicham
    Hosseini, Shahram
    Deville, Yannick
    IEEE ACCESS, 2023, 11 : 100632 - 100645
  • [42] Robust speaker recognition using binary time-frequency masks
    Shao, Yang
    Wang, DeLiang
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 645 - 648
  • [43] TIME-FREQUENCY MASKING-BASED SPEECH ENHANCEMENT USING GENERATIVE ADVERSARIAL NETWORK
    Soni, Meet H.
    Shah, Neil
    Patil, Hemant A.
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5039 - 5043
  • [44] Weighting Time-Frequency Representation of Speech using Auditory Saliency for Automatic Speech Recognition
    Cong-Thanh Do
    Stylianou, Yannis
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1591 - 1595
  • [45] Real-time control architecture using Xenomai for intelligent service robots in USN environments
    Byoung Wook Choi
    Dong Gwan Shin
    Jeong Ho Park
    Soo Yeong Yi
    Seet Gerald
    Intelligent Service Robotics, 2009, 2 : 139 - 151
  • [46] Real-time control architecture using Xenomai for intelligent service robots in USN environments
    Choi, Byoung Wook
    Shin, Dong Gwan
    Park, Jeong Ho
    Yi, Soo Yeong
    Gerald, Seet
    INTELLIGENT SERVICE ROBOTICS, 2009, 2 (03) : 139 - 151
  • [47] Time-Frequency Masking Based Online Multi-Channel Speech Enhancement With Convolutional Recurrent Neural Networks
    Chakrabarty, Soumitro
    Habets, Emanuel A. P.
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (04) : 787 - 799
  • [48] Time frequency masking based speech enhancement using deep encoder-decoder neural network
    Shi, Wenhua
    Zhang, Xiongwei
    Zou, Xia
    Sun, Meng
    Li, Li
    Shengxue Xuebao/Acta Acustica, 2020, 45 (03): : 299 - 307
  • [49] PHASE TIME-FREQUENCY MASKING BASED SPEECH ENHANCEMENT ALGORITHM USING CIRCULAR MICROPHONE ARRAY
    He, Li
    Zhou, Yi
    Liu, Hongqing
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 808 - 813
  • [50] TIME DIFFERENCE OF ARRIVAL ESTIMATION OF SPEECH SIGNALS USING DEEP NEURAL NETWORKS WITH INTEGRATED TIME-FREQUENCY MASKING
    Pertila, Pasi
    Parviainen, Mikko
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 436 - 440