Binaural Speech Enhancement based on DNN for the Application of Virtual Reality

Cited by: 0
Authors
Wang, Jin [1 ]
Wang, Jing [1 ]
Liu, Ming [1 ]
Yan, Zhaoyu [1 ]
Affiliations
[1] Beijing Inst Technol, Sch Informat & Elect, Beijing, Peoples R China
Source
PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) | 2018
Keywords
binaural speech enhancement; deep neural network; virtual reality; log-power spectra; NOISE;
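DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification code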
0808 ; 0809 ;
Abstract
Binaural sound conveys a sense of direction and can therefore make virtual reality scenes more immersive, but binaural recordings made in the real world may be corrupted by noise. Some existing binaural speech enhancement or separation methods provide only a single-channel output, which loses the sense of direction. Others provide a dual-channel output but suffer a performance loss when the clean binaural speech and the binaural noise come from the same direction. In this paper, we propose a binaural speech enhancement method based on a deep neural network (DNN) that targets the case in which the clean binaural speech and the binaural noise come from the same direction. The network maps features of the noisy binaural speech to labels derived from the clean binaural speech, which yields a dual-channel output. In addition, a batch normalization layer is introduced to further improve performance. Compared with the baseline methods, the proposed method achieves better speech quality and intelligibility and better preserves the sense of direction in the estimated binaural speech.
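The mapping described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The frame length, hop size, 16 kHz sampling rate, layer sizes, ReLU activation, and random untrained weights are all assumptions chosen for illustration, and the helper names (log_power_spectra, batch_norm, forward) are hypothetical. The sketch only shows the data flow: log-power spectra of the left and right noisy channels are concatenated per frame, passed through a feed-forward network with batch normalization, and the output is split back into two channels.

```python
import numpy as np

def log_power_spectra(signal, frame_len=512, hop=256, eps=1e-10):
    """Frame a 1-D signal, apply a Hann window, and return log-power spectra."""
    n_frames = 1 + (len(signal) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack([signal[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    spectrum = np.fft.rfft(frames, axis=1)
    # Result has shape (n_frames, frame_len // 2 + 1).
    return np.log(np.abs(spectrum) ** 2 + eps)

def batch_norm(x, eps=1e-5):
    """Normalize each feature over the batch (batch statistics only, no learned scale/shift)."""
    return (x - x.mean(axis=0)) / np.sqrt(x.var(axis=0) + eps)

def forward(features, weights):
    """Feed-forward pass: linear -> batch norm -> ReLU per hidden layer, then a linear output."""
    h = features
    for w, b in weights[:-1]:
        h = np.maximum(batch_norm(h @ w + b), 0.0)
    w, b = weights[-1]
    return h @ w + b

rng = np.random.default_rng(0)

# Placeholder inputs: one second of random noise per ear at an assumed 16 kHz.
noisy_left = rng.standard_normal(16000)
noisy_right = rng.standard_normal(16000)

# Input features: left and right log-power spectra concatenated per frame.
features = np.concatenate([log_power_spectra(noisy_left),
                           log_power_spectra(noisy_right)], axis=1)
n_bins = features.shape[1] // 2

# Untrained random weights; the layer sizes are assumptions, not the paper's.
sizes = [2 * n_bins, 256, 256, 2 * n_bins]
weights = [(rng.standard_normal((m, n)) * 0.01, np.zeros(n))
           for m, n in zip(sizes[:-1], sizes[1:])]

# Dual-channel output: split the estimate back into left and right spectra.
estimate = forward(features, weights)
est_left, est_right = estimate[:, :n_bins], estimate[:, n_bins:]
```

In a trained system the weights would be fitted so that the output approximates the log-power spectra of the clean left and right channels, for example by minimizing a mean squared error. Estimating both channels in one network is what keeps the output dual-channel, which the abstract ties to preserving the sense of direction.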
Pages: 629-633
Page count: 5