Single-channel multiple regression for in-car speech enhancement

被引:1
|
作者
Li, WF [1 ]
Itou, K
Takeda, K
Itakura, F
机构
[1] Nagoya Univ, Grad Sch Engn, Dept Informat Elect, Nagoya, Aichi 4648603, Japan
[2] Nagoya Univ, Grad Sch Engn, Dept Med Sci, Nagoya, Aichi 4648603, Japan
[3] Meijo Univ, Fac Sci & Technol, Nagoya, Aichi 4688502, Japan
关键词
speech enhancement; speech recognition; multi-layer perceptron; mean opinion score; pairwise preference test; environmental adaptation; K-means clustering;
D O I
10.1093/ietisy/e89-d.3.1032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We address issues for improving hands-free speech enhancement and speech recognition performance in different car environments using a single distant microphone. This paper describes a new single-channel in-car speech enhancement method that estimates the log spectra of speech at a close-talking microphone based on the nonlinear regression of the log spectra of noisy signal captured by a distant microphone and the estimated noise. The proposed method provides significant overall quality improvements in our subjective evaluation on the regression-enhanced speech, and performed best in most objective measures. Based on our isolated word recognition experiments conducted under 15 real car environments, the proposed adaptive nonlinear regression approach shows an advantage in average relative word error rate (WER) reductions of 50.8% and 13.1%, respectively, compared to original noisy speech and ETSI advanced front-end (ETSI ES 202 050).
引用
收藏
页码:1032 / 1039
页数:8
相关论文
共 50 条
  • [1] Weak Speech Recovery for Single-Channel Speech Enhancement
    Wong, Arthur
    Ming, Kok
    Low, Siow Yong
    2012 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT AND ADVANCED SYSTEMS (ICIAS), VOLS 1-2, 2012, : 627 - 631
  • [2] Phase Processing for Single-Channel Speech Enhancement
    Gerkmann, Timo
    Krawczyk-Becker, Martin
    Le Roux, Jonathan
    IEEE SIGNAL PROCESSING MAGAZINE, 2015, 32 (02) : 55 - 66
  • [3] Single-Channel Speech Enhancement Techniques for Distant Speech Recognition
    Ashwini, Jaya
    Kumaraswamy, Ramaswamy
    JOURNAL OF INTELLIGENT SYSTEMS, 2013, 22 (02) : 81 - 93
  • [4] Single-channel Speech Enhancement Student under Multi-channel Speech Enhancement Teacher
    Zhang, Yuzhu
    Zhang, Hui
    Zhang, Xueliang
    PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 372 - 377
  • [5] Multiple regression of log spectra for in-car speech recognition using multiple distributed microphones
    Li, WF
    Shinde, T
    Fujimura, H
    Miyajima, C
    Nishino, T
    Itou, K
    Takeda, K
    Itakura, F
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03) : 384 - 390
  • [6] SPEECH INTELLIGIBILITY ENHANCEMENT BY EQUALIZATION FOR IN-CAR APPLICATIONS
    Gentet, Enguerrand
    David, Bertrand
    Denjean, Sebastien
    Richard, Gael
    Roussarie, Vincent
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6934 - 6938
  • [7] Single-Channel Speech Enhancement Based on Psychoacoustic Masking
    Zhou, Tingting
    Zeng, Yumin
    Wang, Rongrong
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2017, 65 (04): : 272 - 284
  • [8] Single-channel speech enhancement by subspace affinity minimization
    Tran, Dung N.
    Koishida, Kazuhito
    INTERSPEECH 2020, 2020, : 2447 - 2451
  • [9] CompNet: Complementary network for single-channel speech enhancement
    Fan, Cunhang
    Zhang, Hongmei
    Li, Andong
    Xiang, Wang
    Zheng, Chengshi
    Lv, Zhao
    Wu, Xiaopei
    NEURAL NETWORKS, 2023, 168 : 508 - 517
  • [10] Single-channel speech enhancement using colored spectrograms
    Gul, Sania
    Khan, Muhammad Salman
    Fazeel, Muhammad
    COMPUTER SPEECH AND LANGUAGE, 2024, 86