Speech enhancement method based on feature compensation gain for effective speech recognition in noisy environments

被引：0

作者：

Bae, Ara ^{[1
]}

Kim, Wooil ^{[1
]}

机构：

[1] Incheon Natl Univ, Dept Comp Sci & Engn, 119 Acad Ro, Incheon 22012, South Korea

来源：

JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA | 2019年 / 38卷 / 01期

关键词：

Speech enhancement; Feature compensation gain; Variational model composition; Speech recognition; Noisy environment;

D O I：

10.7776/ASK.2019.38.1.051

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper proposes a speech enhancement method utilizing the feature compensation gain for robust speech recognition performances in noisy environments. In this paper we propose a speech enhancement method utilizing the feature compensation gain which is obtained from the PCGMM (Parallel Combined Gaussian Mixture Model)-based feature compensation method employing variational model composition. The experimental results show that the proposed method significantly outperforms the conventional front-end algorithms and our previous research over various background noise types and SNR (Signal to Noise Ratio) conditions in mismatched ASR (Automatic Speech Recognition) system condition. The computation complexity is significantly reduced by employing the noise model selection technique with maintaining the speech recognition performance at a similar level.

引用

页码：51 / 55

页数：5

共 50 条

[31] Compensation of speech enhancement distortion for robust speech recognition
Ding, P
Cao, ZG
2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 449 - 452
[32] Speech intelligibility improvement in noisy reverberant environments based on speech enhancement and inverse filtering
Dong, Huan-Yu
Lee, Chang-Myung
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2018,
[33] Speech intelligibility improvement in noisy reverberant environments based on speech enhancement and inverse filtering
Huan-Yu Dong
Chang-Myung Lee
EURASIP Journal on Audio, Speech, and Music Processing, 2018
[34] Speech Enhancement Based on Masking Approach Considering Speech Quality and Acoustic Confidence for Noisy Speech Recognition
Chu, Shih-Chuan
Wu, Chung-Hsien
Lin, Yun-Wen
2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 536 - 540
[35] Enhancement of Reverberant Speech in Noisy Acoustical Environments
Joorabchi, Marjan
Ghorshi, Seyed
Sarafnia, Ali
2014 SIXTH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2014,
[36] SPEECH RECOGNITION WITH NO SPEECH OR WITH NOISY SPEECH
Krishna, Gautam
Co Tran
Yu, Jianguo
Tewfik, Ahmed H.
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1090 - 1094
[37] A study on combination of loss functions for effective mask-based speech enhancement in noisy environments
Jung, Jaehee
Kim, Wooil
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (03): : 234 - 240
[38] Cepstrum-domain acoustic feature compensation based on decomposition of speech and noise for ASR in noisy environments
Kim, HK
Rose, RC
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): : 435 - 446
[39] Multisensory benefits for speech recognition in noisy environments
Oh, Yonghee
Schwalm, Meg
Kalpin, Nicole
FRONTIERS IN NEUROSCIENCE, 2022, 16
[40] Speech Emotion Recognition in Noisy and Reverberant Environments
Heracleous, Panikos
Yasuda, Keiji
Sugaya, Fumiaki
Yoneyama, Akio
Hashimoto, Masayuki
2017 SEVENTH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2017, : 262 - 266

← 1 2 3 4 5 →