Robust speech recognition based on independent vector analysis using harmonic frequency dependency

被引：4

作者：

Jun, Soram ^{[1
]}

Kim, Minook ^{[1
]}

Oh, Myungwoo ^{[1
]}

Park, Hyung-Min ^{[1
]}

机构：

[1] Sogang Univ, Dept Elect Engn, Seoul 121742, South Korea

来源：

NEURAL COMPUTING & APPLICATIONS | 2013年 / 22卷 / 7-8期

基金：

新加坡国家研究基金会;

关键词：

Robust speech recognition; Independent vector analysis; Missing feature technique; Blind source separation; BLIND SOURCE SEPARATION; MUSIC;

D O I：

10.1007/s00521-012-1002-6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper describes an algorithm that enhances speech by independent vector analysis (IVA) using harmonic frequency dependency for robust speech recognition. While the conventional IVA exploits the full-band uniform dependencies of each source signal, a harmonic clique model is introduced to improve the enhancement performance by modeling strong dependencies among multiples of fundamental frequencies. An IVA-based learning algorithm is derived to consider the non-holonomic constraint and the minimal distortion principle to reduce the unavoidable distortion of IVA, and the minimum power distortionless response beamformer is used as a pre-processing step. In addition, the algorithm compares the log-spectral features of the enhanced speech and observed noisy speech to identify time-frequency segments corrupted by noise and restores those with the cluster-based missing feature reconstruction technique. Experimental results demonstrate that the proposed method enhances recognition performance significantly in noisy environments, especially with competing interference.

引用

页码：1321 / 1327

页数：7

共 50 条

[1] Robust speech recognition based on independent vector analysis using harmonic frequency dependency
Soram Jun
Minook Kim
Myungwoo Oh
Hyung-Min Park
Neural Computing and Applications, 2013, 22 : 1321 - 1327
[2] Preprocessing of Independent Vector Analysis Using Feed-Forward Network for Robust Speech Recognition
Oh, Myungwoo
Park, Hyung-Min
NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 366 - 373
[3] Independent vector analysis followed by HMM-based feature enhancement for robust speech recognition
Cho, Ji-Won
Park, Hyung-Min
SIGNAL PROCESSING, 2016, 120 : 200 - 208
[4] Robust Speech Recognition Using a Harmonic Model
许超
曹志刚
TsinghuaScienceandTechnology, 2004, (02) : 202 - 206
[5] Robust speech recognition using harmonic features
Goh, Yeh Huann
Raveendran, Paramesran
Jamuar, Sudhanshu Shekhar
IET SIGNAL PROCESSING, 2014, 8 (02) : 167 - 175
[6] Bayesian feature enhancement using independent vector analysis and reverberation parameter re-estimation for noisy reverberant speech recognition
Cho, Ji-Won
Park, Jong-Hyeon
Chang, Joon-Hyuk
Park, Hyung-Min
COMPUTER SPEECH AND LANGUAGE, 2017, 46 : 496 - 516
[7] Audiovisual Speech Separation Based on Independent Vector Analysis Using a Visual Voice Activity Detector
Narvor, Pierre
Rivet, Bertrand
Jutten, Christian
LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2017), 2017, 10169 : 247 - 257
[8] Efficient online target speech extraction using DOA-constrained independent component analysis of stereo data for robust speech recognition
Kim, Minook
Park, Hyung-Min
SIGNAL PROCESSING, 2015, 117 : 126 - 137
[9] Auxiliary Function Based Independent Vector Analysis with Spatial Initialization for Frequency Domain Speech Separation
Chen, Songbo
Zhao, Yuxin
Liang, Yanfeng
2014 SEVENTH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL SCIENCES AND OPTIMIZATION (CSO), 2014, : 185 - 189
[10] Independent vector analysis for real world speech processing
Lee, Intae
Lee, Te-Won
INDEPENDENT COMPONENT ANALYSES, WAVELETS, UNSUPERVISED NANO-BIOMIMETIC SENSORS, AND NEURAL NETWORKS V, 2007, 6576

← 1 2 3 4 5 →