Robust speech recognition based on independent vector analysis using harmonic frequency dependency

被引:4
|
作者
Jun, Soram [1 ]
Kim, Minook [1 ]
Oh, Myungwoo [1 ]
Park, Hyung-Min [1 ]
机构
[1] Sogang Univ, Dept Elect Engn, Seoul 121742, South Korea
来源
NEURAL COMPUTING & APPLICATIONS | 2013年 / 22卷 / 7-8期
基金
新加坡国家研究基金会;
关键词
Robust speech recognition; Independent vector analysis; Missing feature technique; Blind source separation; BLIND SOURCE SEPARATION; MUSIC;
D O I
10.1007/s00521-012-1002-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes an algorithm that enhances speech by independent vector analysis (IVA) using harmonic frequency dependency for robust speech recognition. While the conventional IVA exploits the full-band uniform dependencies of each source signal, a harmonic clique model is introduced to improve the enhancement performance by modeling strong dependencies among multiples of fundamental frequencies. An IVA-based learning algorithm is derived to consider the non-holonomic constraint and the minimal distortion principle to reduce the unavoidable distortion of IVA, and the minimum power distortionless response beamformer is used as a pre-processing step. In addition, the algorithm compares the log-spectral features of the enhanced speech and observed noisy speech to identify time-frequency segments corrupted by noise and restores those with the cluster-based missing feature reconstruction technique. Experimental results demonstrate that the proposed method enhances recognition performance significantly in noisy environments, especially with competing interference.
引用
收藏
页码:1321 / 1327
页数:7
相关论文
共 50 条
  • [1] Robust speech recognition based on independent vector analysis using harmonic frequency dependency
    Soram Jun
    Minook Kim
    Myungwoo Oh
    Hyung-Min Park
    Neural Computing and Applications, 2013, 22 : 1321 - 1327
  • [2] Preprocessing of Independent Vector Analysis Using Feed-Forward Network for Robust Speech Recognition
    Oh, Myungwoo
    Park, Hyung-Min
    NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 366 - 373
  • [3] Independent vector analysis followed by HMM-based feature enhancement for robust speech recognition
    Cho, Ji-Won
    Park, Hyung-Min
    SIGNAL PROCESSING, 2016, 120 : 200 - 208
  • [4] Robust Speech Recognition Using a Harmonic Model
    许超
    曹志刚
    TsinghuaScienceandTechnology, 2004, (02) : 202 - 206
  • [5] Robust speech recognition using harmonic features
    Goh, Yeh Huann
    Raveendran, Paramesran
    Jamuar, Sudhanshu Shekhar
    IET SIGNAL PROCESSING, 2014, 8 (02) : 167 - 175
  • [6] Bayesian feature enhancement using independent vector analysis and reverberation parameter re-estimation for noisy reverberant speech recognition
    Cho, Ji-Won
    Park, Jong-Hyeon
    Chang, Joon-Hyuk
    Park, Hyung-Min
    COMPUTER SPEECH AND LANGUAGE, 2017, 46 : 496 - 516
  • [7] Audiovisual Speech Separation Based on Independent Vector Analysis Using a Visual Voice Activity Detector
    Narvor, Pierre
    Rivet, Bertrand
    Jutten, Christian
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2017), 2017, 10169 : 247 - 257
  • [8] Efficient online target speech extraction using DOA-constrained independent component analysis of stereo data for robust speech recognition
    Kim, Minook
    Park, Hyung-Min
    SIGNAL PROCESSING, 2015, 117 : 126 - 137
  • [9] Auxiliary Function Based Independent Vector Analysis with Spatial Initialization for Frequency Domain Speech Separation
    Chen, Songbo
    Zhao, Yuxin
    Liang, Yanfeng
    2014 SEVENTH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL SCIENCES AND OPTIMIZATION (CSO), 2014, : 185 - 189
  • [10] Independent vector analysis for real world speech processing
    Lee, Intae
    Lee, Te-Won
    INDEPENDENT COMPONENT ANALYSES, WAVELETS, UNSUPERVISED NANO-BIOMIMETIC SENSORS, AND NEURAL NETWORKS V, 2007, 6576