Robust speech recognition based on independent vector analysis using harmonic frequency dependency

被引:4
|
作者
Jun, Soram [1 ]
Kim, Minook [1 ]
Oh, Myungwoo [1 ]
Park, Hyung-Min [1 ]
机构
[1] Sogang Univ, Dept Elect Engn, Seoul 121742, South Korea
基金
新加坡国家研究基金会;
关键词
Robust speech recognition; Independent vector analysis; Missing feature technique; Blind source separation; BLIND SOURCE SEPARATION; MUSIC;
D O I
10.1007/s00521-012-1002-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes an algorithm that enhances speech by independent vector analysis (IVA) using harmonic frequency dependency for robust speech recognition. While the conventional IVA exploits the full-band uniform dependencies of each source signal, a harmonic clique model is introduced to improve the enhancement performance by modeling strong dependencies among multiples of fundamental frequencies. An IVA-based learning algorithm is derived to consider the non-holonomic constraint and the minimal distortion principle to reduce the unavoidable distortion of IVA, and the minimum power distortionless response beamformer is used as a pre-processing step. In addition, the algorithm compares the log-spectral features of the enhanced speech and observed noisy speech to identify time-frequency segments corrupted by noise and restores those with the cluster-based missing feature reconstruction technique. Experimental results demonstrate that the proposed method enhances recognition performance significantly in noisy environments, especially with competing interference.
引用
收藏
页码:1321 / 1327
页数:7
相关论文
共 50 条
  • [31] Robust speech recognition using the modulation spectrogram
    Kingsbury, BED
    Morgan, N
    Greenberg, S
    SPEECH COMMUNICATION, 1998, 25 (1-3) : 117 - 132
  • [32] Determined BSS Based on Time-Frequency Masking and Its Application to Harmonic Vector Analysis
    Yatabe, Kohei
    Kitamura, Daichi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1609 - 1625
  • [33] TIME-FREQUENCY CONVOLUTIONAL NETWORKS FOR ROBUST SPEECH RECOGNITION
    Mitra, Vikramjit
    Franco, Horacio
    2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 317 - 323
  • [34] Independent Vector Analysis with Frequency Range Division and Prior Switching
    Ikeshita, Rintaro
    Kawaguchi, Yohei
    Togami, Masahito
    Fujita, Yusuke
    Nagamatsu, Kenji
    2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 2329 - 2333
  • [35] Efficient Overdetermined Independent Vector Analysis Based on Iterative Projection with Adjustment
    Guo, Ruiming
    Luo, Zhongqiang
    Wang, Ling
    Feng, Li
    ELECTRONICS, 2023, 12 (14)
  • [36] Signal Separation for Robust Speech Recognition Based on Phase Difference Information Obtained in the Frequency Domain
    Kim, Chanwoo
    Kumar, Kshitiz
    Raj, Bhiksha
    Stern, Richard M.
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2479 - +
  • [37] Flow-Based Independent Vector Analysis for Blind Source Separation
    Nugraha, Aditya Arie
    Sekiguchi, Kouhei
    Fontaine, Mathieu
    Bando, Yoshiaki
    Yoshii, Kazuyoshi
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 (27) : 2173 - 2177
  • [38] Harmonic Separation Based on Independent Component Analysis Method
    Ai, Yongle
    Zhang, Haiyang
    JOURNAL OF COMPUTERS, 2013, 8 (02) : 433 - 440
  • [39] Deep Neural Network Based Speech Separation for Robust Speech Recognition
    Tu Yanhui
    Jun, Du
    Xu Yong
    Dai Lirong
    Chin-Hui, Lee
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 532 - 536
  • [40] ROBUST SPEECH RECOGNITION USING GENERATIVE ADVERSARIAL NETWORKS
    Sriram, Anuroop
    Jun, Heewoo
    Gaur, Yashesh
    Satheesh, Sanjeev
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5639 - 5643