Robust speech recognition based on independent vector analysis using harmonic frequency dependency

被引:4
|
作者
Jun, Soram [1 ]
Kim, Minook [1 ]
Oh, Myungwoo [1 ]
Park, Hyung-Min [1 ]
机构
[1] Sogang Univ, Dept Elect Engn, Seoul 121742, South Korea
来源
NEURAL COMPUTING & APPLICATIONS | 2013年 / 22卷 / 7-8期
基金
新加坡国家研究基金会;
关键词
Robust speech recognition; Independent vector analysis; Missing feature technique; Blind source separation; BLIND SOURCE SEPARATION; MUSIC;
D O I
10.1007/s00521-012-1002-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes an algorithm that enhances speech by independent vector analysis (IVA) using harmonic frequency dependency for robust speech recognition. While the conventional IVA exploits the full-band uniform dependencies of each source signal, a harmonic clique model is introduced to improve the enhancement performance by modeling strong dependencies among multiples of fundamental frequencies. An IVA-based learning algorithm is derived to consider the non-holonomic constraint and the minimal distortion principle to reduce the unavoidable distortion of IVA, and the minimum power distortionless response beamformer is used as a pre-processing step. In addition, the algorithm compares the log-spectral features of the enhanced speech and observed noisy speech to identify time-frequency segments corrupted by noise and restores those with the cluster-based missing feature reconstruction technique. Experimental results demonstrate that the proposed method enhances recognition performance significantly in noisy environments, especially with competing interference.
引用
收藏
页码:1321 / 1327
页数:7
相关论文
共 50 条
  • [21] A Method For Harmonic Emission Level Assessment Based on Robust Independent Component Analysis
    Chen F.
    Xiao X.
    Wang Y.
    Xiao, Xianyong (xiaoxianyong@163.com), 1600, Power System Technology Press (44): : 3007 - 3013
  • [22] INDEPENDENT VECTOR ANALYSIS ASSISTED ADAPTIVE BEAMFOMRING FOR SPEECH SOURCE SEPARATION WITH AN ACOUSTIC VECTOR SENSOR
    Yang, Yichen
    Wang, Xianrui
    Zhang, Wen
    Chen, Jingdong
    2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
  • [23] Robust Automatic Speech Recognition System Based on Using Adaptive Time-Frequency Masking
    Gouda, Ahmed Mostafa
    Tamazin, Mohamed
    Khedr, Mohamed
    PROCEEDINGS OF 2016 11TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS (ICCES), 2016, : 181 - 186
  • [24] Independent vector analysis using subband and subspace nonlinearity
    Na, Yueyue
    Yu, Jian
    Chai, Bianfang
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2013,
  • [25] Independent vector analysis using subband and subspace nonlinearity
    Yueyue Na
    Jian Yu
    Bianfang Chai
    EURASIP Journal on Advances in Signal Processing, 2013
  • [26] Robust Speech Recognition Using Improved Vector Taylor Series Algorithm for Embedded Systems
    Lue, Yong
    Wu, Haiyang
    Wu, Zhenyang
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2010, 56 (02) : 764 - 769
  • [27] Adaptive Speech Separation Based on Beamforming and Frequency Domain-Independent Component Analysis
    Zhang, Ke
    Wei, Yangjie
    Wu, Dan
    Wang, Yi
    APPLIED SCIENCES-BASEL, 2020, 10 (07):
  • [28] Robust Beam forming for Speech Recognition Using DNN-Based Time-Frequency Masks Estimation
    Jiang, Wenbin
    Wen, Fei
    Liu, Peilin
    IEEE ACCESS, 2018, 6 : 52385 - 52392
  • [29] Isolation of multiple electrocardiogram artifacts using independent vector analysis
    Uddin, Zahoor
    Altaf, Muhammad
    Ahmad, Ayaz
    Qamar, Aamir
    Orakzai, Farooq Alam
    PEERJ COMPUTER SCIENCE, 2023, 9 : 1 - 23
  • [30] Independent Vector Analysis based Convolutive Speech Separation by Estimating Entropy using Recursive Copula Splitting
    Masood, Asim
    Tong, Renjie
    Shakeel, Muhammad
    Ye, Zhongfu
    PROCEEDINGS OF 2020 IEEE 15TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2020), 2020, : 646 - 651