Robust speech recognition based on independent vector analysis using harmonic frequency dependency

被引：4

作者：

Jun, Soram ^{[1
]}

Kim, Minook ^{[1
]}

Oh, Myungwoo ^{[1
]}

Park, Hyung-Min ^{[1
]}

机构：

[1] Sogang Univ, Dept Elect Engn, Seoul 121742, South Korea

来源：

NEURAL COMPUTING & APPLICATIONS | 2013年 / 22卷 / 7-8期

基金：

新加坡国家研究基金会;

关键词：

Robust speech recognition; Independent vector analysis; Missing feature technique; Blind source separation; BLIND SOURCE SEPARATION; MUSIC;

D O I：

10.1007/s00521-012-1002-6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper describes an algorithm that enhances speech by independent vector analysis (IVA) using harmonic frequency dependency for robust speech recognition. While the conventional IVA exploits the full-band uniform dependencies of each source signal, a harmonic clique model is introduced to improve the enhancement performance by modeling strong dependencies among multiples of fundamental frequencies. An IVA-based learning algorithm is derived to consider the non-holonomic constraint and the minimal distortion principle to reduce the unavoidable distortion of IVA, and the minimum power distortionless response beamformer is used as a pre-processing step. In addition, the algorithm compares the log-spectral features of the enhanced speech and observed noisy speech to identify time-frequency segments corrupted by noise and restores those with the cluster-based missing feature reconstruction technique. Experimental results demonstrate that the proposed method enhances recognition performance significantly in noisy environments, especially with competing interference.

引用

页码：1321 / 1327

页数：7

共 50 条

[21] A Method For Harmonic Emission Level Assessment Based on Robust Independent Component Analysis
Chen F.
Xiao X.
Wang Y.
Xiao, Xianyong (xiaoxianyong@163.com), 1600, Power System Technology Press (44): : 3007 - 3013
[22] INDEPENDENT VECTOR ANALYSIS ASSISTED ADAPTIVE BEAMFOMRING FOR SPEECH SOURCE SEPARATION WITH AN ACOUSTIC VECTOR SENSOR
Yang, Yichen
Wang, Xianrui
Zhang, Wen
Chen, Jingdong
2022 INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC 2022), 2022,
[23] Robust Automatic Speech Recognition System Based on Using Adaptive Time-Frequency Masking
Gouda, Ahmed Mostafa
Tamazin, Mohamed
Khedr, Mohamed
PROCEEDINGS OF 2016 11TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS (ICCES), 2016, : 181 - 186
[24] Independent vector analysis using subband and subspace nonlinearity
Na, Yueyue
Yu, Jian
Chai, Bianfang
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2013,
[25] Independent vector analysis using subband and subspace nonlinearity
Yueyue Na
Jian Yu
Bianfang Chai
EURASIP Journal on Advances in Signal Processing, 2013
[26] Robust Speech Recognition Using Improved Vector Taylor Series Algorithm for Embedded Systems
Lue, Yong
Wu, Haiyang
Wu, Zhenyang
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2010, 56 (02) : 764 - 769
[27] Adaptive Speech Separation Based on Beamforming and Frequency Domain-Independent Component Analysis
Zhang, Ke
Wei, Yangjie
Wu, Dan
Wang, Yi
APPLIED SCIENCES-BASEL, 2020, 10 (07):
[28] Robust Beam forming for Speech Recognition Using DNN-Based Time-Frequency Masks Estimation
Jiang, Wenbin
Wen, Fei
Liu, Peilin
IEEE ACCESS, 2018, 6 : 52385 - 52392
[29] Isolation of multiple electrocardiogram artifacts using independent vector analysis
Uddin, Zahoor
Altaf, Muhammad
Ahmad, Ayaz
Qamar, Aamir
Orakzai, Farooq Alam
PEERJ COMPUTER SCIENCE, 2023, 9 : 1 - 23
[30] Independent Vector Analysis based Convolutive Speech Separation by Estimating Entropy using Recursive Copula Splitting
Masood, Asim
Tong, Renjie
Shakeel, Muhammad
Ye, Zhongfu
PROCEEDINGS OF 2020 IEEE 15TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2020), 2020, : 646 - 651

← 1 2 3 4 5 →