Robust speech recognition based on independent vector analysis using harmonic frequency dependency

被引：4

作者：

Jun, Soram ^{[1
]}

Kim, Minook ^{[1
]}

Oh, Myungwoo ^{[1
]}

Park, Hyung-Min ^{[1
]}

机构：

[1] Sogang Univ, Dept Elect Engn, Seoul 121742, South Korea

来源：

NEURAL COMPUTING & APPLICATIONS | 2013年 / 22卷 / 7-8期

基金：

新加坡国家研究基金会;

关键词：

Robust speech recognition; Independent vector analysis; Missing feature technique; Blind source separation; BLIND SOURCE SEPARATION; MUSIC;

D O I：

10.1007/s00521-012-1002-6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper describes an algorithm that enhances speech by independent vector analysis (IVA) using harmonic frequency dependency for robust speech recognition. While the conventional IVA exploits the full-band uniform dependencies of each source signal, a harmonic clique model is introduced to improve the enhancement performance by modeling strong dependencies among multiples of fundamental frequencies. An IVA-based learning algorithm is derived to consider the non-holonomic constraint and the minimal distortion principle to reduce the unavoidable distortion of IVA, and the minimum power distortionless response beamformer is used as a pre-processing step. In addition, the algorithm compares the log-spectral features of the enhanced speech and observed noisy speech to identify time-frequency segments corrupted by noise and restores those with the cluster-based missing feature reconstruction technique. Experimental results demonstrate that the proposed method enhances recognition performance significantly in noisy environments, especially with competing interference.

引用

页码：1321 / 1327

页数：7

共 50 条

[31] Robust speech recognition using the modulation spectrogram
Kingsbury, BED
Morgan, N
Greenberg, S
SPEECH COMMUNICATION, 1998, 25 (1-3) : 117 - 132
[32] Determined BSS Based on Time-Frequency Masking and Its Application to Harmonic Vector Analysis
Yatabe, Kohei
Kitamura, Daichi
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 1609 - 1625
[33] TIME-FREQUENCY CONVOLUTIONAL NETWORKS FOR ROBUST SPEECH RECOGNITION
Mitra, Vikramjit
Franco, Horacio
2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 317 - 323
[34] Independent Vector Analysis with Frequency Range Division and Prior Switching
Ikeshita, Rintaro
Kawaguchi, Yohei
Togami, Masahito
Fujita, Yusuke
Nagamatsu, Kenji
2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 2329 - 2333
[35] Efficient Overdetermined Independent Vector Analysis Based on Iterative Projection with Adjustment
Guo, Ruiming
Luo, Zhongqiang
Wang, Ling
Feng, Li
ELECTRONICS, 2023, 12 (14)
[36] Signal Separation for Robust Speech Recognition Based on Phase Difference Information Obtained in the Frequency Domain
Kim, Chanwoo
Kumar, Kshitiz
Raj, Bhiksha
Stern, Richard M.
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2479 - +
[37] Flow-Based Independent Vector Analysis for Blind Source Separation
Nugraha, Aditya Arie
Sekiguchi, Kouhei
Fontaine, Mathieu
Bando, Yoshiaki
Yoshii, Kazuyoshi
IEEE SIGNAL PROCESSING LETTERS, 2020, 27 (27) : 2173 - 2177
[38] Harmonic Separation Based on Independent Component Analysis Method
Ai, Yongle
Zhang, Haiyang
JOURNAL OF COMPUTERS, 2013, 8 (02) : 433 - 440
[39] Deep Neural Network Based Speech Separation for Robust Speech Recognition
Tu Yanhui
Jun, Du
Xu Yong
Dai Lirong
Chin-Hui, Lee
2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 532 - 536
[40] ROBUST SPEECH RECOGNITION USING GENERATIVE ADVERSARIAL NETWORKS
Sriram, Anuroop
Jun, Heewoo
Gaur, Yashesh
Satheesh, Sanjeev
2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5639 - 5643

← 1 2 3 4 5 →