A Robust Algorithm for Word Boundary Detection in the Presence of Noise
被引:99
作者:
Junqua, Jean-Claude
论文数: 0引用数: 0
h-index: 0
机构:
Speech Technol Lab, Santa Barbara, CA 93105 USASpeech Technol Lab, Santa Barbara, CA 93105 USA
Junqua, Jean-Claude
[1
]
Mak, Brian
论文数: 0引用数: 0
h-index: 0
机构:Speech Technol Lab, Santa Barbara, CA 93105 USA
Mak, Brian
Reaves, Ben
论文数: 0引用数: 0
h-index: 0
机构:
Cent Res Labs, Moriguchi, Osaka 570, JapanSpeech Technol Lab, Santa Barbara, CA 93105 USA
Reaves, Ben
[2
]
机构:
[1] Speech Technol Lab, Santa Barbara, CA 93105 USA
[2] Cent Res Labs, Moriguchi, Osaka 570, Japan
来源:
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING
|
1994年
/
2卷
/
03期
关键词:
D O I:
10.1109/89.294354
中图分类号:
O42 [声学];
学科分类号:
070206 ;
082403 ;
摘要:
We address the problem of automatic word boundary detection in quiet and in the presence of noise. Attention has been given to automatic word boundary detection for both additive noise and noise-induced changes in the talker's speech production (Lombard reflex). After a comparison of several automatic word boundary detection algorithms in different noisy-Lombard conditions, we propose a new algorithm that is robust in the presence of noise. This new algorithm identifies islands of reliability (essentially the portion of speech contained between the first and the last vowel) using time and frequency-based features and then, after a noise classification, applies a noise adaptive procedure to refine the boundaries. It is shown that this new algorithm outperforms the commonly used algorithm developed by Lamel et al. and several other recently developed methods. We evaluated the average recognition error rate due to word boundary detection in an HMM-based recognition system across several signal-to-noise ratios and noise conditions. The recognition error rate decreased to about 20% compared to an average of approximately 50% obtained with a modified version of the Lamel er al. algorithm.