A multi-source knowledge fusion strategy to improve confidence measure in a lattice-based spoken term detection system

被引:0
作者
Gao, Xinglong [1 ]
Pan, Jielin [1 ]
Yan, Yonghong [1 ]
机构
[1] The Key Laboratory of Speech Acoustics and Content Understanding, Institute of Acoustics, Chinese Academy of Sciences, Beijing
来源
Journal of Information and Computational Science | 2014年 / 11卷 / 11期
基金
中国国家自然科学基金;
关键词
Confidence measure; Detecting precision; Multi-source knowledge fusion; Spoken term detection;
D O I
10.12733/jics20104225
中图分类号
学科分类号
摘要
This paper mainly concerns the problem of Confidence measure estimation for Spoken Term Detection (STD). The detecting precision is always a main obstacle to make STD system to be applicable in real-world. In this context, a multi-source knowledge fusion strategy was proposed to improve the qualification of Confidence measure of detected spoken term which is mainly estimated by posterior probability before. For lattice based STD system, a collection of optimal predictive information of detected term is extracted, and the hidden-units Conditional Random Fields (hidden-units CRFs) is adopted to combine these information into a normalized conditional probability to stand for an alternative score of detected term. More precisely, the discriminative ability of multi-source knowledge fusion based Confidence measure is proved to be superior to the posterior based Confidence measure first. And then, the new proposed Confidence measure was combined with the posterior to improve detecting precision and decrease False Alarm rate (FA) in a lattice-based STD system for Conversional Telephone Speech (CTS). Experimental results show that the new proposed Confidence measure has strong complimentary effect and improve the detecting precision about 17% over the baseline system in high precision area. ©2014 Binary Information Press
引用
收藏
页码:3783 / 3792
页数:9
相关论文
empty
未找到相关数据