PHONETIC NAME MATCHING FOR CROSS-LINGUAL SPOKEN SENTENCE RETRIEVAL

Cited by: 1

Authors:

Ji, Heng [1]

Grishman, Ralph [2]

Wang, Wen [3]

Affiliations:

[1] CUNY, New York, NY 10021 USA

[2] NYU, New York, NY USA

[3] SRI Int, Menlo Pk, CA USA

Source:

2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS | 2008

Funding:

U.S. National Science Foundation (NSF);

Keywords:

Speech Recognition; Information Retrieval;

DOI:

10.1109/SLT.2008.4777895

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Cross-lingual Spoken Sentence Retrieval (CLSSR) remains a challenge, especially for queries that include out-of-vocabulary (OOV) words such as person names. This paper proposes a simple method of fuzzy matching between query names and the phones of candidate audio segments. This approach has the advantage of avoiding some word decoding errors in Automatic Speech Recognition (ASR). Experiments on Mandarin-English CLSSR show that phone-based searching and conventional translation-based searching are complementary. Adding phone matching achieved a 26.29% improvement in F-measure over searching on state-of-the-art Machine Translation (MT) output and an 8.83% improvement over searching on Entity Translation (ET) output.

Pages: 281 / +

Page count: 2