PHONETIC NAME MATCHING FOR CROSS-LINGUAL SPOKEN SENTENCE RETRIEVAL

Cited by: 1
Authors
Ji, Heng [1]
Grishman, Ralph [2]
Wang, Wen [3]
Affiliations
[1] CUNY, New York, NY 10021 USA
[2] NYU, New York, NY USA
[3] SRI Int, Menlo Pk, CA USA
Source
2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS | 2008
Funding
National Science Foundation (USA);
Keywords
Speech Recognition; Information Retrieval;
DOI
10.1109/SLT.2008.4777895
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Cross-lingual Spoken Sentence Retrieval (CLSSR) remains a challenge, especially for queries that include out-of-vocabulary (OOV) words such as person names. This paper proposes a simple method of fuzzy matching between query names and the phones of candidate audio segments. This approach has the advantage of avoiding some word decoding errors in Automatic Speech Recognition (ASR). Experiments on Mandarin-English CLSSR show that phone-based searching and conventional translation-based searching are complementary. Adding phone matching achieved a 26.29% improvement in F-measure over searching on state-of-the-art Machine Translation (MT) output and an 8.83% improvement over searching on Entity Translation (ET) output.
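The abstract does not say which fuzzy matching algorithm is used, so the following is only a minimal sketch of the general idea, assuming Levenshtein edit distance over phone symbols and a sliding window across each segment's phone sequence. The function names, the phone symbols, and the 0.7 threshold are all hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance between two sequences of phone symbols."""
    prev = list(range(len(b) + 1))
    for i, pa in enumerate(a, start=1):
        curr = [i]
        for j, pb in enumerate(b, start=1):
            cost = 0 if pa == pb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def best_window_similarity(query: Sequence[str], segment: Sequence[str]) -> float:
    """Best normalized similarity of the query against any window of the segment.

    Assumption: windows have the same length as the query; a segment shorter
    than the query is compared as a whole.
    """
    n = len(query)
    if n == 0:
        return 0.0
    if len(segment) <= n:
        windows = [segment]
    else:
        windows = [segment[i:i + n] for i in range(len(segment) - n + 1)]
    best = 0.0
    for window in windows:
        distance = edit_distance(query, window)
        best = max(best, 1.0 - distance / max(n, len(window)))
    return best


def retrieve(query: Sequence[str],
             segments: List[Sequence[str]],
             threshold: float = 0.7) -> List[int]:
    """Return indices of segments whose best window meets the (hypothetical) threshold."""
    return [i for i, seg in enumerate(segments)
            if best_window_similarity(query, seg) >= threshold]


# Hypothetical phone strings, for illustration only.
query_phones = ["b", "u", "sh", "i"]
segment_phones = [
    ["t", "a", "b", "u", "s", "i", "l"],  # contains a near match to the query
    ["m", "a", "o"],                       # unrelated
]
matches = retrieve(query_phones, segment_phones)
```

Because the comparison runs on phones rather than decoded words, a name that the recognizer mis-decodes as some other word can still be found when its phone string stays close to the query.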
Pages: 281 / +
Page count: 2