Combination of similarity measures for effective spoken document retrieval

Cited by: 8

Authors:

Crestani, F [1]

Affiliations:

[1] Univ Strathclyde, Dept Comp & Informat Sci, Glasgow G1 1XH, Lanark, Scotland

Source:

JOURNAL OF INFORMATION SCIENCE | 2003, Vol. 29, No. 2

DOI:

10.1177/016555103763031572

Chinese Library Classification (CLC):

TP [Automation technology, computer technology]

Subject classification code:

0812

Abstract:

Often users of information retrieval systems and document authors use different terms to refer to the same concept. For this simple reason, information retrieval is affected by the 'term mismatch' problem. The term mismatch problem does not only have the effect of hindering the retrieval of relevant documents, it also produces bad rankings of relevant documents. A similar problem can be found in spoken document retrieval, where terms misrecognized by the speech recognition process can hinder the retrieval of potentially relevant spoken documents. We will call this problem 'term misrecognition', by analogy to the term mismatch problem. This paper presents two classes of retrieval models that attempt to tackle both the term mismatch and the term misrecognition problems at retrieval time using term similarity information. The models use either complete or partial knowledge of semantic and phonetic term similarity, evaluated using statistical methods from the corpus.

Citation

Pages: 87-96

Page count: 10

50 entries in total

[31] Spoken document retrieval for the languages of Hong Kong
Meng, HM
Hui, PY
PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001 : 201 - 204
[32] Cambridge University spoken document retrieval system
Johnson, S.E.
Jourlin, P.
Moore, G.L.
Sparck Jones, K.
Woodland, P.C.
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 49 - 52
[33] Automatic story segmentation for spoken document retrieval
Hui, PY
Tang, XO
Meng, HM
Lam, W
Gao, XB
10TH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3: MEETING THE GRAND CHALLENGE: MACHINES THAT SERVE PEOPLE, 2001 : 1319 - 1322
[34] IMPROVED LATTICE-BASED SPOKEN DOCUMENT RETRIEVAL BY DIRECTLY LEARNING FROM THE EVALUATION MEASURES
Meng, Chao-hong
Lee, Hung-yi
Lee, Lin-shan
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-8, PROCEEDINGS, 2009 : 4893 - +
[35] WEIGHTED MATRIX FACTORIZATION FOR SPOKEN DOCUMENT RETRIEVAL
Chen, Kuan-Yu
Wang, Hsin-Min
Chen, Berlin
Chen, Hsin-Hsi
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013 : 8530 - 8534
[36] Speech transcription and spoken document retrieval in Finnish
Kurimo, M
Turunen, V
Ekman, I
MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2005, 3361 : 253 - 262
[37] HANDLING VERBOSE QUERIES FOR SPOKEN DOCUMENT RETRIEVAL
Lin, Shih-Hsiang
Jan, Ea-Ee
Chen, Berlin
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011 : 5552 - 5555
[38] Easy Listening: Spoken Document Retrieval in CHoral
Heeren, Willemijn
van der Werff, Laurens
de Jong, Franciska
Ordelman, Roeland
Verschoor, Thijs
van Hessen, Arjan
Langelaar, Mies
INTERDISCIPLINARY SCIENCE REVIEWS, 2009, 34 (2-3) : 236 - 252
[39] Open-Vocabulary Spoken Document Retrieval based on new subword models and subword phonetic similarity
Iwata, Kohei
Itoh, Yoshiaki
Kojima, Kazunori
Ishigame, Masaaki
Tanaka, Kazuyo
Lee, Shi-wook
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006 : 325 - +
[40] SpeechFind: Advances in spoken document retrieval for a National Gallery of the Spoken Word
Hansen, JH
Huang, RQ
Zhou, B
Seadle, M
Deller, JR
Gurijala, AR
Kurimo, M
Angkititrakul, P
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05) : 712 - 730