Combination of similarity measures for effective spoken document retrieval

被引:8
|
作者
Crestani, F [1 ]
机构
[1] Univ Strathclyde, Dept Comp & Informat Sci, Glasgow G1 1XH, Lanark, Scotland
关键词
D O I
10.1177/016555103763031572
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Often users of information retrieval systems and document authors use different terms to refer to the same concept. For this simple reason, information retrieval is affected by the 'term mismatch' problem. The term mismatch problem does not only have the effect of hindering the retrieval of relevant documents, it also produces bad rankings of relevant documents. A similar problem can be found in spoken document retrieval, where terms misrecognized by the speech recognition process can hinder the retrieval of potentially relevant spoken documents. We will call this problem 'term misrecognition', by analogy to the term mismatch problem. This paper presents two classes of retrieval models that attempt to tackle both the term mismatch and the term misrecognition problems at retrieval time using term similarity information. The models use either complete or partial knowledge of semantic and phonetic term similarity, evaluated using statistical methods from the corpus.
引用
收藏
页码:87 / 96
页数:10
相关论文
共 50 条
  • [31] Spoken document retrieval for the languages of Hong Kong
    Meng, HM
    Hui, PY
    PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 201 - 204
  • [32] Cambridge University spoken document retrieval system
    Johnson, S.E.
    Jourlin, P.
    Moore, G.L.
    Sparck Jones, K.
    Woodland, P.C.
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 49 - 52
  • [33] Automatic story segmentation for spoken document retrieval
    Hui, PY
    Tang, XO
    Meng, HM
    Lam, W
    Gao, XB
    10TH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3: MEETING THE GRAND CHALLENGE: MACHINES THAT SERVE PEOPLE, 2001, : 1319 - 1322
  • [34] IMPROVED LATTICE-BASED SPOKEN DOCUMENT RETRIEVAL BY DIRECTLY LEARNING FROM THE EVALUATION MEASURES
    Meng, Chao-hong
    Lee, Hung-yi
    Lee, Lin-shan
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4893 - +
  • [35] WEIGHTED MATRIX FACTORIZATION FOR SPOKEN DOCUMENT RETRIEVAL
    Chen, Kuan-Yu
    Wang, Hsin-Min
    Chen, Berlin
    Chen, Hsin-Hsi
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8530 - 8534
  • [36] Speech transcription and spoken document retrieval in Finnish
    Kurimo, M
    Turunen, V
    Ekman, I
    MACHINE LEARNING FOR MULTIMODAL INTERACTION, 2005, 3361 : 253 - 262
  • [37] HANDLING VERBOSE QUERIES FOR SPOKEN DOCUMENT RETRIEVAL
    Lin, Shih-Hsiang
    Jan, Ea-Ee
    Chen, Berlin
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5552 - 5555
  • [38] Easy Listening: Spoken Document Retrieval in CHoral
    Heeren, Willemijn
    van der Werff, Laurens
    de Jong, Franciska
    Ordelman, Roeland
    Verschoor, Thijs
    van Hessen, Arjan
    Langelaar, Mies
    INTERDISCIPLINARY SCIENCE REVIEWS, 2009, 34 (2-3) : 236 - 252
  • [39] Open-Vocabulary Spoken Document Retrieval based on new subword models and subword phonetic similarity
    Iwata, Kohei
    Itoh, Yoshiaki
    Kojima, Kazunori
    Ishigame, Masaaki
    Tanaka, Kazuyo
    Lee, Shi-wook
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 325 - +
  • [40] SpeechFind: Advances in spoken document retrieval for a National Gallery of the Spoken Word
    Hansen, JH
    Huang, RQ
    Zhou, B
    Seadle, M
    Deller, JR
    Gurijala, AR
    Kurimo, M
    Angkititrakul, P
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 712 - 730