Cross-language speech retrieval: Establishing a baseline performance

被引:0
作者
Sheridan, P
Wechsler, M
Schauble, P
机构
来源
PROCEEDINGS OF THE 20TH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL | 1997年
关键词
D O I
10.1145/258525.258544
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present here the realisation of a cross-language speech retrieval system which retrieves German speech documents in response to user queries specified as French text. This has been achieved through the integration of two existing modules of the SPIDER information retrieval system, namely the query pseudo-translation module and the speech retrieval module. Our approach to cross-language retrieval uses an automatically contstructed corpus-based information structure called a similarity thesaurus. A similarity thesaurus can be constructed over any loosely comparable corpus - a parallel corpus is not necessary. The similarity thesaurus used here was constructed over a 330 MByte corpus of comparable German and French news stories. Our speech retrieval module is based on a speaker-independent phoneme recognizer and it indexes speech documents by N-grams of phonemic features. The speech retrieval module includes an additional probabilistic matching technique designed to aid retrieval from erroneous data such as the phonemic output of the speech recognition process. We have evaluated our cross-language speech retrieval system over a collection of 30 hours (3.4 GBytes) of German speech, comparing the effectiveness of French queries (cross-language) against performance on equivalent German queries (mono-lingual). It must be stressed that this work represents our first step in the direction of cross-language speech retrieval. Our aim here is to establish a baseline of performance on this task, against which we can then measure the success of our continuing research in this area.
引用
收藏
页码:99 / 108
页数:10
相关论文
共 50 条
  • [41] Word sense disambiguation for cross-language information retrieval
    Liu, MX
    Diamond, T
    Diekema, AR
    6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP, 2000, : B35 - B40
  • [42] Cross-language Information Retrieval Based on Multiple Information
    Liu, Pengyuan
    Zheng, Zhijun
    Su, Qi
    2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 623 - 626
  • [43] Online Learning to Rank for Cross-Language Information Retrieval
    Rahimi, Razieh
    Shakery, Azadeh
    SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 1033 - 1036
  • [44] Comparative evaluation of cross-language information retrieval systems
    Peters, Carol
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2005, 3379 LNCS : 152 - 161
  • [45] Exploiting Comparable Corpora for Cross-Language Information Retrieval
    Sadat, Fatiha
    PRICAI 2010: TRENDS IN ARTIFICIAL INTELLIGENCE, 2010, 6230 : 662 - 667
  • [46] Cross-language retrieval using HAIRCUT at CLEF 2004
    McNamee, P
    Mayfield, J
    MULTILINGUAL INFORMATION ACCESS FOR TEXT, SPEECH AND IMAGES, 2005, 3491 : 50 - 59
  • [47] Query Expansion for Personalized Cross-Language Information Retrieval
    Zhou, Dong
    Lawless, Seamus
    Liu, Jianxun
    Zhang, Sanrong
    Xu, Yu
    10TH INTERNATIONAL WORKSHOP ON SEMANTIC AND SOCIAL MEDIA ADAPTATION AND PERSONALIZATION SMAP 2015, 2015, : 18 - 22
  • [48] Fast document translation for cross-language information retrieval
    McCarley, JS
    Roukos, S
    MACHINE TRANSLATION AND THE INFORMATION SOUP, 1998, 1529 : 150 - 157
  • [49] Term Discrimination Value for Cross-Language Information Retrieval
    Montazeralghaem, Ali
    Rahimi, Razieh
    Allan, James
    PROCEEDINGS OF THE 2019 ACM SIGIR INTERNATIONAL CONFERENCE ON THEORY OF INFORMATION RETRIEVAL (ICTIR'19), 2019, : 136 - 139
  • [50] On-Demand Associative Cross-Language Information Retrieval
    Geraldo, Andre Pinto
    Moreira, Viviane P.
    Goncalves, Marcos A.
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5721 : 165 - +