Automatic construction of parallel English-Chinese corpus for cross-language information retrieval

Citations: 0
Authors
Chen, J [1 ]
Nie, JY [1 ]
Affiliation
[1] Univ Montreal, Dept Informat & Rech Operat, Montreal, PQ H3C 3J7, Canada
Source
6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP | 2000
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A major obstacle to the construction of a probabilistic translation model is the lack of large parallel corpora. In this paper we first describe a parallel text mining system that finds parallel texts automatically on the Web. The generated Chinese-English parallel corpus is used to train a probabilistic translation model which translates queries for Chinese-English cross-language information retrieval (CLIR). We will discuss some problems in translation model training and show the preliminary CLIR results.
Pages: 21-28
Page count: 8
Related papers
50 in total
  • [41] Translation Techniques in Cross-Language Information Retrieval
    Zhou, Dong
    Truran, Mark
    Brailsford, Tim
    Wade, Vincent
    Ashman, Helen
    ACM COMPUTING SURVEYS, 2012, 45 (01)
  • [42] Translation Ambiguity in Cross-Language Information Retrieval
    Sadat, Fatiha
    BUSINESS TRANSFORMATION THROUGH INNOVATION AND KNOWLEDGE MANAGEMENT: AN ACADEMIC PERSPECTIVE, VOLS 1-2, 2010, : 301 - 303
  • [43] Neural Methods for Cross-Language Information Retrieval
    Yang, Eugene
    Lawrie, Dawn
    Mayfield, James
    Nair, Suraj
    Oard, Douglas W.
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 3430 - 3431
  • [44] Improving query translation in English-Korean cross-language information retrieval
    Seo, HC
    Kim, SB
    Rim, HC
    Myaeng, SH
    INFORMATION PROCESSING & MANAGEMENT, 2005, 41 (03) : 507 - 522
  • [45] Japanese/English cross-language information retrieval: Exploration of query translation and transliteration
    Fujii, A
    Ishikawa, T
    COMPUTERS AND THE HUMANITIES, 2001, 35 (04) : 389 - 420
  • [46] The BETTER Cross-Language Information Retrieval Datasets
    Soboroff, Ian
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 3047 - 3053
  • [47] Arabic Cross-Language Information Retrieval: A Review
    Elayeb, Bilel
    Bounhas, Ibrahim
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2016, 15 (03)
  • [48] Relevance feedback and cross-language information retrieval
    Orengo, Viviane Moreira
    Huyck, Christian
    INFORMATION PROCESSING & MANAGEMENT, 2006, 42 (05) : 1203 - 1217
  • [49] Cross-Language Information Retrieval: An analysis of errors
    Ruiz, ME
    Srinivasan, P
    PROCEEDINGS OF THE ASIS ANNUAL MEETING, 1998, 35 : 153 - 165
  • [50] Combining evidence for cross-language information retrieval
    Kamps, J
    Monz, C
    de Rijke, M
    ADVANCES IN CROSS-LANGUAGE INFORMATION RETRIEVAL, 2003, 2785 : 111 - 126