Automatic construction of parallel English-Chinese corpus for cross-language information retrieval

Cited by: 0
Authors
Chen, J [1 ]
Nie, JY [1 ]
Affiliation
[1] Univ Montreal, Dept Informat & Rech Operat, Montreal, PQ H3C 3J7, Canada
Source
6TH APPLIED NATURAL LANGUAGE PROCESSING CONFERENCE/1ST MEETING OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE AND PROCEEDINGS OF THE ANLP-NAACL 2000 STUDENT RESEARCH WORKSHOP | 2000
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A major obstacle to the construction of a probabilistic translation model is the lack of large parallel corpora. In this paper we first describe a parallel text mining system that finds parallel texts automatically on the Web. The generated Chinese-English parallel corpus is used to train a probabilistic translation model which translates queries for Chinese-English cross-language information retrieval (CLIR). We will discuss some problems in translation model training and show the preliminary CLIR results.
Pages: 21 - 28
Number of pages: 8
Related papers
50 in total
  • [1] Research on English-Chinese cross-language information retrieval
    Zhang, Yue-Jie
    Zhang, Tao
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 3448 - +
  • [2] Research on english-chinese bi-directional cross-language information retrieval
    Zhang, Yuejie
    Zhang, Tao
    Cognitive Systems, 2007, 4429 : 84 - 95
  • [3] A lexical knowledge base approach for English-Chinese cross-language information retrieval
    Chen, JP
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2006, 57 (02): : 233 - 243
  • [4] Query Translation Optimization and Mathematical Modeling for English-Chinese Cross-Language Information Retrieval
    Zhao, Xia
    Song, Wanhe
    Li, Min
    Zhang, Jianjing
    APPLIED MATHEMATICS AND NONLINEAR SCIENCES, 2022, 8 (01) : 1777 - 1784
  • [5] ECIRS: An English-Chinese cross-language information-retrieval system
    Liu, S
    2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002, : 954 - 959
  • [6] Research on Chinese-English cross-language information retrieval
    Zhang, Tao
    Zhang, Yue-Jie
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 2591 - +
  • [7] Mining a Persian-English comparable corpus for cross-language information retrieval
    Hashemi, Homa B.
    Shakery, Azadeh
    INFORMATION PROCESSING & MANAGEMENT, 2014, 50 (02) : 384 - 398
  • [8] Query translation in Chinese-English cross-language information retrieval
    Zhang, YB
    Sun, L
    Du, L
    Sun, Y
    PROCEEDINGS OF THE 2000 JOINT SIGDAT CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND VERY LARGE CORPORA, 2000, : 104 - 109
  • [9] Study of Query Translation Dictionary Automatic Construction in Cross-Language Information Retrieval
    Xi, Su-Mei
    Cho, Young-Im
    INTELLIGENT AUTONOMOUS SYSTEMS 12 , VOL 2, 2013, 194 : 585 - +
  • [10] Automatic construction of English-Chinese translation lexicon from sentence aligned spoken language corpus
    Chen, Bo-Xing
    Du, Li-Min
    Jisuanji Xuebao/Chinese Journal of Computers, 2003, 26 (03): : 275 - 280