A Cross-Lingual Dictionary for English Wikipedia Concepts

Cited by: 0
Authors
Spitkovsky, Valentin I. [1]
Chang, Angel X.
Affiliations
[1] Google Inc, Google Res, Mountain View, CA 94043 USA
Source
LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2012
Keywords
cross-language information retrieval (CLIR); entity linking (EL); Wikipedia
DOI
Not available
Chinese Library Classification number
H0 [Linguistics]
Discipline classification codes
030303; 0501; 050102
Abstract
We present a resource for automatically associating strings of text with English Wikipedia concepts. Our machinery is bi-directional, in the sense that it uses the same fundamental probabilistic methods to map strings to empirical distributions over Wikipedia articles as it does to map article URLs to distributions over short, language-independent strings of natural language text. For maximal inter-operability, we release our resource as a set of flat line-based text files, lexicographically sorted and encoded with UTF-8. These files capture joint probability distributions underlying concepts (we use the terms article, concept and Wikipedia URL interchangeably) and associated snippets of text, as well as other features that can come in handy when working with Wikipedia articles and related information.
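The bi-directional mapping described in the abstract can be illustrated with a minimal sketch. It assumes the input is a list of observed (string, article URL) pairs; the function names, the toy observations, and the tab-separated output layout are all hypothetical and are not the format of the released files.

```python
from collections import Counter, defaultdict


def build_distributions(pairs):
    """Turn (string, url) observations into two empirical conditionals.

    Returns (by_string, by_url), where by_string[s][u] estimates P(u | s)
    and by_url[u][s] estimates P(s | u). Both are relative frequencies
    derived from the same joint counts.
    """
    joint = Counter(pairs)  # joint counts over (string, url)
    string_totals = Counter()
    url_totals = Counter()
    for (s, u), n in joint.items():
        string_totals[s] += n
        url_totals[u] += n

    by_string = defaultdict(dict)
    by_url = defaultdict(dict)
    for (s, u), n in joint.items():
        by_string[s][u] = n / string_totals[s]
        by_url[u][s] = n / url_totals[u]
    return by_string, by_url


def to_sorted_lines(by_string):
    """Serialize as flat, line-based text (hypothetical layout).

    Python compares str values by code point, which gives the same order
    as comparing their UTF-8 encodings byte by byte.
    """
    lines = [
        f"{s}\t{p:.4f}\t{u}"
        for s, targets in by_string.items()
        for u, p in targets.items()
    ]
    return sorted(lines)


# Toy observations, invented for illustration only.
observations = (
    [("Big Apple", "New_York_City")] * 3
    + [("Big Apple", "Big_Apple_(band)")]
    + [("NYC", "New_York_City")] * 4
)
by_string, by_url = build_distributions(observations)
lines = to_sorted_lines(by_string)
```

Here both directions come from one table of joint counts: normalizing by string totals gives a distribution over articles for each string, and normalizing by article totals gives a distribution over strings for each article.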
Pages: 3168-3175
Page count: 8