Crowdsourcing-Based Evaluation of Automatic References Between WordNet and Wikipedia

被引:3
作者
Szymanski, Julian [1 ]
Boinski, Tomasz [1 ]
机构
[1] Gdansk Univ Technol, Fac Elect Telecommun & Informat, 11-12 Narutowicza St, PL-80233 Gdansk, Poland
关键词
WordNet; Wikipedia; lexical resources integration; natural language processing; game with a purpose; SEMANTIC WEB; RECOGNITION; ONTOLOGY; GAMES;
D O I
10.1142/S0218194019500141
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper presents an approach to build references (also called mappings) between WordNet and Wikipedia. We propose four algorithms used for automatic construction of the references. Then, based on an aggregation algorithm, we produce an initial set of mappings that has been evaluated in a cooperative way. For that purpose, we implement a system for the distribution of evaluation tasks, that have been solved by the user community. To make the tasks more attractive, we embed them into a game. Results show the initial mappings have good quality, and they have also been improved by the community. As a result, we deliver a high quality dataset of the mappings between two lexical repositories: WordNet and Wikipedia, that can be used in a wide range of NLP tasks. We also show that the framework for collaborative validation can be used in other tasks that require human judgments.
引用
收藏
页码:317 / 344
页数:28
相关论文
共 56 条
[1]  
Ahn L. V., 2007, IEEE INT C AC SPEECH, V4
[2]  
AJT, 2015, LESS DUOL EFF SUPP F
[3]  
Anderson DP, 2006, SIXTH IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, P73
[4]   The Semantic Web - A new form of Web content that is meaningful to computers will unleash a revolution of new possibilities [J].
Berners-Lee, T ;
Hendler, J ;
Lassila, O .
SCIENTIFIC AMERICAN, 2001, 284 (05) :34-+
[5]   Linked Data - The Story So Far [J].
Bizer, Christian ;
Heath, Tom ;
Berners-Lee, Tim .
INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2009, 5 (03) :1-22
[6]   Game with a Purpose for Mappings Verification [J].
Boinski, Tomasz .
PROCEEDINGS OF THE 2016 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2016, 8 :405-409
[7]   A TECHNIQUE FOR COMPUTER DETECTION AND CORRECTION OF SPELLING ERRORS [J].
DAMERAU, FJ .
COMMUNICATIONS OF THE ACM, 1964, 7 (03) :171-176
[8]   The semantic web: yet another hip? [J].
Ding, Y ;
Fensel, D ;
Klein, M ;
Omelayenko, B .
DATA & KNOWLEDGE ENGINEERING, 2002, 41 (2-3) :205-227
[9]   Introduction to "This is Watson" [J].
Ferrucci, D. A. .
IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2012, 56 (3-4)
[10]   "Prove You're Human": Fetishizing Material Embodiment and Immaterial Labor in Information Networks [J].
Foley, Megan .
CRITICAL STUDIES IN MEDIA COMMUNICATION, 2014, 31 (05) :365-379