Searching for Translated Plagiarism with the Help of Desktop Grids

被引:0
作者
Máté Pataki
Attila Csaba Marosi
机构
[1] Computer and Automation Research Institute,
[2] MTA SZTAKI,undefined
来源
Journal of Grid Computing | 2013年 / 11卷
关键词
Cross-language plagiarism; Desktop Grid; Wikipedia; Volunteer computing;
D O I
暂无
中图分类号
学科分类号
摘要
Translated or cross-lingual plagiarism is defined as the translation of someone else’s work or words without marking it as such or without giving credit to the original author. The existence of cross-lingual plagiarism is not new, but only in recent years, due to the rapid development of the natural language processing, appeared the first algorithms which tackled the difficult task of detecting it. Most of these algorithms utilize machine translation to compare texts written in different languages. We propose a different method, which can effectively detect translations between language-pairs where machine translations still produce low quality results. Our new algorithm presented in this paper is based on information retrieval (IR) and a dictionary based similarity metric. The preprocessing of the candidate documents for the IR is computationally intensive, but easily parallelizable. We propose a desktop Grid solution for this task. As the application is time sensitive and the desktop Grid peers are unreliable, a resubmission mechanism is used which assures that all jobs of a batch finish within a reasonable time period without dramatically increasing the load on the whole system.
引用
收藏
页码:149 / 166
页数:17
相关论文
共 5 条
  • [1] Kacsuk P(2009)SZTAKI Desktop Grid, (SZDG): a flexible and scalable desktop Grid system J. Grid Computing 7 439-461
  • [2] Kondo D(2007)Scheduling task parallel applications for rapid application turnaround on enterprise desktop Grids J. Grid Comput. 5 379-405
  • [3] Lázaro D(2012)Long-term availability prediction for groups of volunteer resources J. Parallel Distributed Comput. 72 281-296
  • [4] Kondo D(undefined)undefined undefined undefined undefined-undefined
  • [5] Marquès JM(undefined)undefined undefined undefined undefined-undefined