Fast Plagiarism Detection in Large-Scale Data

Cited by: 0
Authors
Szmit, Radoslaw [1]
Affiliations
[1] Warsaw Univ Technol, Inst Control & Ind Elect, Ul Koszykowa 75, PL-00662 Warsaw, Poland
Source
BEYOND DATABASES, ARCHITECTURES AND STRUCTURES: TOWARDS EFFICIENT SOLUTIONS FOR DATA ANALYSIS AND KNOWLEDGE REPRESENTATION | 2017 / Vol. 716
Keywords
Plagiarism detection; Sentence hashing; Cloud computing; Semantic comparison; Big Data; NEKST; OSA; PERFECT HASH FUNCTIONS; ALGORITHM
DOI
10.1007/978-3-319-58274-0_27
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents research results from building a Polish semantic Internet search engine called Natively Enhanced Knowledge Sharing Technologies (NEKST) and its plagiarism detection module. The main goal is to describe the tools and algorithms of the engine and its use within the Open System for Antiplagiarism (OSA).
Pages: 329-343
Page count: 15
Related papers
50 in total
  • [1] Fast Unsupervised Projection for Large-Scale Data
    Wang, Jingyu
    Wang, Lin
    Nie, Feiping
    Li, Xuelong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 3634 - 3644
  • [2] Citation-Based Plagiarism Detection: Practicability on a Large-Scale Scientific Corpus
    Gipp, Bela
    Meuschke, Norman
    Breitinger, Corinna
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2014, 65 (08) : 1527 - 1540
  • [3] Fast shared boosting for large-scale concept detection
    Hervé Le Borgne
    Nicolas Honnorat
    Multimedia Tools and Applications, 2012, 60 : 389 - 402
  • [4] Fast and Flexible Large-Scale Clone Detection with CloneWorks
    Svajlenko, Jeffrey
    Roy, Chanchal K.
    PROCEEDINGS OF THE 2017 IEEE/ACM 39TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING COMPANION (ICSE-C 2017), 2017, : 27 - 30
  • [5] Fast shared boosting for large-scale concept detection
    Le Borgne, Herve
    Honnorat, Nicolas
    MULTIMEDIA TOOLS AND APPLICATIONS, 2012, 60 (02) : 389 - 402
  • [6] Fast detection of worm infection for large-scale networks
    He, Hui
    Hu, Mingzeng
    Zhang, Weizhe
    Zhang, Hongli
    ADVANCES IN MACHINE LEARNING AND CYBERNETICS, 2006, 3930 : 672 - 681
  • [7] A Fast Retrieval Algorithm for Large-Scale XML Data
    Tanioka, Hiroki
    FOCUSED ACCESS TO XML DOCUMENTS, 2008, 4862 : 129 - 137
  • [8] Application of an Automatic Plagiarism Detection System in a Large-scale Assessment of English Speaking Proficiency
    Wang, Xinhao
    Evanini, Keelan
    Mulholland, Matthew
    Qian, Yao
    Bruno, James, V
    INNOVATIVE USE OF NLP FOR BUILDING EDUCATIONAL APPLICATIONS, 2019, : 435 - 443
  • [9] QuartetS: a fast and accurate algorithm for large-scale orthology detection
    Yu, Chenggang
    Zavaljevski, Nela
    Desai, Valmik
    Reifman, Jaques
    NUCLEIC ACIDS RESEARCH, 2011, 39 (13) : e88
  • [10] Fast Semisupervised Learning With Bipartite Graph for Large-Scale Data
    He, Fang
    Nie, Feiping
    Wang, Rong
    Li, Xuelong
    Jia, Weimin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (02) : 626 - 638