Comparing and Combining Content- and Citation-Based Approaches for Plagiarism Detection

Cited by: 24
Authors
Pertile, Solange de L. [1]
Moreira, Viviane P. [1]
Rosso, Paolo [2]
Affiliations
[1] Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, RS, Brazil
[2] Univ Politecn Valencia, Nat Language Engn Lab, PRHLT Res Ctr, Valencia, Spain
Keywords
plagiarism; bibliographic coupling; co-citation analysis; UNIVERSITY-STUDENTS PERCEPTIONS; SELF-PLAGIARISM
DOI
10.1002/asi.23593
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
The vast number of scientific publications available online makes it easier for students and researchers to reuse text from other authors and harder to check the originality of a given text. Reusing text without crediting the original authors is considered plagiarism. A number of studies have reported the prevalence of plagiarism in academia. As a consequence, numerous institutions and researchers are dedicated to devising systems to automate the process of checking for plagiarism. This work focuses on the problem of detecting text reuse in scientific papers. The contributions of this paper are twofold: (a) we survey the existing approaches for plagiarism detection based on content, based on content and structure, and based on citations and references; and (b) we compare content- and citation-based approaches with the goal of evaluating whether they are complementary and whether their combination can improve the quality of the detection. We carried out experiments with real data sets of scientific papers and concluded that a combination of the methods can be beneficial.
Pages: 2511-2526
Page count: 16