Towards the Detection of Cross-Language Source Code Reuse

被引:0
作者
Flores, Enrique [1 ]
Barron-Cedeno, Alberto [1 ]
Rosso, Paolo [1 ]
Moreno, Lidia [1 ]
机构
[1] Univ Politecn Valencia, Dept Sistemas Informat & Comp, E-46022 Valencia, Spain
来源
NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS | 2011年 / 6716卷
关键词
Source code reuse; cross-language source code reuse analysis; plagiarism detection; PLAGIARISM;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Internet has made available huge amounts of information, also source code. Source code repositories and, in general, programming related websites, facilitate its reuse. In this work, we propose a simple approach to the detection of cross-language source code reuse, a nearly investigated problem. Our preliminary experiments, based on character n-grams comparison, show that considering different sections of the code (i.e., comments, code, reserved words, etc.), leads to different results. When considering three programming languages: C++, Java, and Python, the best result is obtained when comments are discarded and the entire source code is considered.
引用
收藏
页码:250 / 253
页数:4
相关论文
共 7 条
[1]  
[Anonymous], 2006, 29 AUSTRALASIAN COMP
[2]   AN EMPIRICAL-APPROACH FOR DETECTING PROGRAM SIMILARITY AND PLAGIARISM WITHIN A UNIVERSITY PROGRAMMING ENVIRONMENT [J].
FAIDHI, JAW ;
ROBINSON, SK .
COMPUTERS & EDUCATION, 1987, 11 (01) :11-19
[3]  
JANKOWITZ HT, 1988, THE COMPUTER JOURNAL, V31, P1
[4]   A statistical approach to crosslingual natural language tasks [J].
Pinto, David ;
Civera, Jorge ;
Barron-Cedeno, Alberto ;
Juan, Alfons ;
Rosso, Paolo .
JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 2009, 64 (01) :51-60
[5]   Cross-language plagiarism detection [J].
Potthast, Martin ;
Barron-Cedeno, Alberto ;
Stein, Benno ;
Rosso, Paolo .
LANGUAGE RESOURCES AND EVALUATION, 2011, 45 (01) :45-62
[6]   Detection of plagiarism in programming assignments [J].
Rosales, Francisco ;
Garcia, Antonio ;
Rodriguez, Santiago ;
Pedraza, Jose L. ;
Mendez, Rafael ;
Nieto, Manuel M. .
IEEE TRANSACTIONS ON EDUCATION, 2008, 51 (02) :174-183
[7]  
Stamatatos E., 2009, P SEPLN 2009 WORKSH, P38