Word Sequence-based Newspaper Articles Plagiarism Detection

被引:0
作者
Chung, Hyun-Sook [2 ]
Park, Jong-An [3 ]
An, Young-Eun [3 ]
Kim, Jung-Min [1 ]
机构
[1] Daejin Univ, Dept Comp Engn, Pochon 487711, Gyeonggi Do, South Korea
[2] Chosun Univ, Dept Comp Engn, Kwangju 501759, South Korea
[3] Chosun Univ, Dept Informat & Commun Engn, Kwangju 501759, South Korea
来源
INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL | 2011年 / 14卷 / 06期
关键词
Plagiarism detection; Newspaper plagiarism; Gene sequence alignment; SIMILARITY;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Currently, most media publishing companies publish their news in real time via the World Wide Web, which enables instant access to newspaper articles. In this rapid publishing environment, original new articles can be readily reproduced and partially updated. Plagiarism of newspaper articles violates timeliness, and can severely diminish the economic value of newspaper articles, particularly that of the original newspaper article. Until now, however, plagiarism between newspaper articles in the news industry has yet to be addressed thoroughly. In this paper, we propose a semi-automatic plagiarism detection method based on gene sequence alignment for Korean newspapers. In order to solve the problem of the difficulty inherent to manual examination, our plagiarism detection system automatically recommends plagiarism candidates to experts after processing, including stop-words elimination, morphological analysis, word sequence alignment, similarity computation, and plagiarism determination steps. Herein, we demonstrate experimentally that our system evidences reasonable precision and recall.
引用
收藏
页码:2095 / 2113
页数:19
相关论文
共 50 条
[21]   Software Plagiarism Detection: A Graph-based Approach [J].
Chae, Dong-Kyu ;
Ha, Jiwoon ;
Kim, Sang-Wook ;
Kang, BooJoong ;
Im, Eul Gyu .
PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, :1577-1580
[22]   A Tree-based Conceptual Matching For Plagiarism Detection [J].
Osman, Ahmed Hamza ;
Salim, Naomie ;
Elhadi, Ammar Ahmed E. .
2013 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRICAL AND ELECTRONICS ENGINEERING (ICCEEE), 2013, :571-579
[23]   A Practical Algorithm for Plagiarism Detection Based on Search Engine [J].
Qiu, Zhuangli ;
Xu, Dongling .
MECHANICAL, MATERIALS AND MANUFACTURING ENGINEERING, PTS 1-3, 2011, 66-68 :2287-2290
[24]   A Cross Language Plagiarism Detection Based on Cloud Computing [J].
Fan, Chih-Tien ;
Nguyen Dang Minh ;
Muhammad, Husaini .
INTELLIGENT SYSTEMS AND APPLICATIONS (ICS 2014), 2015, 274 :2090-2099
[25]   Fast Plagiarism Detection Based on Simple Document Similarity [J].
Baba, Kensuke .
2017 TWELFTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM), 2017, :54-58
[26]   Plagiarism Detection with Genetic-Based Parameter Tuning [J].
Sanchez-Perez, Miguel A. ;
Gelbukh, Alexander ;
Sidorov, Grigori ;
Gomez-Adorno, Helena .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (01)
[27]   Plagiarism detection in Chinese based on chunk and paragraph weight [J].
Wang, Tao ;
Fan, Xiao-Zhong ;
Liu, Jie .
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, :2574-2579
[28]   Figure Plagiarism Detection based on Textual Features Representation [J].
Eisa, Taiseer Abdalla Elfadil ;
Salim, Naomie ;
Alzahrani, Salha .
2017 6TH ICT INTERNATIONAL STUDENT PROJECT CONFERENCE (ICT-ISPC), 2017,
[29]   The DOMJudge based Online Judge System with Plagiarism Detection [J].
Minh Tuan Pham ;
Tan Bao Nguyen .
2019 IEEE - RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF), 2019, :213-218
[30]   An Adaptive Image-based Plagiarism Detection Approach [J].
Meuschke, Norman ;
Gondek, Christopher ;
Seebacher, Daniel ;
Breitinger, Corinna ;
Keim, Daniel ;
Gipp, Bela .
JCDL'18: PROCEEDINGS OF THE 18TH ACM/IEEE JOINT CONFERENCE ON DIGITAL LIBRARIES, 2018, :131-140