Text-based Authorship Identification - A survey

被引:0
作者
Alhijawi, Bushra [1 ]
Hriez, Safaa [1 ]
Awajan, Arafat [1 ]
机构
[1] Princess Sumaya Univ Technol Amman, King Hussien Sch Informat Technol, Amman, Jordan
来源
2018 FIFTH INTERNATIONAL SYMPOSIUM ON INNOVATION IN INFORMATION AND COMMUNICATION TECHNOLOGY (ISIICT 2018) | 2018年
关键词
Forensic Analysis; Authorship Analysis; Authorship Identification; Machine Learning; Datasets; Application; Writeprint; Features; E-MAIL; ATTRIBUTION; CATEGORIZATION; VERIFICATION; MESSAGES; GENRE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The virtual world provides criminals with an anonymous environment to conduct malicious activities such as malware, sending ransom messages, spamming, theft intellectual property and sending ransom e-mails. All these activities are text in somehow. Therefore, there is a need for a tool in order to identify the author or creator of this illegal activity by analyzing the text. Text-based Authorship Identification techniques are used to identify the most possible author from a group of potential suspects of text. This paper is meant to explore the text-based authorship identification researches within the period 2007-2017. The researches were classified based on the application into email authorship, source code authorship, online text authorship, gender identification and online messages authorship. Also, the paper reviews and reports the datasets which used in the experiments of text-based authorship identification techniques. Finally, it reported the techniques which were used in authorship identification.
引用
收藏
页码:137 / 143
页数:7
相关论文
共 78 条
[1]   Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums [J].
Abbasi, Ahmed ;
Chen, Hsinchun ;
Salem, Arab .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2008, 26 (03)
[2]   Descriptive Analytics: Examining Expert Hackers in Web Forums [J].
Abbasi, Ahmed ;
Li, Weifeng ;
Benjamin, Victor ;
Hu, Shiyu ;
Chen, Hsinchun .
2014 IEEE JOINT INTELLIGENCE AND SECURITY INFORMATICS CONFERENCE (JISIC), 2014, :56-63
[3]  
Abdallah Emad E., 2013, International Journal of Security and Networks, V8, P72
[4]   Naive Bayes classifiers for authorship attribution of Arabic texts [J].
Altheneyan, Alaa Saleh ;
Menai, Mohamed El Bachir .
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2014, 26 (04) :473-484
[5]  
[Anonymous], 2011, FUZZ INF PROC SOC NA
[6]  
[Anonymous], 2005, P 2005 ACH ALLC C
[7]  
[Anonymous], 2012, ARXIV12086268
[8]  
[Anonymous], ARXIV14016118
[9]  
[Anonymous], 2010, Proceedings of the ACL 2010 Conference Short Papers, ACLShort'10, page
[10]  
Baayen H., 1996, Literary & Linguistic Computing, V11, P121, DOI 10.1093/llc/11.3.121