Authorship attribution of short texts using multi-layer perceptron

Cited by: 6
Authors
Saha, Nilan [1]
Das, Pratyush [1]
Saha, Himadri Nath [2, 3]
Affiliations
[1] Inst Engn & Management, Comp Sci & Engn, Kolkata, India
[2] Inst Engn & Management, Kolkata, India
[3] Inst Engn & Management, Comp Sci & Engn Dept, Kolkata, India
Keywords
multi-layer perceptron; stylometry
DOI
10.1504/IJAPR.2018.094819
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Authorship attribution, which uses stylometric techniques to analyse texts, grew out of earlier efforts to verify the authenticity of evidence and to establish authorial identity, among other things. With the advent of the digital era, traditional pen-and-paper writing has been replaced by electronic documents, which makes earlier handwriting-analysis techniques impossible because the electronic form eliminates the informative differences in authorial style. Previous authorship attribution work focused mainly on unmasking the authors of long digital texts; in this study, we do the same for short texts shared on social platforms and boards. We use a multi-layer perceptron to attribute short texts to their authors, achieving 96.44% accuracy on a Twitter dataset of four authors with 400 tweets each.
Pages: 251-259
Page count: 9