Authorship Attribution of Small Messages Through Language Models

Cited by: 1
Authors
Theophilo, Antonio [1, 2]
Rocha, Anderson [1]
Affiliations
[1] Univ Estadual Campinas, Artificial Intelligence Lab Recod Ai, Inst Comp, Campinas, Brazil
[2] Ctr Informat Technol Renato Archer, Campinas, Brazil
Source
2022 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS) | 2022
Keywords
Authorship Attribution; Social Media; Language Models; Multimedia Forensics; Deep Learning;
DOI
10.1109/WIFS55849.2022.9975413
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Social media platforms brought numerous benefits to our society, but several wrongdoings such as racism, misogyny, hate speech, anti-science statements, and large-scale misinformation came alongside. Authorship attribution is a forensics tool that can help fight against these misconducts when applied to the small texts posted on these platforms. In this work, we exploit the recent developments in language models to tackle the problem of authorship attribution of small messages. Training one model per suspect, we devise a generative approach and compare it against a state-of-the-art discriminative method. Our results show that generative and discriminative features are complementary and can be leveraged to improve the results of current methods. Finally, we propose a strategy to use discriminative and generative models jointly and draw future research paths.
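The generative idea mentioned in the abstract (one language model per suspect) can be illustrated with a minimal sketch. The abstract does not say which language model, scoring rule, or data the authors use, so everything here is an assumption made for illustration: a toy character trigram model with add-one smoothing stands in for the language model, a message is attributed to the suspect whose model gives it the highest average log-probability, and the names `CharNgramLM`, `attribute`, `suspect_a`, and `suspect_b` are hypothetical.

```python
import math
from collections import Counter


class CharNgramLM:
    """Toy character n-gram model with add-one smoothing.

    Illustrative stand-in only; not the model used in the paper.
    """

    def __init__(self, n=3):
        self.n = n
        self.ngrams = Counter()    # counts of (context + next char)
        self.contexts = Counter()  # counts of context alone
        self.vocab = set()

    def _pad(self, text):
        # Start/end markers so the first and last characters are modelled too.
        return "\x02" * (self.n - 1) + text + "\x03"

    def fit(self, messages):
        for msg in messages:
            padded = self._pad(msg)
            self.vocab.update(padded)
            for i in range(self.n - 1, len(padded)):
                ctx = padded[i - self.n + 1:i]
                self.ngrams[ctx + padded[i]] += 1
                self.contexts[ctx] += 1
        return self

    def avg_log_prob(self, message):
        """Average per-character log-probability of a message."""
        padded = self._pad(message)
        v = len(self.vocab) + 1  # one extra slot for unseen characters
        total, count = 0.0, 0
        for i in range(self.n - 1, len(padded)):
            ctx = padded[i - self.n + 1:i]
            num = self.ngrams[ctx + padded[i]] + 1
            den = self.contexts[ctx] + v
            total += math.log(num / den)
            count += 1
        return total / count


def attribute(message, models):
    """Return the suspect whose model scores the message highest, plus all scores."""
    scores = {author: lm.avg_log_prob(message) for author, lm in models.items()}
    return max(scores, key=scores.get), scores


# Made-up toy messages, one small corpus per hypothetical suspect.
corpus = {
    "suspect_a": ["omg lol that is sooo funny!!!", "lol cant wait!!! sooo excited"],
    "suspect_b": ["I would argue that the evidence is weak.",
                  "Indeed, the argument is rather weak."],
}
models = {author: CharNgramLM(n=3).fit(msgs) for author, msgs in corpus.items()}
best, scores = attribute("lol sooo good!!!", models)
print(best, scores)
```

The per-suspect scores in `scores` could also serve as features alongside a discriminative classifier's outputs, which is one possible reading of the joint strategy the abstract mentions; the abstract itself does not specify how the two are combined.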
Pages: 6