Analysis and classification of privacy-sensitive content in social media posts

Cited by: 0
Authors
Livio Bioglio
Ruggero G. Pensa
Affiliation
[1] University of Turin
Source
EPJ Data Science, Volume 11
Keywords
Privacy; Text classification; Content analysis
DOI
Not available
Abstract
User-generated content often contains private information, even when it is shared publicly on social media and on the web in general. Although many filtering and natural language approaches for automatically detecting obscenities or hate speech have been proposed, determining whether a shared post contains sensitive information is still an open issue. The problem has been addressed by assuming, for instance, that sensitive content is published anonymously, on anonymous social media platforms, or with more restrictive privacy settings. These assumptions are far from realistic, since the authors of posts often underestimate or overlook their actual exposure to privacy risks. Hence, in this paper, we address the problem of content sensitivity analysis directly, by presenting and characterizing a new annotated corpus of around ten thousand posts, each one annotated as sensitive or non-sensitive by a pool of experts. We characterize our data with respect to the closely related problem of self-disclosure, pointing out the main differences between the two tasks. We also present the results of several deep neural network models that outperform previous naive attempts at classifying social media posts according to their sensitivity, and show that state-of-the-art approaches based on anonymity and lexical analysis do not work in realistic application scenarios.