Analysis and classification of privacy-sensitive content in social media posts

被引:0
作者
Livio Bioglio
Ruggero G. Pensa
机构
[1] University of Turin,
来源
EPJ Data Science | / 11卷
关键词
Privacy; Text classification; Content analysis;
D O I
暂无
中图分类号
学科分类号
摘要
User-generated contents often contain private information, even when they are shared publicly on social media and on the web in general. Although many filtering and natural language approaches for automatically detecting obscenities or hate speech have been proposed, determining whether a shared post contains sensitive information is still an open issue. The problem has been addressed by assuming, for instance, that sensitive contents are published anonymously, on anonymous social media platforms or with more restrictive privacy settings, but these assumptions are far from being realistic, since the authors of posts often underestimate or overlook their actual exposure to privacy risks. Hence, in this paper, we address the problem of content sensitivity analysis directly, by presenting and characterizing a new annotated corpus with around ten thousand posts, each one annotated as sensitive or non-sensitive by a pool of experts. We characterize our data with respect to the closely-related problem of self-disclosure, pointing out the main differences between the two tasks. We also present the results of several deep neural network models that outperform previous naive attempts of classifying social media posts according to their sensitivity, and show that state-of-the-art approaches based on anonymity and lexical analysis do not work in realistic application scenarios.
引用
收藏
相关论文
共 50 条
[31]   Privacy Dictionary: A Linguistic Taxonomy of Privacy for Content Analysis [J].
Gill, Alastair J. ;
Vasalou, Asimina ;
Papoutsi, Chrysanthi ;
Joinson, Adam .
29TH ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2011, :3227-3236
[32]   Differing Content and Language Based on Poster-Patient Relationships on the Chinese Social Media Platform Weibo: Text Classification, Sentiment Analysis, and Topic Modeling of Posts on Breast Cancer [J].
Zhang, Zhouqing ;
Liew, Kongmeng ;
Kuijer, Roeline ;
She, Wan Jou ;
Yada, Shuntaro ;
Wakamiya, Shoko ;
Aramaki, Eiji .
JMIR CANCER, 2024, 10
[33]   The obstacles to China?s rural toilet revolution discussed on social media: A content analysis of Weibo posts and Zhihu answers data [J].
Zhang, Yang ;
Li, Fangshu ;
Lei, Yongsen ;
Chen, Beilei ;
Xiong, Tianyi ;
Wu, Jinjia .
ENVIRONMENTAL SCIENCE & POLICY, 2023, 142 :173-182
[34]   A Privacy Settings Prediction Model for Textual Posts on Social Networks [J].
Chen, Lijun ;
Xu, Ming ;
Yang, Xue ;
Zheng, Ning ;
Wu, Yiming ;
Xu, Jian ;
Qiao, Tong ;
Liu, Hongbin .
COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING, COLLABORATECOM 2017, 2018, 252 :578-588
[35]   Privacy and Social Media Conceptual Review on Private Turbulence in Communication Privacy Management of Social Media [J].
Yuliarti, Monika Sri ;
Anggreni, Likha Sari ;
Utari, Prahastiwi .
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MEDIA AND COMMUNICATION STUDIES (ICOMACS 2018), 2018, 260
[36]   A Novel Co-Training-Based Approach for the Classification of Mental Illnesses Using Social Media Posts [J].
Tariq, Subhan ;
Akhtar, Nadeem ;
Afzal, Humaira ;
Khalid, Shahzad ;
Mufti, Muhammad Rafiq ;
Hussain, Shahid ;
Habib, Asad ;
Ahmad, Ghufran .
IEEE ACCESS, 2019, 7 :166165-166172
[37]   Messaging strategies for communicating health-related information in social media—a content and effectiveness analysis of organ donation posts on Instagram in Germany [J].
Alexandra Olsacher ;
Celina Bade ;
Jan Ehlers ;
Bettina Freitag ;
Leonard Fehring .
BMC Public Health, 23
[38]   Radar-Based Activity Recognition in Strictly Privacy-Sensitive Settings Through Deep Feature Learning [J].
Diraco, Giovanni ;
Rescio, Gabriele ;
Leone, Alessandro .
BIOMIMETICS, 2025, 10 (04)
[39]   Sensitive Information for Privacy on Social Networks [J].
Wang, Ruby Ching-Ying ;
Wang, Rui Yi ;
Tai, Chih-Hua ;
Yang, De-Nian .
2016 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2016, :46-51
[40]   Enhancing Federated Learning Security with a Defense Framework Against Adversarial Attacks in Privacy-Sensitive Healthcare Applications [J].
Ayensu, Frederick ;
Turner, Claude ;
Osunmakinde, Isaac .
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (05) :1-13