Analysis and classification of privacy-sensitive content in social media posts

被引:0
|
作者
Livio Bioglio
Ruggero G. Pensa
机构
[1] University of Turin,
来源
EPJ Data Science | / 11卷
关键词
Privacy; Text classification; Content analysis;
D O I
暂无
中图分类号
学科分类号
摘要
User-generated contents often contain private information, even when they are shared publicly on social media and on the web in general. Although many filtering and natural language approaches for automatically detecting obscenities or hate speech have been proposed, determining whether a shared post contains sensitive information is still an open issue. The problem has been addressed by assuming, for instance, that sensitive contents are published anonymously, on anonymous social media platforms or with more restrictive privacy settings, but these assumptions are far from being realistic, since the authors of posts often underestimate or overlook their actual exposure to privacy risks. Hence, in this paper, we address the problem of content sensitivity analysis directly, by presenting and characterizing a new annotated corpus with around ten thousand posts, each one annotated as sensitive or non-sensitive by a pool of experts. We characterize our data with respect to the closely-related problem of self-disclosure, pointing out the main differences between the two tasks. We also present the results of several deep neural network models that outperform previous naive attempts of classifying social media posts according to their sensitivity, and show that state-of-the-art approaches based on anonymity and lexical analysis do not work in realistic application scenarios.
引用
收藏
相关论文
共 50 条
  • [1] Analysis and classification of privacy-sensitive content in social media posts
    Bioglio, Livio
    Pensa, Ruggero G.
    EPJ DATA SCIENCE, 2022, 11 (01)
  • [2] Effects of Social Behaviors of Robots in Privacy-Sensitive Situations
    Yang, Daseul
    Chae, Yu-Jung
    Kim, Doogon
    Lim, Yoonseob
    Kim, Dong Hwan
    Kim, ChangHwan
    Park, Sung-Kee
    Nam, Changjoo
    INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2022, 14 (02) : 589 - 602
  • [3] Effects of Social Behaviors of Robots in Privacy-Sensitive Situations
    Daseul Yang
    Yu-Jung Chae
    Doogon Kim
    Yoonseob Lim
    Dong Hwan Kim
    ChangHwan Kim
    Sung-Kee Park
    Changjoo Nam
    International Journal of Social Robotics, 2022, 14 : 589 - 602
  • [4] Privacy-Sensitive Congestion Charging
    Beresford, Alastair R.
    Davies, Jonathan J.
    Harle, Robert K.
    SECURITY PROTOCOLS, 2009, 5087 : 97 - 104
  • [5] A privacy-sensitive data identification model in online social networks
    Yi, Yuzi
    Zhu, Nafei
    He, Jingsha
    Jurcut, Anca Delia
    Ma, Xiangjun
    Luo, Yehong
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2024, 35 (01):
  • [6] Privacy-Sensitive Data in Connected Cars
    Nawrath, T.
    Fischer, D.
    Markscheffel, B.
    2016 11TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2016, : 392 - 393
  • [7] Towards Privacy-Sensitive Participatory Sensing
    Huang, Kuan Lun
    Kanhere, Salil S.
    Hu, Wen
    2009 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS (PERCOM), VOLS 1 AND 2, 2009, : 637 - +
  • [8] Manifestations of Depression on Social Media: a Content Analysis of Twitter Posts
    Tambling R.R.
    D’Aniello - Heyda C.
    Hynes K.C.
    Journal of Technology in Behavioral Science, 2024, 9 (2) : 252 - 261
  • [9] Compressed and Privacy-Sensitive Sparse Regression
    Zhou, Shuheng
    Lafferty, John
    Wasserman, Larry
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2009, 55 (02) : 846 - 866
  • [10] A privacy-sensitive approach to distributed clustering
    Merugu, S
    Ghosh, J
    PATTERN RECOGNITION LETTERS, 2005, 26 (04) : 399 - 410