Improving Object and Event Monitoring on Twitter Through Lexical Analysis and User Profiling

被引:4
|
作者
Zhang, Yihong [1 ]
Szabo, Claudia [1 ]
Sheng, Quan Z. [1 ]
机构
[1] Univ Adelaide, Sch Comp Sci, Adelaide, SA 5005, Australia
来源
WEB INFORMATION SYSTEMS ENGINEERING - WISE 2016, PT II | 2016年 / 10042卷
关键词
Twitter; Microblog content classification; User profiling;
D O I
10.1007/978-3-319-48743-4_2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Personal users on Twitter frequently post observations about their immediate environment as part of the 500 million tweets posted everyday. These observations and their implicitly associated time and location data are a valuable source of information for monitoring objects and events, such as earthquake, hailstorm, and shooting incidents. However, given the informal and uncertain expressions used in personal Twitter messages, and the various type of accounts existing on Twitter, capturing personal observations of objects and events is challenging. In contrast to the existing supervised approaches, which require significant efforts for annotating examples, in this paper, we propose an unsupervised approach for filtering personal observations. Our approach employs lexical analysis, user profiling and classification components to significantly improve filtering precision. To identify personal accounts, we define and compute a mean user profile for a dataset and employ distance metrics to evaluate the similarity of the user profiles under analysis to the mean. Our extensive experiments with real Twitter data show that our approach consistently improves filtering precision of personal observations by around 22 %.
引用
收藏
页码:19 / 34
页数:16
相关论文
共 11 条
  • [11] Comprehensive Analysis of Personalized Web Search Engines Through Information Retrieval Feedback System and User Profiling
    Makvana, Kamlesh
    Patel, Jay
    Shah, Parth
    Thakkar, Amit
    ADVANCED INFORMATICS FOR COMPUTING RESEARCH, PT II, 2019, 956 : 155 - 164