Redefining Event Detection and Information Dissemination: Lessons from X (Twitter) Data Streams and Beyond

Cited by: 0
Authors
Srivastava, Harshit [1]
Sankar, Ravi [1]
Affiliations
[1] Univ S Florida, Dept Elect Engn, iCONS Lab, Tampa, FL 33630 USA
Keywords
social data analytics; natural language processing; social computing; event detection; cooperative learning
DOI
10.3390/computers14020042
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
X (formerly known as Twitter), Reddit, and other social media forums have dramatically changed the way society interacts with live events. The huge amount of data generated by these platforms presents challenges, especially in terms of processing speed and the complexity of finding meaningful patterns and events. These data streams arrive in multiple formats, are constantly updated, and are real-time in nature; thus, they require sophisticated algorithms capable of detecting events in this dynamic environment. Event detection techniques have recently seen substantial development, but most research carried out so far evaluates only single methods, without comparing the overall performance of these methods across multiple platforms and types of data. With that in view, this paper presents an in-depth investigation of state-of-the-art event detection algorithms specifically customized for streams of data from X. We review various current techniques based on a thorough comparative performance test and point to problems inherently related to the detection of patterns in noisy, high-velocity streams. We introduce some novelty to this research area, supported by robust experimental frameworks, to perform comparisons quantitatively and qualitatively. We provide insight into how those algorithms perform under varying conditions by defining a set of clear, measurable metrics. Our findings contribute new knowledge that will help inform future research into the improvement of event detection systems for dynamic data streams and enhance their capabilities for real-time, actionable insights. This paper goes a step beyond the present knowledge of event detection and discusses how algorithms can be adapted and refined in view of the emerging demands imposed by data streams.
Pages: 19