A Streaming Machine Learning Framework for Online Aggression Detection on Twitter

被引:5
|
作者
Herodotou, Herodotos [1 ]
Chatzakou, Despoina [2 ]
Kourtellis, Nicolas [3 ]
机构
[1] Cyprus Univ Technol, Limassol, Cyprus
[2] Ctr Res & Technol Hellas, Thessaloniki, Greece
[3] Telefon Res, Barcelona, Spain
来源
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2020年
关键词
online aggression detection; streaming machine learning; social media;
D O I
10.1109/BigData50022.2020.9377980
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rise of online aggression on social media is evolving into a major point of concern. Several machine and deep learning approaches have been proposed recently for detecting various types of aggressive behavior. However, social media are fast paced, generating an increasing amount of content, while aggressive behavior evolves over time. In this work, we introduce the first, practical, real-time framework for detecting aggression on Twitter via embracing the streaming machine learning paradigm. Our method adapts its ML classifiers in an incremental fashion as it receives new annotated examples and is able to achieve the same (or even higher) performance as batch-based ML models, with over 90% accuracy, precision, and recall. At the same time, our experimental analysis on real Twitter data reveals how our framework can easily scale to accommodate the entire Twitter Firehose (of 778 million tweets per day) with only 3 commodity machines. Finally, we show that our framework is general enough to detect other related behaviors such as sarcasm, racism, and sexism in real time.
引用
收藏
页码:5056 / 5067
页数:12
相关论文
共 50 条
  • [1] Rumor Detection on Twitter Using a Supervised Machine Learning Framework
    Thakur, Hardeo Kumar
    Gupta, Anand
    Bhardwaj, Ayushi
    Verma, Devanshi
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2018, 8 (03) : 1 - 13
  • [2] Extending a Distributed Online Machine Learning Framework for Streaming Video Analysis
    Tsuji, Yusuke
    Huang, Hung-Hsuan
    Kawagoe, Kyoji
    2013 SECOND IIAI INTERNATIONAL CONFERENCE ON ADVANCED APPLIED INFORMATICS (IIAI-AAI 2013), 2013, : 279 - 283
  • [3] Extended Framework and Evaluation for Multivariate Streaming Anomaly Detection with Machine Learning
    Koch, Andreas
    Petry, Michael
    Werner, Martin
    2024 IEEE 40TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, ICDEW, 2024, : 144 - 152
  • [4] A Framework of Online Learning with Imbalanced Streaming Data
    Yan, Yan
    Yang, Tianbao
    Yang, Yi
    Chen, Jianhui
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2817 - 2823
  • [5] MACORD: Online Adaptive Machine Learning Framework for Silent Error Detection
    Subasi, Omer
    Di, Sheng
    Balaprakash, Prasanna
    Unsal, Osman
    Labarta, Jesus
    Cristal, Adrian
    Krishnamoorthy, Sriram
    Cappello, Franck
    2017 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2017, : 717 - 724
  • [6] Online Phishing Detection: A Heuristic-Based Machine Learning Framework
    Elgharbi, Salah Eddine
    Yahia, Messaoud Ait
    Ouchani, Samir
    2024 13TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING, MECO 2024, 2024, : 302 - 305
  • [7] Machine Learning for the Detection of Spam in Twitter Networks
    Wang, Alex Hai
    E-BUSINESS AND TELECOMMUNICATIONS, 2012, 222 : 319 - 333
  • [8] A Machine Learning Approach for Twitter Spammers Detection
    Meda, Claudia
    Bisio, Federica
    Gastaldo, Paolo
    Zunino, Rodolfo
    2014 INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY (ICCST), 2014,
  • [9] HTTP Adaptive Streaming Framework with Online Reinforcement Learning
    Kang, Jeongho
    Chung, Kwangsue
    APPLIED SCIENCES-BASEL, 2022, 12 (15):