A Streaming Machine Learning Framework for Online Aggression Detection on Twitter

Cited by: 5

Authors:

Herodotou, Herodotos [1]

Chatzakou, Despoina [2]

Kourtellis, Nicolas [3]

Affiliations:

[1] Cyprus University of Technology, Limassol, Cyprus

[2] Centre for Research and Technology Hellas, Thessaloniki, Greece

[3] Telefonica Research, Barcelona, Spain

Source:

2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2020

Keywords:

online aggression detection; streaming machine learning; social media

DOI:

10.1109/BigData50022.2020.9377980

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

The rise of online aggression on social media is evolving into a major point of concern. Several machine and deep learning approaches have been proposed recently for detecting various types of aggressive behavior. However, social media are fast paced, generating an increasing amount of content, while aggressive behavior evolves over time. In this work, we introduce the first, practical, real-time framework for detecting aggression on Twitter via embracing the streaming machine learning paradigm. Our method adapts its ML classifiers in an incremental fashion as it receives new annotated examples and is able to achieve the same (or even higher) performance as batch-based ML models, with over 90% accuracy, precision, and recall. At the same time, our experimental analysis on real Twitter data reveals how our framework can easily scale to accommodate the entire Twitter Firehose (of 778 million tweets per day) with only 3 commodity machines. Finally, we show that our framework is general enough to detect other related behaviors such as sarcasm, racism, and sexism in real time.
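The incremental adaptation described in the abstract can be illustrated with a minimal sketch. This is not the authors' framework or its classifiers: it assumes a plain online logistic regression over bag-of-words tokens, evaluated in test-then-train (prequential) order, and every name in it (`OnlineLogisticRegression`, `learn_one`, `prequential_accuracy`, the toy stream) is hypothetical and chosen only for illustration.

```python
import math


class OnlineLogisticRegression:
    """Binary logistic regression updated one labeled example at a time."""

    def __init__(self, learning_rate=0.5):
        self.learning_rate = learning_rate
        self.weights = {}  # token -> weight, grows as new tokens arrive
        self.bias = 0.0

    @staticmethod
    def featurize(text):
        # Assumed feature extraction: the set of lowercased tokens.
        return set(text.lower().split())

    def predict_proba(self, text):
        z = self.bias + sum(self.weights.get(t, 0.0) for t in self.featurize(text))
        z = max(-30.0, min(30.0, z))  # clamp so exp() stays well-behaved
        return 1.0 / (1.0 + math.exp(-z))

    def predict(self, text):
        return 1 if self.predict_proba(text) >= 0.5 else 0

    def learn_one(self, text, label):
        # One stochastic gradient step on the log-loss for this example.
        error = label - self.predict_proba(text)
        for token in self.featurize(text):
            self.weights[token] = self.weights.get(token, 0.0) + self.learning_rate * error
        self.bias += self.learning_rate * error


def prequential_accuracy(model, stream):
    """Test-then-train: predict each example first, then learn from it."""
    correct = total = 0
    for text, label in stream:
        correct += int(model.predict(text) == label)
        total += 1
        model.learn_one(text, label)
    return correct / total if total else 0.0


# Made-up toy stream, not real Twitter data (1 = aggressive, 0 = normal).
toy_stream = [
    ("you are an idiot", 1),
    ("have a nice day", 0),
    ("shut up idiot", 1),
    ("what a nice photo", 0),
] * 5

model = OnlineLogisticRegression()
accuracy = prequential_accuracy(model, toy_stream)
print(f"prequential accuracy on the toy stream: {accuracy:.2f}")
```

Because the model never revisits old examples, memory use depends on the vocabulary seen so far, not on the length of the stream. For a sense of scale, the 778 million tweets per day quoted in the abstract correspond to roughly 9,000 tweets per second on average.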


Pages: 5056-5067

Page count: 12

