Arabic Offensive Language Classification on Twitter

Cited by: 14
Authors
Mubarak, Hamdy [1 ]
Darwish, Kareem [1 ]
Affiliations
[1] HBKU, Qatar Comp Res Inst, Doha, Qatar
Source
SOCIAL INFORMATICS, SOCINFO 2019 | 2019 / Vol. 11864
Keywords
Offensive language; Obscenities; Text classification;
DOI
10.1007/978-3-030-34971-4_18
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Social media users often employ offensive language in their communication. Detecting offensive language on Twitter has many applications, ranging from detecting or predicting conflict to measuring polarization. In this paper, we focus on building an effective offensive tweet detector. We show that a training set can be built rapidly using a seed list of offensive words. Using this automatically created dataset, we train a character n-gram based deep learning classifier that classifies tweets with an F1 score of 90%. We also show that the offensive word list can be expanded by contrasting offensive and non-offensive tweets.
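The pipeline the abstract outlines (seed-list labeling, character n-gram features, and word-list expansion by contrast) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function names, the English placeholder words standing in for Arabic text, and the smoothed log-odds score used for contrasting are not taken from the paper, and the deep learning classifier itself is not reproduced.

```python
import math
from collections import Counter

# Hypothetical seed list; English placeholders stand in for Arabic words.
SEED_WORDS = {"idiot", "stupid"}


def weak_label(tweet, seed_words):
    """Return 1 if any whitespace token of the tweet is in the seed list, else 0."""
    tokens = tweet.lower().split()
    return int(any(tok in seed_words for tok in tokens))


def char_ngrams(text, n_min=2, n_max=4):
    """Return all character n-grams of the space-padded text for n in [n_min, n_max]."""
    padded = f" {text.lower()} "
    return [
        padded[i:i + n]
        for n in range(n_min, n_max + 1)
        for i in range(len(padded) - n + 1)
    ]


def contrast_scores(tweets, labels, smoothing=1.0):
    """Score each word by the smoothed log-odds of occurring in offensive
    versus non-offensive tweets (an assumed measure, not the paper's)."""
    off_counts, non_counts = Counter(), Counter()
    for tweet, label in zip(tweets, labels):
        # Count each word at most once per tweet (document frequency).
        target = off_counts if label else non_counts
        target.update(set(tweet.lower().split()))
    n_off = sum(labels)
    n_non = len(labels) - n_off
    scores = {}
    for word in set(off_counts) | set(non_counts):
        p_off = (off_counts[word] + smoothing) / (n_off + 2 * smoothing)
        p_non = (non_counts[word] + smoothing) / (n_non + 2 * smoothing)
        scores[word] = math.log(p_off / p_non)
    return scores


def expansion_candidates(scores, seed_words, top_k=5):
    """Return the highest-scoring words that are not already in the seed list."""
    ranked = sorted(
        (w for w in scores if w not in seed_words),
        key=lambda w: scores[w],
        reverse=True,
    )
    return ranked[:top_k]
```

In use, `weak_label` would be applied to a large unlabeled tweet collection to produce training labels, `char_ngrams` would supply the features for whichever classifier is chosen, and `expansion_candidates` would surface new words for manual review before they are added to the seed list.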
Pages: 269-276
Number of pages: 8