Arabic Offensive Language Classification on Twitter

被引:14
|
作者
Mubarak, Hamdy [1 ]
Darwish, Kareem [1 ]
机构
[1] HBKU, Qatar Comp Res Inst, Doha, Qatar
来源
SOCIAL INFORMATICS, SOCINFO 2019 | 2019年 / 11864卷
关键词
Offensive language; Obscenities; Text classification;
D O I
10.1007/978-3-030-34971-4_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media users often employ offensive language in their communication. Detecting offensive language on Twitter has many applications ranging from detecting/predicting conflict to measuring polarization. In this paper, we focus on building effective offensive tweet detection. We show that we can rapidly build a training set using a seed list of offensive words. Given the automatically created dataset, we trained a character n-gram based deep learning classifier that can effectively classify tweets with F1 score of 90%. We also show that we can expand our offensive word list by contrasting offensive and non-offensive tweets.
引用
收藏
页码:269 / 276
页数:8
相关论文
共 50 条
  • [31] Offensive language
    Barreda, Rene
    FORTUNE, 2007, 155 (03) : 13 - 13
  • [32] OFFENSIVE LANGUAGE
    DIDION, C
    TRAINING AND DEVELOPMENT JOURNAL, 1986, 40 (09): : 6 - 6
  • [33] Offensive language?
    Pilgrim, David
    PSYCHOLOGIST, 2009, 22 (08) : 654 - 654
  • [34] Offensive language
    Arana, P
    JOURNAL OF THE AMERICAN DENTAL ASSOCIATION, 1997, 128 (02): : 150 - 150
  • [35] Offensive Language
    Olson, Bennett
    DOWN BEAT, 2010, 77 (04): : 10 - 10
  • [36] High dimensional autonomous computing on Arabic language classification
    Rady, George Samy
    Mohamed, Sara Salah
    Mohamed, Mamdouh Farouk
    Hussain, Khaled F.
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 100
  • [37] Automatic offensive language detection from Twitter data using machine learning and feature selection of metadata
    De Souza, Gabriel Araujo
    Da Costa-Abreu, Marjory
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [38] ExaAUAC: Arabic Twitter user age prediction corpus based on language and metadata features
    Sadeghi R.
    Akbari A.
    Jaziriyan M.M.
    Discover Artificial Intelligence, 2024, 4 (01):
  • [39] Arabic Sarcasm Detection in Twitter
    Al-Ghadhban, Dana
    Alnkhilan, Eman
    Tatwany, Lamma
    Alrazgan, Muna
    2017 INTERNATIONAL CONFERENCE ON ENGINEERING & MIS (ICEMIS), 2017,
  • [40] Arabic Twitter Profiling For Arabic-Speaking Users
    Alhozaimi, Amani
    Almishari, Mishari
    2018 21ST SAUDI COMPUTER SOCIETY NATIONAL COMPUTER CONFERENCE (NCC), 2018,